basis embedding: a product quantization based model compression method for language models.
-
Updated
Oct 29, 2024 - Python
basis embedding: a product quantization based model compression method for language models.
Vector search using embeddings, FAISS and Product Quantization with custom index & KMeans
Implementations of different NLP tasks
[ECCV'24] Differentiable Product Quantization for Memory Efficient Camera Relocalization
WSDM'22 Best Paper: Learning Discrete Representations via Constrained Clustering for Effective and Efficient Dense Retrieval
Generalized Product Quantization Network For Semi-supervised Image Retrieval - CVPR 2020
Self-supervised Product Quantization for Deep Unsupervised Image Retrieval - ICCV2021
product quantization image processing algorithm with matlab
This repo includes a Sparse Transformer implementation which utilizes PQ to derive the sparsity.
Inverted file system for billion-scale ANN search
Array quantization and compression
Fast and memory-efficient ANN with a subset-search functionality
Transformer-based Embedding Retrieval with Product Quantization for Edge Computing (JavaScript)
Pure python implementation of product quantization for nearest neighbor search
Some useful tips for faiss
Fast and memory-efficient clustering
⚡ A fast embedded library for approximate nearest neighbor search
Scene similarity for weak object discovery & classification
Orthonormal Product Quantization Network for Scalable Face Image Retrieval
A tiny approximate K-Nearest Neighbour library in Python based on Fast Product Quantization and IVF
Add a description, image, and links to the product-quantization topic page so that developers can more easily learn about it.
To associate your repository with the product-quantization topic, visit your repo's landing page and select "manage topics."