
Abstract

Searching for approximate nearest neighbors (ANN) in high-dimensional Euclidean space is a pivotal problem. Recently, with the help of fast SIMD-based implementations, Product Quantization (PQ) and its variants can often estimate the distances between vectors efficiently and accurately, and have achieved great success in in-memory ANN search. Despite their empirical success, we note that these methods do not have a theoretical error bound and are observed to fail disastrously on some real-world datasets. Motivated by this, we propose a new randomized quantization method named RaBitQ, which quantizes $D$-dimensional vectors into $D$-bit strings. RaBitQ guarantees a sharp theoretical error bound and provides good empirical accuracy at the same time. In addition, we introduce efficient implementations of RaBitQ, which support estimating the distances with bitwise operations or SIMD-based operations. Extensive experiments on real-world datasets confirm that (1) our method outperforms PQ and its variants in terms of accuracy-efficiency trade-off by a clear margin and (2) its empirical performance is well-aligned with our theoretical analysis.

Figure: Time-accuracy trade-off in distance approximation for different baseline methods and their computational efficiencies.

Overview

  • RaBitQ is a novel quantization method for high-dimensional vectors that guarantees a sharp theoretical error bound, achieved through randomized transformations and bitwise operations.

  • The method quantizes vectors into bit strings, constructs a randomized codebook, and simplifies distance computation to inner product estimation, ensuring an error bound of O(1/√D).

  • RaBitQ demonstrates superior accuracy and efficiency over existing methods like PQ, OPQ, and HNSW, making it ideal for applications requiring robust and consistent performance in approximate nearest neighbor search.

RaBitQ: A New Approach to Quantization with Theoretical Error Bounds

Searching for approximate nearest neighbors (ANN) in high-dimensional spaces is crucial for applications like recommendation engines, image retrieval, and data mining. However, the curse of dimensionality makes exact searches impractical, pushing researchers to develop methods like Product Quantization (PQ). PQ and its variants are widely recognized for achieving efficient and accurate ANN search with fast SIMD-based implementations. Yet these methods lack a theoretical error bound, making their performance unpredictable on unseen data. This is where RaBitQ comes into play.

What is RaBitQ?

RaBitQ is a novel quantization method for high-dimensional vectors which uniquely guarantees a sharp theoretical error bound. The key innovation lies in the way RaBitQ constructs its quantization codebook and estimates distances. It uses randomized transformations and bitwise operations to achieve highly efficient and accurate distance estimations between vectors.

The Core Idea

RaBitQ quantizes $D$-dimensional vectors into $D$-bit strings. Its randomized transformations ensure that the quantization codebook covers the high-dimensional space evenly. The method proceeds in three steps (a code sketch follows the list):

  1. Codebook Construction: RaBitQ starts from a set of bi-valued vectors (with entries $-1/\sqrt{D}$ or $+1/\sqrt{D}$) that are spread evenly over the unit hypersphere; a random orthogonal matrix is then applied to rotate this set, which yields a randomized codebook.
  2. Normalization: Both data and query vectors are normalized to lie on the unit hypersphere, which reduces distance computation to inner product estimation.
  3. Distance Estimation: RaBitQ estimates the inner products between the normalized vectors with an unbiased estimator whose error bound is $O(1/\sqrt{D})$, which is asymptotically optimal according to existing theoretical results.
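To make these steps concrete, here is a minimal NumPy sketch of the pipeline as described above. It is illustrative rather than the authors' reference implementation: the rotation matrix `P`, the helper names, and the rescaling of $\langle \bar{x}, q \rangle$ by $\langle \bar{x}, o \rangle$ in the estimator are assumptions based on this summary, and the fast bitwise/SIMD query-side implementation is omitted entirely.

```python
import numpy as np

rng = np.random.default_rng(0)
D = 128  # dimensionality

# Random orthogonal matrix: QR decomposition of a Gaussian matrix.
P, _ = np.linalg.qr(rng.standard_normal((D, D)))

def quantize(o):
    """Quantize a unit vector o into a D-bit string (0/1 array):
    the sign pattern of the rotated vector selects the nearest codeword."""
    return (P.T @ o > 0).astype(np.uint8)

def codeword(bits):
    """Reconstruct the rotated bi-valued codeword with entries +/- 1/sqrt(D)."""
    return P @ ((2.0 * bits - 1.0) / np.sqrt(D))

def estimate_inner_product(bits, o, q):
    """Estimate <o, q> from the code: <x_bar, q> rescaled by <x_bar, o>.
    The factor <x_bar, o> can be precomputed and stored at indexing time."""
    x_bar = codeword(bits)
    return np.dot(x_bar, q) / np.dot(x_bar, o)

# Toy usage: both vectors normalized onto the unit hypersphere.
o = rng.standard_normal(D); o /= np.linalg.norm(o)
q = rng.standard_normal(D); q /= np.linalg.norm(q)
bits = quantize(o)
print("true <o, q>     :", float(np.dot(o, q)))
print("estimated <o, q>:", float(estimate_inner_product(bits, o, q)))
```

Since both vectors are unit-norm, squared Euclidean distances follow directly from $\|o - q\|^2 = 2 - 2\langle o, q \rangle$, which is why estimating the inner product suffices.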

Comparison with Other Methods

Efficiency and Accuracy

The authors compared RaBitQ to PQ and its variants (like OPQ and LSQ) using extensive experiments on public datasets. Key observations include:

  • RaBitQ vs. PQ and OPQ: RaBitQ consistently outperformed both PQ and OPQ in accuracy at a given time budget. Even with shorter quantization codes, RaBitQ maintained lower errors and higher accuracy.
  • Error Stability: Unlike PQ and OPQ, RaBitQ kept its maximum relative error at around 40% or below, highlighting its robustness.
  • Indexing Time: Both RaBitQ and PQ completed the indexing phase in a reasonable amount of time (a few minutes), while LSQ timed out, proving impractical for large datasets.

ANN Search Performance

When applied to ANN search with an Inverted File Index (IVF), RaBitQ provided high recall and accuracy without the need for parameter tuning:

  • Parameter-Free Re-Ranking: RaBitQ's rigorous error bounds eliminated the need for the empirical tuning of re-ranking parameters that PQ and OPQ often require (a sketch of this re-ranking logic follows the list).
  • Comparison with HNSW: RaBitQ also showed superior performance compared to HNSW, a leading graph-based ANN method, across various datasets.
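
To illustrate how an error bound can drive re-ranking without a tuned parameter, here is a hedged sketch. The helpers `estimate` (returning an estimated distance plus an error bound for a candidate's quantized code) and `exact` (full-precision distance) are hypothetical placeholders, not the paper's API; the point is only that a candidate is re-ranked when its lower bound could still enter the current top-k.

```python
import heapq
import numpy as np

def search_with_reranking(candidate_ids, query, k, estimate, exact):
    """Hypothetical top-k loop: re-rank a candidate with the exact distance
    only if its error-bound-based lower bound could beat the current k-th best."""
    heap = []  # max-heap via negated exact distances, holds the running top-k
    for cid in candidate_ids:
        est_dist, err = estimate(cid, query)       # from the quantized code
        kth_best = -heap[0][0] if len(heap) == k else np.inf
        if est_dist - err < kth_best:              # bound cannot rule it out
            d = exact(cid, query)                  # full-precision re-ranking
            if len(heap) < k:
                heapq.heappush(heap, (-d, cid))
            elif d < -heap[0][0]:
                heapq.heapreplace(heap, (-d, cid))
    return sorted((-neg, cid) for neg, cid in heap)
```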

Practical Implications

RaBitQ’s sharp error bounds make it particularly appealing for applications requiring reliable and consistent performance. Data scientists can use RaBitQ for:

  • Robust ANN Search: Achieve stable and accurate nearest neighbor search without extensive parameter tuning.
  • High Computational Efficiency: Benefit from efficient distance estimation using bitwise or SIMD-based operations (a bitwise sketch follows this list).
  • Theoretical Guarantees: Ensure predictable and reliable performance with rigorous error bounds.
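
As a hedged illustration of the bitwise path: if the query were also binarized to sign bits, the inner product between two $\pm 1/\sqrt{D}$ codewords would reduce to an XOR and a popcount. The implementations in the paper quantize the query more finely and use SIMD, so this is a simplification for intuition only.

```python
import numpy as np

def bitwise_inner_product(code_x, code_y, D):
    """Inner product of two bi-valued (+/- 1/sqrt(D)) vectors from their
    packed sign bits: <x, y> = (D - 2 * popcount(x XOR y)) / D."""
    mismatches = int(np.unpackbits(np.bitwise_xor(code_x, code_y)).sum())
    return (D - 2 * mismatches) / D

# Toy usage: pack random sign bits into bytes and compare.
rng = np.random.default_rng(1)
D = 128
bits_x = rng.integers(0, 2, D, dtype=np.uint8)
bits_y = rng.integers(0, 2, D, dtype=np.uint8)
print(bitwise_inner_product(np.packbits(bits_x), np.packbits(bits_y), D))
```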

Future Directions

While RaBitQ already shows promise, it can be extended and integrated with other ANN methods:

  • Graph-Based Indexes: Combining RaBitQ with graph-based indexes like NGT-QG could yield even better performance, although it would require careful handling of the inherent complexities.
  • Applications in Neural Networks: RaBitQ can potentially be used for neural network quantization and maximum inner product search, offering significant advantages in model compression and fast inference.

In sum, RaBitQ stands out by combining strong theoretical foundations with practical, high-performance implementations, making it a valuable tool for tackling high-dimensional ANN search challenges.
