HQANN: Efficient and Robust Similarity Search for Hybrid Queries with Structured and Unstructured Constraints (2207.07940v1)

Published 16 Jul 2022 in cs.DB and cs.IR

Abstract: The in-memory approximate nearest neighbor search (ANNS) algorithms have achieved great success for fast high-recall query processing, but are extremely inefficient when handling hybrid queries with unstructured (i.e., feature vectors) and structured (i.e., related attributes) constraints. In this paper, we present HQANN, a simple yet highly efficient hybrid query processing framework which can be easily embedded into existing proximity graph-based ANNS algorithms. We guarantee both low latency and high recall by leveraging navigation sense among attributes and fusing vector similarity search with attribute filtering. Experimental results on both public and in-house datasets demonstrate that HQANN is 10x faster than the state-of-the-art hybrid ANNS solutions to reach the same recall quality and its performance is hardly affected by the complexity of attributes. It can reach 99\% recall@10 in just around 50 microseconds On GLOVE-1.2M with thousands of attribute constraints.

Authors (6)

Wei Wu (482 papers)
Junlin He (7 papers)
Yu Qiao (563 papers)
Guoheng Fu (1 paper)
Li Liu (311 papers)
Jin Yu (39 papers)

Citations (9)

View on Semantic Scholar

Summary

We haven't generated a summary for this paper yet.

Summarize Now

YouTube

Show All Videos

HQANN: Efficient and Robust Similarity Search for Hybrid Queries with Structured and Unstructured Constraints (2207.07940v1)

Summary

Related Papers

YouTube