Approximate Nearest Neighbor Search Using Query-Directed Dense Graph

Hongya Wang; Zeng Zhao; Kaixiang Yang; Hui Song; Yingyuan Xiao

Published: 01 Jan 2021, Last Modified: 30 Sept 2024. Venue: DASFAA (Workshops) 2021. License: CC BY-SA 4.0

Abstract: High-dimensional approximate nearest neighbor search (ANNS) has drawn much attention for decades due to its importance in machine learning and massive data processing. Recently, graph-based ANNS methods have become increasingly popular thanks to their outstanding search performance. While graph-based methods differ in their graph construction strategies, the widely accepted principle is to make the graph as sparse as possible to reduce the search cost. In this paper, we observe that sparse graphs incur significant search cost in the high-recall regime (recall close or equal to 100%). To address this, we propose to judiciously control the minimum angle between the neighbors of each point so as to create denser graphs. To reduce the search cost, we perform K-means clustering on the neighbors of each point using cosine similarity and evaluate only the neighbors whose cluster centroids are close to the query in angular similarity, i.e., query-directed search. A PQ-like method is adopted to optimize the space and time cost of evaluating the similarity between the centroids and the query. Extensive experiments on a collection of real-life datasets show that a speedup of up to 2.2x is achieved in the high-recall regime.
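
The query-directed step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the farthest-first seeding, the choice to cluster directions measured from the point to its neighbors, and the `top_clusters` parameter are all assumptions made for illustration. Exact cosine similarity is used in place of the PQ-like approximation mentioned in the abstract.

```python
import math


def normalize(v):
    """Scale v to unit length; a zero vector is returned unchanged."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v] if norm > 0 else list(v)


def dot(u, v):
    """Dot product; equals cosine similarity when both inputs are unit vectors."""
    return sum(a * b for a, b in zip(u, v))


def nearest_centroid(direction, centroids):
    """Index of the centroid with the highest cosine similarity to direction."""
    return max(range(len(centroids)), key=lambda c: dot(direction, centroids[c]))


def cluster_neighbor_directions(point, neighbors, k, iters=10):
    """K-means under cosine similarity over unit directions point -> neighbor.

    Assumption: directions are taken relative to the point; the abstract does
    not specify this detail. Requires 1 <= k <= len(neighbors).
    """
    dirs = [normalize([n - p for n, p in zip(nb, point)]) for nb in neighbors]
    # Deterministic farthest-first seeding (hypothetical choice, for reproducibility).
    centroids = [dirs[0]]
    while len(centroids) < k:
        centroids.append(min(dirs, key=lambda d: max(dot(d, c) for c in centroids)))
    for _ in range(iters):
        labels = [nearest_centroid(d, centroids) for d in dirs]
        for c in range(k):
            members = [d for d, lab in zip(dirs, labels) if lab == c]
            if members:
                # New centroid: normalized sum of the member directions.
                centroids[c] = normalize([sum(col) for col in zip(*members)])
    # Final assignment so the labels match the returned centroids.
    labels = [nearest_centroid(d, centroids) for d in dirs]
    return centroids, labels


def query_directed_neighbors(point, query, centroids, labels, top_clusters=1):
    """Indices of neighbors whose cluster centroid best matches the query direction."""
    q_dir = normalize([q - p for q, p in zip(query, point)])
    ranked = sorted(range(len(centroids)),
                    key=lambda c: dot(q_dir, centroids[c]), reverse=True)
    keep = set(ranked[:top_clusters])
    return [i for i, lab in enumerate(labels) if lab in keep]
```

In a graph search, a routine like `query_directed_neighbors` would be called at each visited point so that distance computations are spent only on the neighbors lying roughly in the direction of the query. The centroids and labels would be computed once per point at index construction time.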
