Single-index filtering vs. Categorical HNSW

mgoldenbe · October 27, 2023, 10:57am

I am trying to decide on the right platform for my application. In my application, a user will need to be able to search his documents. The solution needs to scale to thousands of users and millions of documents per user. It should have memory requirements proportional to the number of documents for one user, not the overall number of documents of all users.

Here is my question. Is single-index filtering in Pinecone similar to Categorical HNSW in Qdrant (i.e. adding edges that connect vertices corresponding to the same meta-value)?

tim · October 28, 2023, 4:42am

Pinecone doesnt even use HNSW (which can have these performance issues) it appears they use FreshDiskANN (Great Algorithms Are Not Enough | Pinecone) with some modifications I’m sure.

ARXIV: [2105.09613] FreshDiskANN: A Fast and Accurate Graph-Based ANN Index for Streaming Similarity Search

system · November 11, 2023, 4:43am

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.