Let’s imagine I have these 5 vectors:
index.upsert([
("Andrew", [0.1, 0.1, 0.1, 0.1, 0.1, 0.1, 0.1, 0.1], {"genre": "comedy", "year": 2020}),
("Lulu", [0.2, 0.2, 0.2, 0.2, 0.2, 0.2, 0.2, 0.2], {"genre": "comedy", "year": 2021}),
("Love", [0.3, 0.3, 0.3, 0.3, 0.3, 0.3, 0.3, 0.3], {"genre": "comedy", "year": 2020}),
("Dog", [0.4, 0.4, 0.4, 0.4, 0.4, 0.4, 0.4, 0.4], {"genre": "comedy", "year": 1999}),
("Children", [0.5, 0.5, 0.5, 0.5, 0.5, 0.5, 0.5, 0.5], {"genre": "horror" })
], namespace="movies")
When I query pinecone by ID of “Andrew” and a top_k of 1 i get back
In fact, I need to set my top_k to 5 to get it back at all!
Similarly, if i query by a vector of [0.1, 0.1, 0.1, 0.1, 0.1, 0.1, 0.1, 0.1] I would expect to get back the “Andrew” vector. But I do not.
Anyone know why? It works the way I expect it to when I use euclidean, but it appears to be the least related result to my search when i use cosine or dot product.