Misleading documentation

DanyStinson · January 28, 2024, 7:43pm

First of all congrats on the docs, super useful.

Just one small mislead I caught:

In the distance metrics page, for euclidean it states:

This is used to calculate the distance between two data points in a plane. It is one of the most commonly used distance metric. For an example, see our [image similarity search example].

And in that section of the doc, it uses cosine similarity:

In this example, to compare embeddings, we will use the [cosine similarity score]because this model generates un-normalized probability vectors. While this calculation is trivial when comparing two vectors, it will take quite a long time when needing to compare a query vector against millions or billions of vectors and determine those most similar with the query vector.

jesse · January 29, 2024, 3:14pm

Thanks for reporting this, @DanyStinson. We’ve updated that page to link to a correct example for each distance metric.

system · January 30, 2024, 3:15pm

This topic was automatically closed 24 hours after the last reply. New replies are no longer allowed.