Pinecone Query Embedding threshold for text-embedding-3-large and curbing dimension

gopalika.sharma · August 5, 2024, 2:58pm

Hello there!

I have a quick question about setting a similarity threshold for query embeddings with the text-embedding-3-large model.
Previously I was using the text-embedding-ada-002 model with an index dimension of 1536, and a query search threshold of 0.79 based on other benchmarks out there.

I switched to the new model but used the dimensions parameter so that its 3072-dimension embeddings fit into the 1536-dimension index, like below (sample code):

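    # Import assumed (langchain_openai); the original snippet did not show it
    from langchain_openai import AzureOpenAIEmbeddings

    embedding_model = AzureOpenAIEmbeddings(
        deployment="text-embedding-3-large",
        model="text-embedding-3-large",
        openai_api_base="xxx",
        openai_api_type="azure",
        dimensions=1536,
    )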

Then I embedded my query the same way, with the same text-embedding-3-large model (sample code):

    embedding_model = AzureOpenAIEmbeddings(
        deployment="text-embedding-3-large",
        openai_api_type="azure",
        dimensions=1536,
    )

I didn’t change the threshold and kept it at 0.79, but I got 0 hits, which was weird. Then, just to test, I changed it to 0.079 and got plenty of hits.

May I ask why this is happening? Do similarity scores from text-embedding-3-large range from 0 to 0.1, unlike ada-002, where they ranged from 0 to 1? Can I just use 0.079 instead?
Also, do you recommend forcing the dimensions to 1536 when embedding vectors into a Pinecone index with a 3072-dimension model?

ZacharyProser · August 20, 2024, 12:11pm

Hi @gopalika.sharma, and thanks for your question!

The core of the problem here lies in the compatibility of vectors output by different models.

If you’re using text-embedding-3-large and truncating dimensions (forcing it to use half the dimensions), that will likely result in some inaccuracy and poor retrieval performance.
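
To make the effect of shortening concrete, here is a minimal sketch using toy 4-dimensional vectors in place of real 3072-dimensional embeddings. It assumes that shortening behaves like keeping the leading components and rescaling to unit length; the helper names `shorten` and `cosine` are hypothetical and not part of any library.

```python
import math

def shorten(vec, dims):
    """Keep the first `dims` components and rescale to unit length (assumed behaviour)."""
    head = vec[:dims]
    norm = math.sqrt(sum(x * x for x in head))
    if norm == 0:
        raise ValueError("cannot normalize a zero vector")
    return [x / norm for x in head]

def cosine(a, b):
    """Cosine similarity between two vectors of equal length."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same dimension")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

# Toy stand-ins for full-size embeddings of two different texts
full_a = [0.5, 0.5, 0.5, 0.5]
full_b = [0.5, 0.5, -0.5, 0.5]

# The same pair after being shortened to half the dimensions
short_a = shorten(full_a, 2)
short_b = shorten(full_b, 2)

print("full similarity:     ", cosine(full_a, full_b))
print("shortened similarity:", cosine(short_a, short_b))
```

The same pair of texts scores differently before and after shortening, which is one reason a threshold tuned for one model and dimension setting should not be expected to carry over to another.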

To resolve this, be sure to re-embed all vectors in your index with the new model and ensure that both your query vector (that you send into Pinecone to search) and your indexed vectors are generated using the same embedding model and dimensions.
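
One way to guard against mixing models or dimensions is to record the embedding settings alongside the index and check them before every query. The sketch below is only an illustration: `EmbeddingSpec` and `check_query` are hypothetical helpers, not part of the Pinecone or LangChain APIs.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class EmbeddingSpec:
    """Hypothetical record of how vectors were produced."""
    model: str
    dimensions: int

def check_query(index_spec, query_spec, query_vector):
    """Raise ValueError if the query does not match the settings used for the index."""
    if query_spec != index_spec:
        raise ValueError(
            f"index was built with {index_spec}, but the query uses {query_spec}"
        )
    if len(query_vector) != index_spec.dimensions:
        raise ValueError(
            f"query vector has {len(query_vector)} dimensions, "
            f"expected {index_spec.dimensions}"
        )

# Settings the index was (re-)embedded with
index_spec = EmbeddingSpec(model="text-embedding-3-large", dimensions=1536)

# A query embedded with the same settings passes silently
check_query(index_spec, EmbeddingSpec("text-embedding-3-large", 1536), [0.0] * 1536)

# A query embedded with the old model is rejected before it reaches the index
try:
    check_query(index_spec, EmbeddingSpec("text-embedding-ada-002", 1536), [0.0] * 1536)
except ValueError as err:
    print("rejected:", err)
```

Note that the old and new vectors here have the same length, so a dimension check alone would not catch the mismatch; the model name has to be compared as well.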

You will want to continue experimenting with the relevance score to filter on - but I’d bet that things will improve significantly if you re-embed all your vectors with the same model.
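
As a rough sketch of that experimentation, you could score a handful of queries whose relevant and irrelevant matches you already know, then pick the cutoff that separates them best. The scores below are made up purely for illustration, and `best_threshold` is a hypothetical helper that maximizes F1; other criteria may suit your use case better.

```python
def best_threshold(scored):
    """Return (threshold, f1) maximizing F1 over (score, is_relevant) pairs."""
    total_relevant = sum(1 for _, relevant in scored if relevant)
    best = (None, -1.0)
    # Try every observed score as a candidate cutoff, lowest first
    for threshold in sorted({score for score, _ in scored}):
        kept = [relevant for score, relevant in scored if score >= threshold]
        true_positives = sum(1 for relevant in kept if relevant)
        if true_positives == 0:
            continue
        precision = true_positives / len(kept)
        recall = true_positives / total_relevant
        f1 = 2 * precision * recall / (precision + recall)
        if f1 > best[1]:
            best = (threshold, f1)
    return best

# Made-up similarity scores for hand-labelled matches (True = relevant)
labelled_scores = [
    (0.62, True),
    (0.58, True),
    (0.55, True),
    (0.52, False),
    (0.41, False),
    (0.38, False),
]

threshold, f1 = best_threshold(labelled_scores)
print(f"suggested threshold: {threshold} (F1 = {f1:.2f})")
```

Re-running this kind of calibration after changing the model or the dimensions gives you a threshold grounded in your own data instead of one borrowed from a different setup.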

By the way, you may want to use text-embedding-3-small if you want 1536 dimensions.

Hope this helps!

Best,
Zack