Similarity score treshold not working - langchain

Artic · May 17, 2024, 11:18am

    # Set up the conversational retrieval chain
    chain = ConversationalRetrievalChain.from_llm(
        ChatOpenAI(model_name="gpt-4o", temperature=0, streaming=True),
        chain_type="stuff",
        retriever=docsearch.as_retriever(
        search_type="similarity_score_threshold",
        search_kwargs={'score_threshold': 0.8}
        ),
        memory=memory,
        return_source_documents=True,
    )

I am trying to return only documents with similarity score > 0.8 but for some reason this is not working, it’s always returning 5 results (i guess default is k:5). Does anyone have idea how to make this to work?

ZacharyProser · May 17, 2024, 11:23am

Hi @Artic,

Welcome to the Pinecone forums and thanks for your question!

Could you please share the code you’re using to index the documents?

On your Pinecone dashboard, how many vectors do you see and what happens if you query results directly from your console - do you see expected results?

Best,
Zack

Artic · May 17, 2024, 12:54pm

Hi Zack,

Thanks for your response! Here is the code I am using to index the documents in Pinecone:

# Initialize the database (create tables if they don't exist)
init_db()

# Assuming the namespace and embeddings are the same as in the original app
index_name= os.getenv('INDEX_NAME')
namespace = os.getenv('NAMESPACE')

embeddings = OpenAIEmbeddings()

# Connect to the Pinecone index
docsearch = Pinecone.from_existing_index(
    index_name=index_name, embedding=embeddings, namespace=namespace
)

In my Pinecone dashboard, I can see the expected number of vectors. However, when querying the results, it seems to always return 5 results regardless of the similarity score threshold I set. Any insights on this would be appreciated.

Artic · May 17, 2024, 12:59pm

For additional context, complete code is somewhat similar to this example:

github.com

Chainlit/cookbook/blob/main/pinecone/app.py

import os
from typing import List
from langchain.embeddings.openai import OpenAIEmbeddings
from langchain.vectorstores.pinecone import Pinecone
from langchain.chains import ConversationalRetrievalChain
from langchain.chat_models import ChatOpenAI
from langchain.memory import ChatMessageHistory, ConversationBufferMemory
from langchain.docstore.document import Document
import pinecone
import chainlit as cl

pinecone.init(
    api_key=os.environ.get("PINECONE_API_KEY"),
    environment=os.environ.get("PINECONE_ENV"),
)


index_name = "langchain-demo"

# Optional

This file has been truncated. show original

patrick1 · May 17, 2024, 1:15pm

We do not currently offer a minimum score filter. Pinecone queries will always return the top_k most similar records in the query result.

For more details, please see: Filtering by score