.as_retriever() is invalid in Pinecone?

Noob_Adveat · April 6, 2024, 9:57am

When I use a local Vector DB and FAISS, as_retriever() works fine, but if I try to use an Index I’ve stored on Pinecone, I am unable to feed it into my RetrievalQA chain. This is my code:

pc = Pinecone(api_key = pinecone_api_key)
index = pc.Index(host = host_url)
text_field = 'text' # key of dict that stores the text metadata in the index
vectorstore = Pinecone(index,embeddings = embeddings, text_field = text_field)
 
RetrievalQA.from_chain_type(
            llm=chat,
            chain_type="stuff",
            retriever=vectorstore.as_retriever(),
        )

I am getting the following error:

retriever=vectorstore.as_retriever(),
AttributeError: ‘Pinecone’ object has no attribute ‘as_retriever’

How should I proceed?

I wish to further use Maximal Marginal Separation, I thought this would work, would this is still giving the same error.

retriever = searcher.as_retriever()
retriever.search_kwargs['top_k'] = 25
retriever.search_kwargs['maximal_marginal_relevance'] = True
retriever.search_kwargs['k'] = 15

Cory_Pinecone · April 6, 2024, 2:39pm

Looks like you may be using an older version of Langchain, which uses the same name for its vector store object that we do for the Pinecone connection object. Please be sure to update to the latest version of Langchain which uses this format to import and instantiate the Langchain object:

from langchain_pinecone import PineconeVectorStore

index_name = "langchain-test-index"

docsearch = PineconeVectorStore.from_documents(docs, embeddings, index_name=index_name)

Noob_Adveat · April 6, 2024, 2:58pm

Please correct me if I am wrong, but I thought that from_documents was used to add to the VectorStore.

I had already used that when creating the Index.

Now I wanted to access that very Index in a new session, would from_documents still be used?

Cory_Pinecone · April 6, 2024, 3:23pm

Right, but from_documents is part of Langchain. Not the Pinecone SDK. Since older versions of Langchain used the same name we do for a different class trying to call Pinecone.from_documents() will fail.

luis.br02 · April 30, 2024, 2:41am

Hello !

I am having the same issue since i tried to use Pinecone as a retreiver this way:

vectorstore = Pinecone(index_name, "combined")

qa = RetrievalQA.from_chain_type(
    llm = llm,
    chain_type = "combined",
    retriever = vectorstore.as_retriever()
    )

i have read to use langchain PineconeVectorStore.from_documents but here is the thing. This function will embed text into vectors and store it in the index, but what happens if I already have the text embedded and I don’t want to re-embed it again and instead to “retreive or fetch” the index vectors and use it as a retreiver. do you know how can i do it ?

akhilakamma1998 · June 9, 2024, 7:37pm

Hello !

I was able to resolve that issue by calling the pinecone as below.
from langchain.vectorstores import Pinecone as PineconeStore

index = pc.Index(index_name)
vectordb = PineconeStore(index, embeddings,“text”)
qa = RetrievalQA.from_chain_type(
llm=llm,
retriever=vectordb.as_retriever(search_kwargs={‘k’: 2}),
chain_type=“stuff”,
chain_type_kwargs=chain_type_kwargs,
return_source_documents=True
)
while True:
user_input=input(f"Input Prompt:")
result=qa({“query”: user_input})
print("Response : ", result[“result”])

This code will be helpful if you already have a Pinecone index setup

ZacharyProser · June 10, 2024, 1:34pm

@akhilakamma1998, great - glad you got it working, and thank you so much for coming back and sharing!

Best,
Zack