Vectors aren't immediately deleted after sending delete request

shobrookj · April 21, 2024, 4:50am

Hi, my company recently migrated to Pinecone Serverless. We noticed that deleting vectors by ID does not behave as expected. When we send a delete request (using the Python Pinecone SDK), we immediately get a response. But if we check the index afterwards, we see that the vectors are still there. It’s only after several seconds that they actually get deleted. It appears that the delete endpoint just queues up the vectors to be deleted.

Is this the case? If so, how can we know exactly when the vectors are in fact deleted?

zeke · April 22, 2024, 2:55pm

Hi @shobrookj, Pinecone is eventually consistent, so there can be a slight delay before new or changed records are visible to queries and other read requests.

After adding, updating, or deleting records, use the describe_index_stats operation to check if the current record count matches the number of records you expect.

Please note that Pinecone serverless is in public preview. Performance may fluctuate, and we are continuously improving the architecture, including data freshness capabilities.

shobrookj · April 22, 2024, 5:06pm

Thanks for the response. For our use case, we need to know for sure that the vectors are deleted before performing another operation, otherwise our system will break. It sounds like the best way to do this is to continuously poll describe_index_stats until it reflects the right record count? Are there plans to implement a better solution down the line, such as simply waiting to return a response from the /delete request until the vectors are actually deleted?

zeke · April 22, 2024, 8:42pm

@shobrookj Yes, you are correct that the best approach at this time is to poll describe_index_stats until you observe the expected vector count.
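
A minimal sketch of that polling loop, under stated assumptions: `wait_for_record_count` and its parameters are hypothetical names rather than part of the SDK, and the timeout and interval defaults are arbitrary. The helper takes any zero-argument callable that returns the current count, so it does not depend on a particular SDK version.

```python
import time
from typing import Callable


def wait_for_record_count(
    get_count: Callable[[], int],
    expected: int,
    timeout_s: float = 30.0,
    interval_s: float = 0.5,
) -> bool:
    """Poll get_count() until it equals expected or the timeout elapses.

    Returns True if the expected count was observed, False on timeout.
    """
    deadline = time.monotonic() + timeout_s
    while True:
        if get_count() == expected:
            return True
        if time.monotonic() >= deadline:
            return False
        time.sleep(interval_s)


# Assumed usage with the Python SDK, where `index` is an existing Index handle
# and the stats response exposes `total_vector_count`:
#
#   before = index.describe_index_stats().total_vector_count
#   index.delete(ids=ids_to_delete)
#   ok = wait_for_record_count(
#       lambda: index.describe_index_stats().total_vector_count,
#       expected=before - len(ids_to_delete),
#   )
```

Returning a boolean instead of raising leaves it to the caller to decide whether a timeout should abort the next operation or trigger a retry.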

There are a few considerations regarding how this challenge will be handled in the future:

1. We aim to continue improving our data freshness capabilities on serverless so that this situation is less likely to emerge.
2. We are working on functionality that allows users to bind their read requests to ensure that certain previously issued writes are reflected in the freshness layer used for the read. This will effectively ensure that you can perform a read, knowing that the impact of your write requests is present.

Lastly, I’m curious how many records you are deleting in one request before you test the effect with a fetch.

shobrookj · May 4, 2024, 9:11am

Okay, point #2 sounds great.

One last question: I’m noticing that Pinecone serverless does not support metadata filtering for the describe_index_stats method. I group together vectors based on metadata, and this means I cannot check if the deletion for a particular group of vectors was successful or not. What would you recommend I do?

zeke · May 7, 2024, 8:07pm

@shobrookj You can query using a metadata filter targeting the desired field and value. If the query yields no results, the records were deleted successfully.
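
A rough sketch of that check, assuming the group is identified by a single metadata field and that any vector of the index's dimension can serve as the probe (`group_is_empty`, `probe_vector`, `field`, and `value` are hypothetical names). It relies only on the index object having a `query` method that accepts `vector`, `top_k`, and `filter` and returns a response with `matches`.

```python
from typing import Any, Optional, Sequence


def group_is_empty(
    index: Any,
    probe_vector: Sequence[float],
    field: str,
    value: Any,
    namespace: Optional[str] = None,
) -> bool:
    """Return True if no record with metadata field == value comes back from a query."""
    kwargs = {
        "vector": list(probe_vector),
        "top_k": 1,  # a single match is enough to show the group still exists
        "filter": {field: {"$eq": value}},
        "include_values": False,
        "include_metadata": False,
    }
    if namespace is not None:
        kwargs["namespace"] = namespace
    response = index.query(**kwargs)
    return len(response.matches) == 0
```

Because reads are eventually consistent, a single `False` result right after a delete may only mean the delete is not visible yet, so this check would typically be repeated until it returns `True` or a timeout is reached.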

banumathi.thangavelu · March 18, 2025, 2:23pm

I am facing an issue with Pinecone:

In the recommendation flow, while comparing the user profile with the jobs table in Pinecone, it first provides the matched job results along with their matching scores.
Then I check whether the matched jobs exist in our (dev) database, because we can display a job as a recommendation only if it exists there. Otherwise, we cannot display it, even if it is present in Pinecone and considered a match.
Therefore, I delete jobs from Pinecone that appear in the matched results but are not present in our database. (This happens because sometimes we manually delete jobs from our database, but they are not updated in Pinecone, leaving them available only in the Pinecone database.)
However, even after deleting these jobs, Pinecone still returns the same matched results, including the deleted jobs.
I checked the documentation, and this happens because Pinecone does not reflect deletions immediately. It suggests using describe_index_stats to compare the record count before and after deleting to verify the update.
I checked the job count, but it is the same before and after deletion. Pinecone continues to return the same matched results, all of which are deleted jobs.

How can I resolve this?