Compare 2 long texts

alexcoch · May 3, 2023, 1:08pm

Hello,
How can I compare for long texts using Pinecone. The questions are of interest to the following:

What is new in this article compared to the articles previously marked as read, by points,
What of this article has already been in previously read articles, point by point.
I guess that the solution may lie through a bundle of Langchain, Vector DB, and GPT, but it is not entirely clear how to formulate the above questions. I am glad to any ideas.
What I did before:
1. I broke the text of the first article into pieces,
2. I got embedding with the GPT API to them,
3. Saved it in Pinecone,
4. I sent requests to Pinecone and then, together with the results obtained, to the GPT API and received human-like responses.
  Perhaps, it’s worth storing chunks in the view of embeddings to DB and then try compare this article with previous one. Is it possible?
  Best,
  Alex