Hey y’all,
My team and I are aiming create embeddings for different sections of a documents that are not uniquely formatted.
We’re experimenting with several python scripts that will divide documents into sections based on paragraphs, headings, or custom criteria.
Just curious to know if anyone has a great work flow or smooth way to programmatically solved the problem of dividing a document into sections, depending on the structure and content of the document?