Hello Everyone, My name is Emmanuel Katto. I am new to this community. What are the best practices for scaling applications that utilize Pinecone for vector search? How do I handle increased loads?
We have a reference architecture that demonstrates how you can build a system that scales resources up and down for data ingestion and retrieval. There’s a long form article on it here, as well as code samples: Exploring the Pinecone AWS Reference Architecture | Pinecone