High Token Consumption

pietrosoftware.34 · December 24, 2024, 11:38am

Good morning,
I have an issue with very high token consumption for simple queries.

I send a message through an API call to a Pinecone AI Agent that has the task to extract the name from the message and find the closest match based on the names present in the document it is trained with.

but the token consumption for just outputting the product name is 2800-3200 tokens.

Is there a way to lower it?

Thank you!