High query latency

We are experiencing very high latency, up to minutes when querying our index.

We are using:

  • serverless index.
  • dimensions 768.
  • about 42 000 000 vectors with metadata.

We’re using the python sdk (grpc and rest gave same results)

It often occurs when no query were performed since some time. Our guess is, the delay is caused by the time to load the vectors like explain here. However, more than a few seconds make it completely unready for production.

Hello @waapi.machine.learni welcome back to the forums.

I know you have an open Support case on this topic, so we will continue to work on it there.

I will post an update here for the community once we get to the bottom of the issue.

1 Like