Description
I load tested my API today and found it handles only about 40 requests per second (RPS). This surprised me, considering how fast each individual Weaviate query is.
It turns out the Weaviate search queries are not running concurrently. Is anyone willing to share how you made an API that issues Weaviate queries more performant?
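To illustrate what "not concurrent" means here, below is a minimal sketch that compares issuing blocking queries one after another with issuing them through a thread pool. It does not call Weaviate at all: `run_query` is a hypothetical stand-in that just sleeps for 50 ms to imitate a blocking search call, and the function names and `max_workers=8` are assumptions chosen for the example. Whether a real API behaves the same way depends on the web framework and on how the client is used.

```python
import time
from concurrent.futures import ThreadPoolExecutor


def run_query(text: str) -> str:
    """Hypothetical stand-in for a blocking Weaviate search call."""
    time.sleep(0.05)  # simulate ~50 ms of network + search latency
    return f"result for {text}"


def run_sequential(queries):
    """Issue the queries one at a time; total time is the sum of latencies."""
    return [run_query(q) for q in queries]


def run_concurrent(queries, max_workers=8):
    """Issue the queries from a thread pool; waits overlap instead of adding up."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # pool.map preserves the input order of the results
        return list(pool.map(run_query, queries))


def timed(fn, *args):
    """Return (result, elapsed_seconds) for a single call to fn."""
    start = time.perf_counter()
    result = fn(*args)
    return result, time.perf_counter() - start


if __name__ == "__main__":
    queries = [f"q{i}" for i in range(8)]
    _, t_seq = timed(run_sequential, queries)
    _, t_con = timed(run_concurrent, queries)
    print(f"sequential: {t_seq:.2f}s, concurrent: {t_con:.2f}s")
```

If the request handlers in the API look like `run_sequential`, or the server handles only one request at a time, throughput is capped by per-query latency no matter how fast each query is.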
Server Setup Information
- Weaviate Server Version:
- Deployment Method: docker
- Multi Node? Number of Running Nodes: 1
- Client Language and Version: Python, client v4
Any additional Information
I read about the multi-node setup in Weaviate. Would that help with this issue? And what about this setting: QUERY_DEFAULTS_LIMIT: 25?