Read query speed

Saketh · August 20, 2025, 7:54am

I noticed the documentation says query latency is not improved by sharding—only by replication. My mental model is that, when a query arrives, the coordinator contacts every shard-hosting pod, each pod searches its local data in parallel, and the coordinator then merges the partial results. Intuitively that sounds faster as we add more shards. Where does my reasoning diverge from how Weaviate actually processes a query?

DudaNogueira · August 20, 2025, 9:34pm

Hi @Saketh !!

Your mental model is correct. However, the bottleneck will not be searching, but merging the results.

More sharding means:

Wait for all shards to complete their searches
Merge and sort results from all shards
Apply global limits and consistency checks

This means query latency is bounded by the slowest shard, not improved by parallelization.

Replication, on the other hand, will help as it provides multiple copies of the same data, allowing the system to route queries to the fastest available replica rather than waiting for a specific slow shard.

Let me know if this clarifies it for you

Happy coding!

Saketh · August 21, 2025, 2:18am

In the case of sharding wont the slowest shard be still faster than the case of not sharding, because search in each shard is only limited to a part of the data not the entire data.

DudaNogueira · August 21, 2025, 12:16pm

Yes, but more shards will also mean more overhead on merging results. So adding more shards will make the merging more costful, and less effective if you want increased query latency.

Saketh · August 21, 2025, 12:30pm

Oh I see, thank you @DudaNogueira !

Topic		Replies	Views
High Query latency in Weaviate Support	13	1065	October 1, 2024
Multiple shards at one node General	1	477	February 6, 2024
How to support concurrent near_text search queries Support	1	553	February 21, 2024
Horizontally scaling weaviate (sharding & replication) Support	3	1490	April 8, 2024
Increase number of shards and update HNSW vector index parameters Support python	6	1103	August 28, 2024

Read query speed

Related topics