Hybrid Search Recall Inconsistency: limit parameter significantly alters Top-1 retrieval results (False Negatives at lower limits)

Carloszone · December 19, 2025, 7:28am

Description

I am running a Hybrid Search on a collection with hundreds of thousands of objects. I am trying to retrieve a specific chunk which I know exists in the database. I have performed a controlled experiment using the exact same query but varying the alpha and limit parameters.

The Experiment: I tested three alpha settings: 0, 0.5, and 1. For each alpha, I compared the results between limit: 1000 and limit: 10000.

The Observation:

When limit: 1000: The target chunk was NOT found in the returned list for any of the alpha settings (0, 0.5, or 1).
When limit: 10000: The target chunk was successfully retrieved and, notably, it was ranked #1 in the results.

The Confusion: This behavior is counter-intuitive. My understanding is that without an explicit sort order, Weaviate returns results by relevance score (descending). Therefore, I expected the Top-1000 results of a limit=1000 query to be identical to the Top-1000 subset of a limit=10000 query.

The fact that the #1 ranked item (at limit 10k) completely disappears when the limit is reduced to 1k suggests that the limit parameter is implicitly controlling the search depth (e.g., HNSW ef parameter or WAND pruning threshold) rather than just truncating the final result list.

My Concerns & Questions:

Search Scope: Does Weaviate dynamically adjust the underlying ANN search scope (ef) or the inverted index pruning aggressiveness based on the requested limit?
Reliability: If the search scope is indeed tied to the limit, I am concerned about false negatives. If a target chunk is missed at limit=10000 (which is often the default hard cap), is there any way to ensure it is considered during the retrieval phase?
Configuration: How can I configure the search to ensure that high-scoring candidates are not pruned early in the process, even if I use a smaller limit? (I am already using ID filters to narrow down the scope, but the candidate pool remains large).

I would appreciate any insights into the underlying mechanism and advice on how to guarantee recall for top-ranking items without having to request excessively large limits.

Server Setup Information

Weaviate Server Version: 1.31.2
Deployment Method: docker
Multi Node? Number of Running Nodes: 1 node
Client Language and Version: 4.15.2
Multitenancy?: No

Any additional Information

DudaNogueira · December 19, 2025, 1:21pm

hi @Carloszone !!

Can you reproduce this on the latest version? We have had a lot of fixes since 1.31.2, and this will help us narrow it down from the get-go.

Try to always be on at least 1.31.latest, as we backport the most important fixes.

However, it is important to note:

Yes, Weaviate dynamically adjusts search scope based on the limit parameter. The limit affects both the HNSW ef parameter for vector search and the WAND pruning threshold for BM25 search, which explains why your top-ranked item appears at limit=10000 but disappears at limit=1000.

If you want to dive into a longer explanation tied to our codebase, with suggested knobs to change in your configuration, check out this cool link

Let me know if this helps!

trengrj · December 24, 2025, 3:27am

Hi @Carloszone ,

Because Weaviate does an approximate kNN search, you can get this sort of result when increasing the limit.

With limit = 100, Weaviate will by default search the HNSW graph with ef = 500 (due to the dynamicEf values). However, when you set limit = 10000, Weaviate will search the graph with ef = 10000. The ef value can be thought of as the size of the “working set” in the priority queue that is tracked before returning the results. With a larger ef value you will discover more vectors and so have a better chance of finding the best vector (i.e. the best chunk, as you mentioned). When returned, this vector will appear as the top result because it has the lowest distance, but much more of the graph had to be searched to find it.
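To make the relationship between limit and ef concrete, here is a minimal Python sketch of how the effective ef could be derived. It is not Weaviate's actual code: the function name effective_ef is hypothetical, the defaults (dynamicEfMin = 100, dynamicEfMax = 500, dynamicEfFactor = 8) are the documented ones as far as I know, and the rule that ef is never smaller than the limit is an assumption that matches the numbers above.

```python
def effective_ef(limit, ef=-1, dynamic_ef_min=100, dynamic_ef_max=500,
                 dynamic_ef_factor=8):
    """Illustrative sketch only: estimate the ef used for a query with `limit`.

    Assumes ef = -1 means "dynamic ef" and that ef is never allowed to be
    smaller than the requested limit.
    """
    if ef < 1:
        # Dynamic ef: scale with the limit, then clamp to [min, max].
        ef = limit * dynamic_ef_factor
        ef = max(dynamic_ef_min, min(ef, dynamic_ef_max))
    # Assumed floor: the working set must hold at least `limit` results.
    return max(ef, limit)


for limit in (5, 100, 1000, 10000):
    print(limit, effective_ef(limit))
```

Under these assumptions, limit = 1000 searches with ef = 1000 while limit = 10000 searches with ef = 10000, so the larger query explores roughly ten times as many candidates, which is consistent with the target chunk only showing up at the higher limit.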

You can improve results by setting the ef value in the vector index config to a larger value, though this will reduce QPS. Additionally, you can modify other HNSW build parameters like maxConnections or efConstruction to reach the desired recall. Please note, however, that there will always be some randomness due to the approximate nature of the algorithms.
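When tuning ef, maxConnections or efConstruction, it helps to measure recall rather than eyeball the results. The following is a small sketch of a recall@k check in plain Python. The helper name recall_at_k and the sample IDs are hypothetical, and it assumes you already have a trusted reference ranking (for example from a brute-force search or from a query run with a much larger limit) to compare against.

```python
def recall_at_k(approx_ids, reference_ids, k):
    """Fraction of the reference top-k that also appears in the approximate top-k."""
    if k <= 0:
        raise ValueError("k must be positive")
    truth = set(reference_ids[:k])
    if not truth:
        return 0.0
    found = truth.intersection(approx_ids[:k])
    return len(found) / len(truth)


# Hypothetical object IDs: the reference run ranks "chunk-42" first,
# but the lower-ef run never reaches it.
reference = ["chunk-42", "chunk-7", "chunk-13", "chunk-99"]
low_ef_run = ["chunk-7", "chunk-13", "chunk-99", "chunk-5"]

print(recall_at_k(low_ef_run, reference, k=1))
print(recall_at_k(low_ef_run, reference, k=4))
```

Running this check over a sample of queries before and after a config change gives a rough sense of how much recall a given ef setting buys relative to its QPS cost.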
