Description
Trying to understand what is actually happening with the combination of
rerank with HybridFusion.RELATIVE_SCORE.
HybridFusion.RELATIVE_SCORE documentation says:
“To mitigate this effect, Weaviate automatically performs a search with a higher limit (100) and then trims the results down to the requested limit.”
If I specify a limit in the search query:
response = collection.query.hybrid(
query=instring,
alpha=0.6, #proportion of vector search 1.0 = 100% vector
return_metadata=wvc.query.MetadataQuery(
score=True,
distance=True,
explain_score=True),
fusion_type=HybridFusion.RELATIVE_SCORE,
limit=8,
rerank=Rerank(
prop="search_text",
query=instring
),
)
The code above works fine and returns both the normalized hybridFusion normalized scores and the rerank scores.
Does the rerank “rerank” only the limit=8 documents, or does it rerank the 100 initial documents retrieved by HybridFusion.RELATIVE_SCORE behind the scenes?
I’d prefer a wider rerank, just unclear on what this combination does…
If I want to rerank the top 50 documents, I guess I could start with limit=50 then trim the result to the final top 8?
Server Setup Information
- Weaviate Server Version: 1.24.1
- Deployment Method: docker
- Multi Node? Number of Running Nodes: 1
- Client Language and Version: python 4.5.5
Any additional Information
all above