Rerank with HybridFusion.RELATIVE_SCORE - How many are ranked?

Description

Trying to understand what is actually happening with the combination of
rerank with HybridFusion.RELATIVE_SCORE.

HybridFusion.RELATIVE_SCORE documentation says:
“To mitigate this effect, Weaviate automatically performs a search with a higher limit (100) and then trims the results down to the requested limit.”

If I specify a limit in the search query:

response = collection.query.hybrid(
    query=instring,
    alpha=0.6, #proportion of vector search 1.0 = 100% vector
    return_metadata=wvc.query.MetadataQuery(
                    score=True,
                    distance=True,
                    explain_score=True),
    fusion_type=HybridFusion.RELATIVE_SCORE,
    limit=8,
    rerank=Rerank(
        prop="search_text",
        query=instring
    ),
)

The code above works fine and returns both the normalized hybridFusion normalized scores and the rerank scores.

Does the rerank “rerank” only the limit=8 documents, or does it rerank the 100 initial documents retrieved by HybridFusion.RELATIVE_SCORE behind the scenes?

I’d prefer a wider rerank, just unclear on what this combination does…
If I want to rerank the top 50 documents, I guess I could start with limit=50 then trim the result to the final top 8?

Server Setup Information

  • Weaviate Server Version: 1.24.1
  • Deployment Method: docker
  • Multi Node? Number of Running Nodes: 1
  • Client Language and Version: python 4.5.5

Any additional Information

all above

hi @Gene_Mc !!

Weaviate will do a 2 phase process. It will first perform the search you requested (hybrid, near vector, etc), and subsequently it will pass over those objects to the reranker module to get them resorted.

So on your case, it will only perform the rerank over the 8 objects, as your limited for your query.

from:

Let me know if this helps!

Thanks!

Hi @DudaNogueira

This answer does help me understand, so I’ve gone the route of returning 100 documents then plucking the top N for my RAG application.

        response = collection.query.hybrid(
            query=instring,
            alpha=0.6, #proportion of vector search 1.0 = 100% vector
            return_metadata=wvc.query.MetadataQuery(
                score=True,
                explain_score=True,
                ),
            limit=100,
            fusion_type=HybridFusion.RELATIVE_SCORE,
            rerank=Rerank(
                prop="page_content",
                query=instring
            ),
        )

This combination works pretty well… maybe too well? Now I wish I had an easy way to add back some diversity and/or eliminate redundancy… I’ll keep reading…

thank you for your help!

1 Like

That’s great, @Gene_Mc !!

Let us know if you have any other doubts along your journey learning those cool technology :sunglasses:

Just be aware that with more objects being passed to rerank, costs will be higher :money_mouth_face:

We are here to help!

Thanks!