Short Chunks Being Penalized in Reranking Pipeline — How to Fix?

Aadil_5122 · April 8, 2025, 4:19am

Description

Hey everyone,I’ve indexed around 1000 documents in Weaviate, and my pipeline works as follows:

Perform a hybrid search to retrieve the top 100 chunks (not full documents).
Rerank these 100 chunks using Cohere.
Send the top 15 reranked chunks to an LLM for generation.

The issue I’m facing is this:If a document is very small—say, a one-liner—it often gets excluded from the final 15 chunks, even though it is part of the initial 100 retrieved by Weaviate. On the other hand, when the same piece of information exists as part of a larger document, it more reliably appears in the final top 15.
This is problematic because some of these short documents contain high-signal content that should ideally be prioritized in generation. I’m using a hybrid reranker setup—Cohere for semantic relevance and BM25 for keyword overlap—and both are supposed to be independent of document length during reranking.How can I mitigate this issue and ensure that short, high-signal chunks aren’t overlooked in the reranking phase? Any suggestions or best practices would be greatly appreciated!

Server Setup Information

Weaviate Server Version: 1.29.0
Deployment Method: WCS
Multi Node? Number of Running Nodes: High Availability Cluster
Client Language and Version: weaviate-client==4.11.3
Multitenancy: Enabled

DudaNogueira · April 10, 2025, 6:26pm

hi @Aadil_5122 !!

Welcome to our community

That’s an interesting finding Unfortunately this seems to be totally dependable on the reranker model

One possible solution is to get those 100 objects, remove the short ones with a considerable good score, then pass them to reranker and finally llm.

The drawback of this is that you will need to add some more code, as you will not be using Weaviate rerank nor generative modules.

That will give you more control over the pipeline, and then can better tweak it for your needs.

Let me know if that helps!

Topic		Replies	Views
Rerank with HybridFusion.RELATIVE_SCORE - How many are ranked? Support	3	369	May 8, 2024
Python client v4, is Cohere reranker still enabled by default? Support python	12	605	June 1, 2024
Hybrid query with rerank， get context deadline exceeded ERROr Support	1	235	August 7, 2024
Reranker- change batch size? Support	2	363	January 8, 2024
Using a local reranker-transformers reduces performance by 100x Support python	3	465	July 15, 2024

Short Chunks Being Penalized in Reranking Pipeline — How to Fix?

Description

Server Setup Information

Related topics