How to handle concurrent requests in re-rerank feature in Weaviate?

ChengCheng · January 22, 2024, 5:32am

Currently, re-rank does not support concurrent requests. Will we develop concurrent requests of rerank in the future? Or how to reduce the time consumption of re-rank for multiple collections？Thanks

DudaNogueira · January 22, 2024, 9:57pm

Hi @ChengCheng !! Welcome to our community

Not sure I follow. AFAIK, the rerank API will require all documents to be sent at the same time, for example, cohere api:

Can you elaborate on that?

Thanks!

ChengCheng · January 23, 2024, 9:20am

Hi , @DudaNogueira Thanks for your reply. We are using the reranker-transformers. There are multiple classes (collections) in Weaviate. ‘rerank-transformer’ cannot be processed in parallel.
e.g. query 9 collections in parallel, weaviate returns chunks from 9 collections in parallel (you get chunks from 9 collection at the same time), but the weaviate re-ranker re-rank the collections one by one (sequentially)
How can we speed up retrieval when we want to call multiple classes with rerank feature in parallel?

Benjamin_Lush · May 24, 2024, 10:41pm

Just a shot in the dark here… perhaps you can run multiple instances of Weaviate and each can have it’s own reranker-transformers instance? I’m having the same problem and am interested in the solution. I’ll let you know how my experiment turns out.

DudaNogueira · May 29, 2024, 7:29pm

Hi!

Welcome to our community @Benjamin_Lush !

I believe you will need to run multiple instances of the inference/reranker of the same models , and run that behind a load balancer.

With that you could spread the load?

Topic		Replies	Views
Does weaviate support reranker model provided by aws in typescript version? Support	1	115	January 16, 2025
Python client v4, is Cohere reranker still enabled by default? Support python	12	605	June 1, 2024
Reranker- change batch size? Support	2	363	January 8, 2024
Using a local reranker-transformers reduces performance by 100x Support python	3	465	July 15, 2024
What endpoints are required for a custom reranker? Support integration	6	1017	November 14, 2023

How to handle concurrent requests in re-rerank feature in Weaviate?

Related topics