Async BATCH request for real-time ANN search

justatom · February 7, 2024, 2:01pm

Hi everyone!

I am running weaviate instance locally and referring to this stackoverflow post.

So, let’s say that all the logic with respect to weaviate API incapsulated into IWeaviateDocStore class. I’m receiving requests in async manner from the client and I’d like to perform ANN search on them using batches instead of manual one by one iteration. Is that possible now with weavaite? The pseudo-code is below:


import json
import aiohttp
import asyncio
from sir.storing.weaviate import IWeaviateDocStore

async def search(query_string: str, url:str):   

    query = '''
        {
            Get{
                Article(
                    limit: 20
                    hybrid: {
                        query: "<QUERY_PLACEHOLDER>"
                        alpha: 0.75
                        fusionType: relativeScoreFusion
                    }){
                        texte
                        title
                    }
                    _additional {
                        score
                    }
                }
            }
        }
    '''.replace("<QUERY_PLACEHOLDER>", query_string)

    async with aiohttp.ClientSession() as session:
        async with session.post(url, headers=headers, data=json.dumps({'query': query})) as response:
            return await response.json()


def main():
   BATCH_SIZE = 10
   INCOMING_REQS: List[str] = ["What are the rules of the hunger games by S.Collins?", "What is lumos maxima?", ... ,]
   # HELP. Do I need to manually perform `batching` or I can simply send it to weavaite and it'll handle it automatically?

   session = aiohttp.ClientSession()
   URL = "http://localhost:8080"
   
   batch_reqs = await asyncio.gather(*[search(session=session, url=URL, query_string=query) for query in INCOMING_REQS]
   #
   await session.close()

      
   

if __name__ == "__main__":
   asyncio.run(main())

ps. Thx in advance and really sorry if this functionality is already in the docs, haven’t found it.

Add Async Client to be able to make asynchronous requests · Issue #88 · weaviate/weaviate-python-client · GitHub

DudaNogueira · February 7, 2024, 9:17pm

Hi!

if you consume the endpoints directly, I believe this would work

There are some work in progress for allowing this from the client, specially with the new GRPC connection that improves performance.

Topic		Replies	Views
How to do a async call to search in weaviate General	1	1334	September 27, 2023
WeaviateHybridSearchRetriever with WeaviateAsyncClient Support	2	146	September 8, 2024
What is the best practice to use v4 python client for query with fastapi(or other async python framework) Support	5	1045	January 19, 2024
Batch thread died unexpectedly Support developer-experience	11	194	July 22, 2024
Async weaviate query hybrid General bug , python	1	179	December 9, 2024

Async BATCH request for real-time ANN search

Related topics