Low QPS when using gRPC (v4) to batch insert data

AnnTade · January 23, 2025, 1:09pm

Hello everyone.
We need to migrate around 10 million records from one Weaviate instance to another Weaviate cluster hosted on OpenStack, which has 3 nodes. Our schema's sharding is set to 3 and the replication factor is set to 3. We are using Weaviate client v4.
In our client connection code, in the additional config, we have Timeout(init = 120, query = 120, insert = 400).
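
For reference, a minimal sketch of what such a connection could look like with the v4 Python client. The host names and ports are placeholders (assumptions, not our real setup); only the three timeout values come from our configuration.

```python
# Timeout values in seconds, as quoted above.
TIMEOUTS = {"init": 120, "query": 120, "insert": 400}


def connect(http_host, grpc_host, http_port=8080, grpc_port=50051):
    """Open a v4 client connection with the timeouts above.

    Host names and ports are hypothetical placeholders.
    """
    # Imported lazily so this module loads even without weaviate-client installed.
    import weaviate
    from weaviate.classes.init import AdditionalConfig, Timeout

    return weaviate.connect_to_custom(
        http_host=http_host,
        http_port=http_port,
        http_secure=False,
        grpc_host=grpc_host,
        grpc_port=grpc_port,
        grpc_secure=False,
        additional_config=AdditionalConfig(timeout=Timeout(**TIMEOUTS)),
    )
```

The client returned by `connect(...)` should be closed with `client.close()` once the migration run is done.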

We have tried multiple experiments and we are getting suspiciously low QPS of around 10-11.

Here are some of the things we’ve tried.

Having bulk 10k records, using dynamic batching with default consistency level - it inserted only about 2k records.
Having bulk 2k records, using dynamic batching with default consistency level - it inserted only about 1.9k records.

Both of the above gave us the following error in our logs after the run completed:

ERROR:weaviate-client:{'message': 'Failed to send all objects in a batch of 903', 'error': 'WeaviateBatchError('Query call with protocol GRPC batch failed with message <AioRpcError of RPC that terminated with:\n\tstatus = StatusCode.DEADLINE_EXCEEDED\n\tdetails = "Deadline Exceeded"\n\tdebug_error_string = "UNKNOWN:Error received from peer {created_time:"2025-01-22T20:13:54.414688058+00:00", grpc_status:4, grpc_message:"Deadline Exceeded"}"\n>.')'}
ERROR:weaviate-client:{'message': 'Failed to send 903 objects in a batch of 903. Please inspect client.batch.failed_objects or collection.batch.failed_objects for the failed objects.'}
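
Since the error points at the failed objects, one option is to collect them and resend them in smaller chunks. Below is a client-agnostic sketch of that idea. `send_chunk` is a hypothetical callable (not part of the Weaviate client) that sends one chunk and returns the items that failed; the chunk size and number of rounds are arbitrary.

```python
import time


def chunked(items, size):
    """Yield consecutive slices of `items` with at most `size` elements."""
    for start in range(0, len(items), size):
        yield items[start:start + size]


def send_with_retry(items, send_chunk, chunk_size=200, max_rounds=3, backoff_s=0.0):
    """Send items in chunks and resend whatever failed, up to max_rounds times.

    `send_chunk(chunk)` is a hypothetical callable returning the failed items.
    Returns the items that still failed after the last round.
    """
    pending = list(items)
    for round_no in range(max_rounds):
        failed = []
        for chunk in chunked(pending, chunk_size):
            failed.extend(send_chunk(chunk))
        if not failed:
            return []
        pending = failed
        if round_no < max_rounds - 1:
            # Exponential backoff before the next round.
            time.sleep(backoff_s * (2 ** round_no))
    return pending
```

This only works around the symptom; it does not explain why the server misses the deadline in the first place.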

Having bulk 1k objects, using dynamic batching with default consistency level - the insertion was successful and took about 90 seconds = about 11 QPS
Having bulk 1.5k objects, using dynamic batching with default consistency level - the insertion was successful and took about 134 seconds = about 11 QPS
Having 5 parallel processes, using dynamic batching with default consistency level, each process having about 2k bulk objects - it failed to insert all of them and gave us the same error
Having 5 parallel processes, using dynamic batching with consistency level set to ONE, each process having about 2k bulk objects - it failed to insert all of them and gave us the same error
Having 5 parallel processes, using fixed batching with batch size 200, concurrent requests 2, each process having about 2k bulk objects - it failed to insert all of them and gave us the same error
Having 5 parallel processes, using fixed batching with batch size 200, concurrent requests 2, consistency level set to ONE, each process having about 2k bulk objects - it failed to insert all of them and gave us the same error
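
To make the arithmetic behind the "about 11 QPS" figure explicit, here is a small sketch using only the two successful runs above. The projection to 10 million records assumes the rate stays constant, which is a simplification.

```python
def objects_per_second(n_objects, seconds):
    """Throughput of one run in inserted objects per second."""
    return n_objects / seconds


# (objects inserted, seconds taken) for the two successful runs.
runs = [(1000, 90), (1500, 134)]
rates = [objects_per_second(n, s) for n, s in runs]
average_rate = sum(rates) / len(rates)

# Projected duration for the full migration at that rate.
total_records = 10_000_000
projected_days = total_records / average_rate / 86_400

print([round(r, 1) for r in rates])
print(round(projected_days, 1))
```

At this rate the full migration would take on the order of ten days, which is why the throughput matters so much to us.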

We have tried other variations too, mixing and matching these: each process having 5k objects to insert, increasing the batch size to 500 with fixed batching, and increasing concurrent requests to 5. However, we always get the gRPC DEADLINE_EXCEEDED error in the logs after the run, not all objects are inserted, and for however many objects do get inserted, the QPS is still 10-11.
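
For context, the fixed-size variant we ran looks roughly like the sketch below. The function name, the record layout (`properties`, `vector`, `uuid` keys) and the `max_errors` threshold are illustrative assumptions; `collection` is expected to be a v4 collection object.

```python
def insert_fixed(collection, records, batch_size=200, concurrent_requests=2, max_errors=50):
    """Insert records with fixed-size batching and return the failed objects.

    `records` is assumed to be an iterable of dicts with a "properties" key
    and optional "vector" and "uuid" keys (a hypothetical layout).
    """
    with collection.batch.fixed_size(
        batch_size=batch_size, concurrent_requests=concurrent_requests
    ) as batch:
        for record in records:
            batch.add_object(
                properties=record["properties"],
                vector=record.get("vector"),
                uuid=record.get("uuid"),
            )
            # Stop early instead of pushing more data into a failing run.
            if batch.number_errors > max_errors:
                break
    # Populated by the client once the batch context has exited.
    return collection.batch.failed_objects
```

For the runs with consistency level ONE, the collection would first be wrapped with `collection.with_consistency_level(ConsistencyLevel.ONE)`, with `ConsistencyLevel` imported from `weaviate.classes.config`.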

Shouldn’t QPS be higher with gRPC? What are the possible causes of this issue?

DudaNogueira · January 27, 2025, 8:36pm

hi @AnnTade !!

What is the server version?

This scenario points to not enough resources being allocated. Do you have any memory readings?

What is the dimensionality, and what was your resource plan?
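
As a rough way to sanity-check the resource plan, here is a back-of-the-envelope sketch. It assumes float32 vectors and a rule-of-thumb overhead factor of 2 for the vector index; the 768 dimensions are purely a placeholder, since the real dimensionality isn't known yet.

```python
def vector_memory_gib(n_objects, dimensions, bytes_per_float=4, overhead_factor=2.0):
    """Rough memory estimate in GiB for holding the vectors and index in RAM.

    overhead_factor=2.0 is a rule-of-thumb assumption, not a measured value.
    """
    raw_bytes = n_objects * dimensions * bytes_per_float
    return raw_bytes * overhead_factor / 2**30


# 10 million objects from the post; 768 dimensions is a hypothetical value.
estimate = vector_memory_gib(10_000_000, 768)
print(round(estimate, 1))
```

With a replication factor of 3 on 3 nodes, each node would likely hold a replica of every shard, so an estimate like this would apply per node rather than to the cluster as a whole.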

Also, do you see anything in the server logs?

Thanks!
