Cannot process batch: not enough memory

Dharanish · March 25, 2024, 8:18am

Hi , configured batch on
client.batch.configure(batch_size=100,dynamic=True,consistency_level=“ALL”,connection_error_retries=3,num_workers=2)
perform batch insertion and getting exception
UnexpectedStatusCodeException: batch response! Unexpected status code: 500, with response body: {‘error’: [{‘message’: ‘batch objects: &fmt.wrapError{msg:“cannot process batch: not enough memory”, err:(*errors.errorString)(0xc00004c020)}’}]}.

Weaviate version :1.23.11
Deployment type: Kubernetes cluster
Python client : 3.24.2
available memory per container: 4GB

DudaNogueira · March 25, 2024, 10:38am

hi @Dharanish !

Can you try using the new python v4 client?

Also, does the server has enough memory considering the amount of objects to be ingested?

And finally, does it happens if you reduce the batch size?

If you try to the python v4 client, I suggest using the dynamic batch size, so Weaviate server can adjust the batch size while communicating with the client.

Thanks!

Dharanish · August 8, 2024, 10:28am

Hi @DudaNogueira , My cluster has enough memory and batch size is 20 and using client 4.5.7, still getting this error

DudaNogueira · August 8, 2024, 2:31pm

This is the origin of this error:

github.com

weaviate/weaviate/blob/6fd24322f15ecf46d40e1e35754b524fe96e61f5/adapters/repos/db/batch.go#L39


      
          	originalIndex []int
          }
          
          func (db *DB) BatchPutObjects(ctx context.Context, objs objects.BatchObjects,
          	repl *additional.ReplicationProperties, schemaVersion uint64,
          ) (objects.BatchObjects, error) {
          	objectByClass := make(map[string]batchQueue)
          	indexByClass := make(map[string]*Index)
          
          	if err := db.memMonitor.CheckAlloc(estimateBatchMemory(objs)); err != nil {
          		db.logger.WithError(err).Errorf("memory pressure: cannot process batch")
          		return nil, fmt.Errorf("cannot process batch: %w", err)
          	}
          
          	for _, item := range objs {
          		if item.Err != nil {
          			// item has a validation error or another reason to ignore
          			continue
          		}
          		queue := objectByClass[item.Object.Class]
          		queue.objects = append(queue.objects, storobj.FromObject(item.Object, item.Object.Vector, item.Object.Vectors))

It may be hitting the limits while ingesting.

Does it also happen if you reduce the size of the batch or increase the memory?

Also, have you tried this on recent version?

bharath97 · December 11, 2024, 1:00pm

Hi @DudaNogueira , @Dharanish ,
I’ve also seen the same issue today.
Running weaviate: 1.24.11

Restarting the weaviate service has resolved this(temporarily I guess).

DudaNogueira · December 11, 2024, 8:15pm

hi @bharath97 !!

In order to mitigate this, some recommendations would be upgrading to latest version (a lot have improved from from 1.24 to 1.28). Also, we need to be aware that while ingesting data with vectors, Weaviate will also index that data, while write it accordingly. For example, Weaviate 1.28 version, that was just released, had some interesting improvements on ASYNC_INDEXING

So allocating more memory so the ingestion can happen smoothly is an option.

Another route here is to enable ASYNC_INDEXING. This will allow Weaviate to “take it’s time” to index everything and will also make the ingestion process quicker, as it will not perform the indexing right away, but asynchronously.

Let me know if that helps!

THanks!

bharath97 · December 12, 2024, 5:45am

Hey @DudaNogueira ,
Thanks for responding to this.
Let me try setting the ASYNC_INDEXING config to true and see if this improved as I see it’s available from 1.22.
If not, I’ll try upgrading to 1.28.

Thanks!

bharath97 · December 16, 2024, 11:27am

Hello @DudaNogueira,
I have set the ASYNC_INDEXING to true and also upgraded weaviate to 1.28, however the issue still seems to persist.
Can you help please?
FYI - my client version is: 3.24.1 (Python SDK)

Some logs from weaviate that could be of any help:

cannot load vector into cache due to memory pressure
find and connect neighbors: at level 0: search layer at level 0: calculate distance between candidate and query: not enough memory

DudaNogueira · December 16, 2024, 1:53pm

It is best to use the python version 4+.

Also, do you have any observability on this server?

What is the memory and cpu allocated? have you done something as documented here?

Ps: it is best to open a new thread, so we can answer it from there.

Thanks!

Topic		Replies	Views
Batch inserts failing for weaviate Support python , technical	8	302	September 17, 2024
Payload Too Large Support	1	272	February 23, 2024
Error: 'WeaviateBatchError('Query call with protocol GRPC batch failed with message <>) Support	4	350	January 2, 2025
Weaviate Cloud Serverless - Batch Insert 502 Server Side errors with v4 client Support	3	278	August 29, 2024
Weaviate Batch Errors during Batch Insertion with v4 client Support bug , developer-experience , wcs , python , documentation	11	1224	May 15, 2024

Cannot process batch: not enough memory

Related topics