Error loading data

Description

I have been using Ragtriever for a few months (with OpenAI and Cohere API keys) and loading hundreds of different documents. After a month with no use, and with no particular document to load that is different from the rest, I came back to the workstation and got the following message:

✘ Loading data failed [E088] Text of length 1053169 exceeds maximum of 1000000. The parser and NER models require roughly 1GB of temporary memory per 100,000 characters in the input. This means long texts may cause memory allocation errors. If you’re not using the parser or NER, it’s probably safe to increase the nlp.max_length limit. The limit is in number of characters, so you can check whether your inputs are too long by checking len(text).
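For reference, E088 is a spaCy error code, so the check the message describes is a plain spaCy check. A minimal sketch (the model choice and file path below are placeholders, not my actual setup):

import spacy

nlp = spacy.blank("en")                 # any spaCy pipeline has the same limit
text = open("my_document.txt").read()   # placeholder path
print(len(text), nlp.max_length)        # e.g. 1053169 vs. the default 1000000

# If the parser/NER aren't needed, the message says it's safe to raise the limit:
nlp.max_length = 2_000_000
doc = nlp(text)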

Any idea what has gone wrong?

Server Setup Information

  • Weaviate Server Version:
  • Deployment Method:
  • Multi Node? Number of Running Nodes:
  • Client Language and Version:

Any additional Information

Hi! Do you mean Verba?

This seems like an issue during chunking :thinking:

What versions are you running?

Hi! Yes, Verba.
I am running weaviate-client==3.23.1 and cohere==4.33.
I am wondering if this has to do with my Cohere subscription tier, but I guess chunking happens before the embeddings.
Any possible fixes?

What is the chunking configuration?

Setting it too high can cause this error. It is basically saying that your text has more characters than the maximum they support.
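If lowering the chunk setting is not an option, a rough workaround is to split the oversized document below the 1,000,000-character limit before loading it. This is just a sketch (naive split on blank lines, placeholder path), not Verba's actual chunker:

MAX_CHARS = 1_000_000

def split_text(text, limit=MAX_CHARS):
    # Naive split on paragraph boundaries; a real chunker would respect sentences.
    chunks, current = [], ""
    for para in text.split("\n\n"):
        if current and len(current) + len(para) + 2 > limit:
            chunks.append(current)
            current = para
        else:
            current = f"{current}\n\n{para}" if current else para
    if current:
        chunks.append(current)
    return chunks

pieces = split_text(open("my_document.txt").read())  # placeholder path
print([len(p) for p in pieces])  # each piece should now be under the limit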