Weaviate OpenAI Embedding Models

Do we have any models for the text2vec-openai embedding module with a token limit greater than 8192?

The message I'm getting:
weaviate.exceptions.UnexpectedStatusCodeException: Create class! Unexpected status code: 422, with response body: {'error': [{'message': "module 'text2vec-openai': wrong OpenAI model name, available model names are: [ada babbage curie davinci text-embedding-3-small text-embedding-3-large]"}]}.

"moduleConfig": {
                "generative-openai": {},
                "text2vec-openai": {
                    "model": "?????",
                }
            },

Hi @spark!!

By default, if you do not provide a model, it will use ada.

However, you can use any of the supported models, as stated in the error message:

  • ada
  • babbage
  • curie
  • davinci
  • text-embedding-3-small
  • text-embedding-3-large

Notice that with the last two you can also specify the dimensions.
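
For example, a class definition that pins the model and dimensions could look like the sketch below (v3 Python client; the class name, property, and URL are placeholders I made up, not values from this thread):

    import weaviate

    client = weaviate.Client("http://localhost:8080")  # placeholder URL

    class_obj = {
        "class": "Document",  # hypothetical class name
        "vectorizer": "text2vec-openai",
        "moduleConfig": {
            "generative-openai": {},
            "text2vec-openai": {
                "model": "text-embedding-3-large",
                "dimensions": 1024,  # only the text-embedding-3-* models accept this
            },
        },
        "properties": [
            {"name": "content", "dataType": ["text"]},
        ],
    }

    client.schema.create_class(class_obj)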

You can find more information on this here:

Let me know if that helps!

Thanks!

I totally understand, @DudaNogueira,
but could you please help me with the issue I'm facing? I was using the default.

{'error': [{'message': "update vector: connection to: OpenAI API failed with status: 400 error: This model's maximum context length is 8192 tokens, however you requested 9655 tokens (9655 in your prompt; 0 for the completion). Please reduce your prompt; or completion length."}]}

{'error': [{'message': "update vector: connection to: OpenAI API failed with status: 400 error: This model's maximum context length is 8192 tokens, however you requested 9745 tokens (9745 in your prompt; 0 for the completion). Please reduce your prompt; or completion length."}]}

All of OpenAI's embedding models currently max out at 8192 tokens. Some open-source embedding models support larger context windows, but I'd suggest chunking your data; you'll (probably) get better performance that way too.
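
As a rough illustration, here is a minimal token-based chunker (a sketch assuming the tiktoken package; the 512-token size and 50-token overlap are arbitrary starting points, not recommendations):

    import tiktoken

    def chunk_by_tokens(text, max_tokens=512, overlap=50):
        # Tokenizer used by OpenAI's current embedding models.
        enc = tiktoken.get_encoding("cl100k_base")
        tokens = enc.encode(text)
        chunks = []
        step = max_tokens - overlap
        for start in range(0, len(tokens), step):
            # Each slice stays well under the 8192-token embedding limit.
            chunks.append(enc.decode(tokens[start:start + max_tokens]))
        return chunks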

Could you please guide me in this regard?
@DudaNogueira @JK_Rider

Here’s a quick guide on chunking which should help out: A Guide to Chunking Strategies for Retrieval Augmented Generation (RAG) — Sagacify.

Hi @spark!!

As @JK_Rider mentioned, the issue is passing too much context.

If you see this when vectorizing (which seems to be the case, considering the “update vector” part of the log), it is probably because your chunks are too big to fit in that context window.

However, if you see this while generating, you are probably passing too many objects (limit=X) to the generation step.
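
For instance, with the v3 Python client, lowering with_limit() shrinks the prompt sent to the generative module (a sketch; the class, property, query text, and URL are placeholders):

    import weaviate

    client = weaviate.Client("http://localhost:8080")  # placeholder URL

    response = (
        client.query
        .get("Document", ["content"])  # hypothetical class and property
        .with_near_text({"concepts": ["chunking strategies"]})
        .with_generate(grouped_task="Summarize these documents in one paragraph.")
        .with_limit(3)  # fewer retrieved objects -> smaller generation prompt
        .do()
    )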

Here is some other content on chunking. As you will soon discover, there isn’t a “one size fits all” approach, as it will depend on a lot of requirements.

And also this video on advanced RAG techniques:

Thanks!

By the way, we have an upcoming webinar on this topic:

Chunking

Live workshop
Wednesday, August 28th
9am PDT, 12pm EDT, 6pm CEST

Since the subject is chunking, my two cents: Using gpt-4 API to Semantically Chunk Documents - #166 by SomebodySysop - API - OpenAI Developer Forum
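
The idea there, roughly, is to let a chat model propose the split points. A minimal sketch (assuming the openai Python package; the model name, prompt, and “---” delimiter are my assumptions, not that thread's approach verbatim):

    from openai import OpenAI

    oai = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

    def semantic_chunk(text):
        # Ask the model to mark topic boundaries with a delimiter we can split on.
        response = oai.chat.completions.create(
            model="gpt-4o",  # placeholder; any capable chat model
            messages=[
                {"role": "system", "content": "Split the user's text into self-contained "
                    "sections at topic boundaries. Separate sections with a line containing only '---'."},
                {"role": "user", "content": text},
            ],
        )
        content = response.choices[0].message.content
        return [part.strip() for part in content.split("---") if part.strip()]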
