I totally understand @DudaNogueira
but could you please help me out in this regard which I’m facing, I was using the default.
{'error': [{'message': "update vector: connection to: OpenAI API failed with status: 400 error: This model's maximum context length is 8192 tokens, however you requested 9655 tokens (9655 in your prompt; 0 for the completion). Please reduce your prompt; or completion length."}]}
{'error': [{'message': "update vector: connection to: OpenAI API failed with status: 400 error: This model's maximum context length is 8192 tokens, however you requested 9745 tokens (9745 in your prompt; 0 for the completion). Please reduce your prompt; or completion length."}]}
All of Openai’s embedding models currently max out 8192 tokens. Some open-source embedding models support larger context windows, but I’d suggest chunking your data and you’ll(probably) get better performance that way too.
as @JK_Rider mentioned, the issue is about passing too much context.
If you see this when vectorizing (which seems to be the case, considering the “update vector” part of the log), it is probably be because your chunks are too big to fit in that context windows.
However, if you see this while generating, you are probably passing too much objects (limit=X) to the generation step.
here are some other content on chunking. As you will soon discover, there isn’t a “one size fits all”, as it will depend on a lot of requirements.