Description
I have been using Weaviate's free 14-day cluster and am trying to generate OpenAI embeddings and store them in a Weaviate collection. OpenAI's website mentions a Batch API with higher rate limits, where the resulting embeddings are ready within 24 hours at 50% of the cost of real-time vector generation (my current real-time limit is 3 requests per minute).
Is there a way to configure the Weaviate vectorizer to use this OpenAI Batch API?
More details are here
Batch - OpenAI API
Server Setup Information
Client Language and Version: Python
Multitenancy: No
Hi @SmitNGRA! Welcome to our community!
AFAIK, we do not have this implemented. Yet.
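In the meantime, one workaround is to generate the embeddings yourself through the Batch API and import them into Weaviate as self-provided vectors, bypassing the vectorizer module. A minimal sketch of building the batch input file (the model name, file name, and sample texts are placeholders, not anything from your setup):

```python
import json

def build_embedding_batch_lines(texts, model="text-embedding-3-small"):
    """Build JSONL request lines for OpenAI's /v1/embeddings Batch API."""
    lines = []
    for i, text in enumerate(texts):
        request = {
            "custom_id": f"doc-{i}",  # used later to match results to inputs
            "method": "POST",
            "url": "/v1/embeddings",
            "body": {"model": model, "input": text},
        }
        lines.append(json.dumps(request))
    return lines

# Write the batch input file; you would then upload it with the OpenAI
# client (files.create with purpose="batch") and create the batch job.
with open("embedding_batch.jsonl", "w") as f:
    f.write("\n".join(build_embedding_batch_lines(["first chunk", "second chunk"])))
```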
I have found this related GitHub issue:
opened 02:54AM - 03 Apr 24 UTC
feature request
### Describe your feature request
With async indexing being released, the bottleneck to import data has moved to the vectorizer modules which often require 3rd party api calls to convert text/image chunks into vectors. There has been recent work https://github.com/weaviate/weaviate/pull/4546 https://github.com/weaviate/weaviate/pull/4578 to switch vectorizer modules to use batching wherever possible. We can further reduce import times by enabling async indexing at the module level as well.
At a high level:
- [ ] Switch to async indexing straight to disk https://github.com/weaviate/weaviate/pull/3974.
- [ ] If a module is enabled, async workers should read a batch of objects from disk (not vectors), use the module's `BatchVectorizer` to generate the vectors, and then write to the vector index as usual.
- [ ] There needs to be a solution for persistent failures including surfacing errors to users.
### Code of Conduct
- [X] I have read and agree to the Weaviate's [Contributor Guide](https://weaviate.io/developers/contributor-guide) and [Code of Conduct](https://weaviate.io/service/code-of-conduct)
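Until that lands, once a batch job completes you can download the output file, parse the vectors, and insert your objects with those vectors so Weaviate never calls OpenAI itself. A sketch, where the collection name and `source_id` property are hypothetical (the output-line shape is the Batch API's standard result envelope):

```python
import json

def parse_batch_embeddings(jsonl_text):
    """Map custom_id -> embedding from an OpenAI Batch API output file."""
    vectors = {}
    for line in jsonl_text.splitlines():
        if not line.strip():
            continue
        record = json.loads(line)
        body = record["response"]["body"]
        vectors[record["custom_id"]] = body["data"][0]["embedding"]
    return vectors

# With the vectors in hand, import them as self-provided vectors using the
# v4 Python client (connection details and collection name are placeholders):
#
# import weaviate
# client = weaviate.connect_to_weaviate_cloud(...)
# docs = client.collections.get("Document")
# with docs.batch.dynamic() as batch:
#     for custom_id, vector in vectors.items():
#         batch.add_object(properties={"source_id": custom_id}, vector=vector)
```

When a vector is supplied at import time, Weaviate stores it as-is instead of invoking the configured vectorizer, so the real-time rate limit never comes into play.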
Please leave your thumbs up there so we can track its popularity!
Thanks!