Hybrid Queries on new OpenAI Embedding Models failing server restart

D3x · February 23, 2024, 4:33am

@DudaNogueira I thought I would create a new thread for this issue rather than hijacking New OpenAI Embedding Models - #21 by SomebodySysop

The problem statement is that once a weaviate server is configured with an OpenAI vectorizer using the new model of text-embedding-3-large and dimensions of 1024, hybrid queries fails with a vector search: vector lengths don't match: 1024 vs 3072 error message upon a server reboot.

I was able to replicate this issue on codesandbox. This is using Weaviate v1.23.10 and python client 4.4.4.

Steps to reproduce

https://codesandbox.io/p/sandbox/interesting-morse-hgvggd
Sign-in using SSO of choice
Open up setup.py and query.py and update line 16 with an OpenAI API Key. As this is being done codesandbox will “seamlessly fork” to your own private sandbox. If the URL does not change, you may have to go back to the dashboard CodeSandbox, go to My drafts, and open the newly created sandbox.
Go to top left corner and select the “Restart Devbox” option. This should trigger sandbox initialization. Wait for container to be started and the pip -r requirements.txt job to complete.
Open up a new terminal in the center bottom pane.
Run the following in sequence:

docker compose down -v
docker compose up -d
python setup.py
python query.py

Note the following:
1. setup.py creates a collection and inserts a single object
2. The single object we stored in weaviate has a vector length of 1024, indicating vectorizer is working properly
3. We can fetch that object from weaviate, confirming that the inserted object is persisted
4. We can hybrid query from weaviate

Now run:

docker compose restart
python query.py

All we’ve done here is restart the weaviate container. Notice now that we can still fetch the inserted object (see output above the exception output), but now hybrid query fails with a vector length not matching error.

DudaNogueira · February 23, 2024, 12:21pm

Hi @D3x !

Thanks for reporting.

I will try to reproduce this on my end and get back to you!

D3x · February 27, 2024, 9:50pm

Hi @DudaNogueira were you able to reproduce this given the instructions?

DudaNogueira · February 28, 2024, 1:17pm

Hi! Sorry, I couldn’t get to it yet.

have you tried running this locally?

Those sandboxes usually has a lot of limitations that may affect it, so removing that component may give us a hint if the issue is on there on in the server.

D3x · February 29, 2024, 4:07am

@DudaNogueira yes this is reproducible locally.

The same behaviors as I noted above persists. A simple server restart makes hybrid queries fail which seems like a fairly serious problem. Would appreciate your team’s attention on this asap.

DudaNogueira · March 4, 2024, 4:41pm

Hi D3x!

Sorry for the delay here.

I was not able to reproduce this:

❯ python3 setup.py
UUID for new object created: 117a7993-a2aa-4847-9bd2-f69cbdac1160
fetch_objects: 117a7993-a2aa-4847-9bd2-f69cbdac1160 (1024) | Properties: {‘text’: ‘Some data’}
hybrid query: 117a7993-a2aa-4847-9bd2-f69cbdac1160 (1024) | Properties: {‘text’: ‘Some data’}
❯ python3 query.py
fetch_objects: 117a7993-a2aa-4847-9bd2-f69cbdac1160 (1024) | Properties: {‘text’: ‘Some data’}
hybrid query: 117a7993-a2aa-4847-9bd2-f69cbdac1160 (1024) | Properties: {‘text’: ‘Some data’}

Could we connect in Slack so I can take a closer look?

Thanks!

D3x · December 17, 2024, 11:41pm

Hi @DudaNogueira

I’ve recently looked into upgrading our local setup to 1.28 but when validating it failed the same Hybrid Query issue again. I recall it was resolved with your help on Slack, but I’m unable to view older messages to verify.

I’ve refreshed the demo repo to reproduce the issue: GitHub - d3xtemp/weaviate-issue. The local weaviate instance was initialized exactly as specified in the Weaviate docs Docker | Weaviate. Also, the issue now does not require a server restart to be demonstrated.

Again, a quick explanation of the issue is that I’ve used OpenAI’s text-embedding-3-large embedding model with a dimension of 1024 to create a collection. When I simply fetch objects from this collection, I can verify that these objects have a vector length of 1024 as expected. However, when I attempt hybrid queries against this collection, I receive the error message below.

Error: Query call with protocol GRPC search failed with message <AioRpcError of RPC that terminated with:
        status = StatusCode.UNKNOWN
        details = "explorer: get class: vector search: object vector search at index mycollection: shard mycollection_66Yf5V7XYzHQ: vector search: knn search: distance between entrypoint and query node: 1024 vs 1536: vector lengths don't match"
        debug_error_string = "UNKNOWN:Error received from peer  {grpc_message:"explorer: get class: vector search: object vector search at index mycollection: shard mycollection_66Yf5V7XYzHQ: vector search: knn search: distance between entrypoint and query node: 1024 vs 1536: vector lengths don\'t match", grpc_status:2, created_time:"2024-12-17T15:29:04.353870673-08:00"}"

Your help to confirm this issue and orchestrate a fix is appreciated.

DudaNogueira · December 17, 2024, 11:45pm

hi @D3x !!

Welcome back

You probably have your vectors stored with one dimensionality, and have the vectorizer of your collection configured to use different one.

You can get the collection configuration and check that:

collection.config.get().vectorizer_config

The solution here is to create a second collection (or on a different server), specifying the exact model and dimensions of your vectorized data, and migrate your data over.

There is a fairly easy migration guide here:

Let me know if this helps.

Thanks!

D3x · December 18, 2024, 12:23am

Hi @DudaNogueira , I’m unclear what you’re suggesting.

The repo I provided you demonstrates the problem in a fresh instance of weaviate, creates a collection from scratch, inserts a few records, and then attempts to hybrid query. No migration of data is needed to demonstrate the issue.

In weaviate-issue/setup.py at 7a9bfdd08791a33daad96c074c7fc2e90779c9a5 · d3xtemp/weaviate-issue · GitHub I’ve configured the vectorizer simply and yes with one dimensionality only. In this simple case, shouldn’t I be able be hybrid query without issue whenever I use a reference to that same collection (i.e. client.collections.get(collection_name).query.hybrid())?

DudaNogueira · December 18, 2024, 2:11pm

Oh Right! sorry! completely missed the repo

This is indeed a bug

for some reason, in this scenario, it is using the default module configuration.

This is the payload it will send:

client = weaviate.connect_to_local(
    headers={
         "X-OpenAI-Api-Key": os.getenv("OPENAI_APIKEY", "CHANGE_ME"),
         "X-OpenAI-BaseUrl": "https://webhook.site/beef60de-4d45-4c61-9928-b20fa619f91e",
    }
)
collection = client.collections.get("Test")

response = collection.query.hybrid(
    query="hybrid query with 1024 dimensions",
    alpha=0.75,
    limit=5,
    include_vector=True
)
for obj in response.objects:
    print(
        f"hybrid query: {obj.uuid} ({len(obj.vector['default'])}) | Properties: {obj.properties}")
    
# we get this payload
payload = {
  "input": [
    "hybrid query with 1024 dimensions"
  ],
  "model": "text-embedding-3-small",
  "dimensions": 1536
}

I have raised it internally.

Thanks you very much for raising this here.

D3x · December 18, 2024, 5:09pm

Thanks for confirming the issue!

Assuming that we have no visibility into the status of the internal issues, I would appreciate it if you can provide an update when it’s resolved and which upcoming versions would contain the fix. We are eager to stay on top of the releases.

DudaNogueira · December 18, 2024, 7:15pm

Sure. Our team is already looking into this.

As soon as they confirm, I’ll open a github issue so we can keep track of it.

I’ll update it here.

Thanks!

DudaNogueira · December 18, 2024, 8:16pm

hi @D3x !!

The issue is this one:

github.com/weaviate/weaviate

Hybrid search falling back to default vectorizer confs when dimensions is set

opened 08:15PM - 18 Dec 24 UTC

dudanogueira

bug

### How to reproduce this bug? Here is a reproducible code: ```python impor…t os import weaviate from weaviate import classes as wvc import weaviate.error_msgs client = weaviate.connect_to_local( headers={ "X-OpenAI-Api-Key": os.getenv("OPENAI_APIKEY", "CHANGE_ME"), } ) print(f"Client: {weaviate.__version__}, Server: {client.get_meta().get('version')}") client.collections.delete("Test") collection = client.collections.create( name="Test", vectorizer_config=wvc.config.Configure.Vectorizer.text2vec_openai( model="text-embedding-3-large", dimensions=1024, #type_="text", vectorize_collection_name=False ), properties=[ wvc.config.Property( name="text", data_type=wvc.config.DataType.TEXT, tokenization=wvc.config.Tokenization.WORD ) ] ) # Create a single object response = collection.data.insert( properties={ "text": "COVID-19 has many symptoms." } ) # objects indeed has 1024 dimensions response = collection.query.fetch_objects( limit=5, include_vector=True ) for obj in response.objects: print( f"fetch_objects: {obj.uuid} ({len(obj.vector['default'])}) | Properties: {obj.properties}") # you can perform a neartext response = collection.query.near_text( query="hybrid query with 1024 dimensions", #alpha=0.75, limit=5, include_vector=True ) for obj in response.objects: print( f"near text query: {obj.uuid} ({len(obj.vector['default'])}) | Properties: {obj.properties}") #but it fails to hybrid try: response = collection.query.hybrid( query="hybrid query with 1024 dimensions", alpha=0.75, limit=5, include_vector=True ) for obj in response.objects: print( f"hybrid query: {obj.uuid} ({len(obj.vector['default'])}) | Properties: {obj.properties}") except Exception as e: print("ERROR!!!", e) # if we close the client client.close() # and point it to a catch endpoint client = weaviate.connect_to_local( headers={ "X-OpenAI-Api-Key": os.getenv("OPENAI_APIKEY", "CHANGE_ME"), "X-OpenAI-BaseUrl": "https://webhook.site/beef60de-4d45-4c61-9928-b20fa619f91e", } ) collection = client.collections.get("Test") response = collection.query.hybrid( query="hybrid query with 1024 dimensions", #alpha=0.75, limit=5, include_vector=True ) for obj in response.objects: print( f"hybrid query: {obj.uuid} ({len(obj.vector['default'])}) | Properties: {obj.properties}") # we get this payload payload = { "input": [ "hybrid query with 1024 dimensions" ], "model": "text-embedding-3-small", "dimensions": 1536 } ``` ### What is the expected behavior? The hybrid search should work. It should generate the query vectorization payload as: ```json { "input": [ "hybrid query with 1024 dimensions" ], "model": "text-embedding-3-large", "dimensions": 1024 } ``` ### What is the actual behavior? The generated payload to vectorize a hybrid query is passing the wrong model and dimension as the payload: ```json { "input": [ "hybrid query with 1024 dimensions" ], "model": "text-embedding-3-small", "dimensions": 1536 } ``` ### Supporting information Client: 4.10.2, Server: 1.28.1 ### Server Version 1.28.1 ### Weaviate Setup Single Node ### Nodes count 1 ### Code of Conduct - [X] I have read and agree to the Weaviate's [Contributor Guide](https://weaviate.io/developers/contributor-guide) and [Code of Conduct](https://weaviate.io/service/code-of-conduct)

franciscoracosta · January 8, 2025, 11:30am

Any updates on this? Issue seems to be on server version 1.28 and in weaviate cloud it’s not possible to select a prior version

Jose-Coutinho_cmore · January 8, 2025, 11:31am

Hello @DudaNogueira,

What is the status or the bug fix?
We are attempting to deploy our application using the paid serverless option, but the server version is locked to 1.28.2 (we can’t use the 1.27.0 we used to test locally) and are therefore unable to deploy anything!!

Please let me know if you have any suggestion to go around this issue, even if its temporary.

DudaNogueira · January 8, 2025, 1:58pm

hi there @Jose-Coutinho_cmore !! Welcome to our community

This seems like a popular issue

I have pinged our team again so we can prio this.

Thanks!

Topic		Replies	Views
Distance between entrypoint and query node Support	2	205	December 20, 2024
New OpenAI Embedding Models Support	20	3085	February 21, 2024
Hybrid search raises an API Key error when used with azure-openai Support bug , python	4	241	March 27, 2025
Wrong retrieval results with near_vector and hybrid search Support	1	146	June 27, 2024
Help Needed: Resolving WeaviateQueryError with Nil or Zero-Length Vector at docID 715 Support	18	927	May 11, 2024

Hybrid Queries on new OpenAI Embedding Models failing server restart

Steps to reproduce

Related topics