Hi, I have a Python script that creates a collection (on a sandbox cluster) using the Voyage AI voyage-multimodal-3 model. The collection is created successfully on the sandbox and objects are inserted, generating the vectorized embeddings for the text content. The relevant code:
# Config helpers from the Python client v4
from weaviate.classes.config import (
    Configure,
    DataType,
    Multi2VecField,
    Property,
    VectorDistances,
    VectorFilterStrategy,
)

# Define properties for multimodal data
properties = [
    Property(name="title", data_type=DataType.TEXT, description="File generated title"),
    Property(name="classification", data_type=DataType.TEXT, description="File generated classification"),
    Property(name="tags", data_type=DataType.TEXT_ARRAY, description="File generated tags"),
    Property(name="file_name", data_type=DataType.TEXT, description="Original file name"),
    Property(name="file_type", data_type=DataType.TEXT, description="Original file MIME type"),
    Property(name="content_type", data_type=DataType.TEXT, description="'text' or 'image'"),
    Property(name="chunk_index", data_type=DataType.INT, description="Index of the chunk within the file"),
    # Store text content directly
    Property(name="text_content", data_type=DataType.TEXT, description="Text chunk content", skip_vectorization=False),  # Make sure this is vectorized
    # Store images as base64-encoded blobs
    Property(name="image_content", data_type=DataType.BLOB, description="Base64 encoded image content", skip_vectorization=False),  # Ensure this is vectorized too
]

client.collections.create(
    name=collection_name,
    # Configure the multimodal vectorizer using Voyage AI multimodal
    vectorizer_config=[
        Configure.NamedVectors.multi2vec_voyageai(
            model="voyage-multimodal-3",
            name="namedvector-name",
            # Define the fields to be used for the vectorization - image_fields and text_fields
            image_fields=[
                Multi2VecField(name="image_content", weight=0.25),
            ],
            text_fields=[
                Multi2VecField(name="text_content", weight=0.75),
            ],
            # Configure the vector index (HNSW is the default and usually a good choice)
            vector_index_config=Configure.VectorIndex.hnsw(
                quantizer=Configure.VectorIndex.Quantizer.bq(),
                distance_metric=VectorDistances.COSINE,  # Cosine is common for embeddings
                filter_strategy=VectorFilterStrategy.SWEEPING,  # or ACORN (available from Weaviate v1.27.0)
            ),
        )
    ],
    multi_tenancy_config=Configure.multi_tenancy(enabled=False),  # Assuming single tenancy for now
    properties=properties,
)
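The objects are then inserted roughly like this (a simplified sketch rather than the exact script; `chunks` and its fields are illustrative, and only the text path is shown):

collection = client.collections.get(collection_name)

# Batch-insert the chunks produced for one file (`chunks` is a list of dicts from the chunking step)
with collection.batch.dynamic() as batch:
    for i, chunk in enumerate(chunks):
        batch.add_object(
            properties={
                "title": chunk["title"],
                "classification": chunk["classification"],
                "tags": chunk["tags"],
                "file_name": chunk["file_name"],
                "file_type": chunk["file_type"],
                "content_type": "text",
                "chunk_index": i,
                "text_content": chunk["text"],
                # Image chunks use content_type="image" and put a base64-encoded
                # string into image_content instead of text_content.
            }
        )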
When fetching a list of objects from the collection on the Next.js frontend, just for testing, everything works fine and I get the chunked objects back, using the following code:
const collection = weaviateClient.collections.get('collection-name');
const result = await collection.query.fetchObjects({
  limit: 1,
  returnProperties: ['title', 'tags', 'text_content'],
});
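For reference, the weaviateClient above is created roughly like this (simplified; the environment variable names are placeholders):

import weaviate, { WeaviateClient } from 'weaviate-client';

// Simplified client setup on the Next.js server side
const weaviateClient: WeaviateClient = await weaviate.connectToWeaviateCloud(
  process.env.WEAVIATE_URL!,
  {
    authCredentials: new weaviate.ApiKey(process.env.WEAVIATE_API_KEY!),
    // Not sure if the Voyage AI key also has to be forwarded here for nearText,
    // e.g. headers: { 'X-VoyageAI-Api-Key': process.env.VOYAGEAI_API_KEY! }
  }
);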
But when I try to query with nearText (also providing the named vector name), it doesn’t work! It just throws an error that I catch in a try/catch block. I tried the following:
const result = await collection.query.nearText('any search text', {
  limit: 2,
});
for (const object of result.objects) {
  console.log(JSON.stringify(object.properties, null, 2));
  // console.log(JSON.stringify(object.metadata?.distance, null, 2));
}
And the same query with the target vector specified explicitly:
const result = await collection.query.nearText('any search text', {
  targetVector: 'namedvector-name',
  limit: 2,
});
for (const object of result.objects) {
  console.log(JSON.stringify(object.properties, null, 2));
  // console.log(JSON.stringify(object.metadata?.distance, null, 2));
}
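Both attempts are wrapped in a try/catch like the sketch below, and only the catch branch ever runs (returnMetadata is there so I could at least inspect distances if it succeeded):

try {
  const result = await collection.query.nearText('any search text', {
    targetVector: 'namedvector-name',
    limit: 2,
    returnMetadata: ['distance'],
  });
  for (const object of result.objects) {
    console.log(JSON.stringify(object.properties, null, 2));
    console.log(object.metadata?.distance);
  }
} catch (err) {
  // This is the only branch that ever executes
  console.error('nearText failed:', err instanceof Error ? err.message : err);
}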
What am I doing wrong? Why can I fetch all objects directly but can’t use nearText? Am I configuring the collection incorrectly for nearText or nearVector search? Thank you!
Server Setup Information
- Weaviate Server Version: 1.30.0
- Deployment Method: localhost connecting to Weaviate Cloud Sandbox
- Multi Node?: No
- Client Language and Version: Python & Next.js (JS/TS Client v3)
- Multitenancy?: False