How do I store just the vector not the image

lored · February 17, 2024, 11:19pm

Description

I am building a image search engine with text queries.
I am planning to have two fields in my img collection: id and img_url. I’m using multi2vec-clip model and currently feed each image as base64 encoding into the database. I’d like to store just the vector representation of each image and get the most similar entries.
Could any of you legends give me some pointers? Much appreciated.

Server Setup Information

Weaviate Server Version: v1.23.9
Deployment Method: docker
Multi Node? Number of Running Nodes: 1
Client Language and Version: python v4

Any additional Information

rjalex · February 19, 2024, 7:48am

One possibility would be not configuring a vectorizer for the collection and doing your manual vectorization of the image and just store that (along with the other data you need).

sebawita · February 19, 2024, 7:42pm

Hi @lored,
If you are using multi2vec-clip then Weaviate will also store the base64 representation of the image.

The only workaround is to generate the vector manually, as suggested by @rjalex.

Github issue

I’ve created a GitHub issue with a similar request. Please upvote, as that might help move it closer to the front of the queue

lored · February 20, 2024, 1:47am

thanks guys, I just created another collection, imported my images there and copied the vectors over to the vector only collection.

Topic		Replies	Views
Multi2vec-clip without storing image Support	2	429	May 8, 2024
How to store data in the weaviate vector DB Support	6	279	October 1, 2024
Issue while inserting the image in weaviate Support integration , developer-experience	2	186	July 6, 2024
[Question] Multimodal search Support technical	3	170	August 27, 2024
How to provide embeddings instead of Weavite calculating embeddings using t2v-transformers? Reason for doing that is to avoid storing content in weavite? Support	1	625	August 8, 2023

How do I store just the vector not the image

Description

Server Setup Information

Any additional Information

Github issue

Related topics