I am building a image search engine with text queries.
I am planning to have two fields in my img collection: id and img_url. I’m using multi2vec-clip model and currently feed each image as base64 encoding into the database. I’d like to store just the vector representation of each image and get the most similar entries.
Could any of you legends give me some pointers? Much appreciated.
One possibility would be not configuring a vectorizer for the collection and doing your manual vectorization of the image and just store that (along with the other data you need).