Suggestion for an image caption similarity exercise

rjalex · January 9, 2024, 8:55am

Dear experts,
I am really beginning to explore this fascinating world so please bear with my naivety.
I have around 200.00 image captions from a newspaper and want to test if I can retrieve similar captions given one of them. The captions are in Italian. I do have an OpenAI key but would vastly prefer to find embeddings from free/open models.

Just to try the first time I did successfully started a Weaviate container on Linux but can you suggest which configuration I should strive to build?

Thank you very much.

DudaNogueira · January 15, 2024, 6:20pm

Hi @rjalex !

Are those the image only, the text only, or both?

If mixing modalities (text + image) you can check this cool workshop we did last year:

Also, there is this nice project here:

Let me know if that helps

rjalex · January 16, 2024, 12:36pm

Will definitely take a look but for now my case would start with a simple text similarity, but multimodality could be cool to explore.

Topic		Replies	Views
Best way of using DINOv2 image similarity in Weavite General technical	2	37	February 6, 2026
Support for Voyage embedding multimodal model General technical	4	444	December 4, 2024
Custom Model integration Instead of CLIP Support	3	449	October 7, 2024
RAG with image search General	1	333	August 3, 2024
Text2vec-openai Batch API Support integration , wcs , python	1	565	July 8, 2024

Suggestion for an image caption similarity exercise

Related topics