We are beginning to try out multimode as we will need to soon upload images. For now we only have text. We have simply replaced the text vec with multi but the performance is really really slow. Text takes around 2-3 sec for our test data, but multi takes about 20s - should this be the case?
Hi @systemz - and welcome!
The one big difference is that text2vec-openai
sends data to OpenAI to get vectors, but multi2vec-clip
is using a local inference container (i.e. generating vectors locally). This is probably why you’re seeing a big difference in performance.
If you have an nVidia GPU you should be able to get a pretty sizeable boost with multi2vec
by enabling CUDA. (see multi2vec-clip | Weaviate - vector database)
Hey @systemz sorry I didn’t see this.
I suspect you mean managed instance as in someone else is running your Weaviate cluster, as WCS doesn’t support multi2vec-clip
. Are you able to ask whomever is running that cluster whether it has a GPU?