I wanted to know if Weaviate has any plans to support ColBERT. I saw a post about this a little while ago (Weaviate & ColBERTv2? - #3 by Draco); unfortunately, the multiple-named-vector approach mentioned there isn't a great fit, as the number of ColBERT vectors depends on the number of tokens given (i.e., it is not static).
hi @JK_Rider !!
Have you seen this recipe?
I am not sure about ColBERTv2 support itself. I saw our team discussing something on this; I will ping them.
Thanks!
Hi @DudaNogueira ,
Took a look at the recipe. The issue I see is that the recipe assumes the number of vectors generated is static, since you're dealing with an image of a fixed size. With ColBERTv2, the number of vectors per document is based on the number of tokens in the text string, which is usually not static.
Hey both, thanks so much for sharing this notebook @DudaNogueira!
Hey @JK_Rider, could you please point me to a more specific passage where this is mentioned? ColBERT / v2 / PLAID variants will all zero-pad queries and documents to have a fixed input length as far as I understand.
→ I think the key innovation in subsequent ColBERT works is compressing the vectors along the length dimension with forms of low-rank decompositions or maybe PCA – I think the discrete PQ-style methods won out in PLAID.
It does make sense to think the variable-length decoding stuff could make its way into embedding models, but I haven’t seen too many examples of this outside of maybe Cohere Compass.
Quote: The ColBERT v2.0 library transforms a text chunk into a matrix of token-level embeddings. The output is a 128-dimensional vector for each token in a chunk. This results in a two-dimensional matrix, which doesn’t align with the current LangChain interface that outputs a list of floats.
Link: Introduction to ColBERT | RAGStack | DataStax Docs
The passage you quoted refers to the dimensions of the individual vectors, but the issue is the number of vectors. ColBERT creates vectors based on the number of tokens in a document, i.e., it is highly variable.
I can also provide code samples from a Jupyter notebook if that would be of assistance.
Hey @JK_Rider, yes the notebook would be super helpful.
From this reference – point #2 supports my initial assumption that queries and documents are zero-padded to a fixed length:
“2. BERT manages this additional depth by pre-processing documents and queries into uniform lengths with the Wordpiece tokenizer, ideal for batch processing on GPUs.”
So, for example, if a document has 30 tokens, you zero-pad it to 512 tokens. You then apply an attention mask so the gradient only flows to those original 30 tokens, but the input still needs to be 512 long, since BERT models expect a fixed-length input. This is a key distinction between encoder-only versus decoder-only or hybrid encoder-decoder / seq2seq models.
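To make the padding behavior concrete, here is a minimal sketch using the Hugging Face `transformers` tokenizer (the `bert-base-uncased` checkpoint and the 512 max length are just illustrative choices, not tied to a specific ColBERT release):

```python
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")

encoded = tokenizer(
    "a short query with only a handful of tokens",
    padding="max_length",   # zero-pad up to the fixed input length
    truncation=True,
    max_length=512,
    return_tensors="pt",
)

print(encoded["input_ids"].shape)       # torch.Size([1, 512]) – always the padded length
print(encoded["attention_mask"].sum())  # number of real (non-padding) tokens
```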
Additional note: we are on this and you can expect something soon.
Please follow: HNSW Multi-value vectors (colbert) · Issue #4278 · weaviate/weaviate · GitHub
@CShorten
Ex 1: Using BGE-M3 with Flag Embedding
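A minimal sketch of this approach, assuming the `FlagEmbedding` package's `BGEM3FlagModel` API (the sample documents are placeholders):

```python
from FlagEmbedding import BGEM3FlagModel

# BGE-M3 can return dense, sparse, and ColBERT-style multi-vector embeddings
model = BGEM3FlagModel("BAAI/bge-m3", use_fp16=True)

docs = [
    "Weaviate is an open-source vector database.",
    "ColBERT produces one embedding per token, so longer documents yield more vectors.",
]

output = model.encode(
    docs,
    return_dense=False,
    return_sparse=False,
    return_colbert_vecs=True,  # token-level (multi-vector) embeddings
)

# Each document gets its own (num_tokens, dim) matrix; the row count varies per document
for doc, vecs in zip(docs, output["colbert_vecs"]):
    print(f"{doc[:40]!r}... -> {vecs.shape}")
```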
Ex 2: Using JinaColbert with official colbert repo
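And a sketch of the JinaColBERT variant, assuming the official `colbert-ai` package; the `jinaai/jina-colbert-v1-en` checkpoint name, config values, and sample texts here are illustrative:

```python
from colbert.infra import ColBERTConfig
from colbert.modeling.checkpoint import Checkpoint

# Load a ColBERT-style checkpoint; doc_maxlen is the padded document length
config = ColBERTConfig(doc_maxlen=512)
checkpoint = Checkpoint("jinaai/jina-colbert-v1-en", colbert_config=config)

docs = [
    "A short passage.",
    "A noticeably longer passage that will be tokenized into many more pieces than the first one.",
]

# keep_dims=False returns one unpadded (num_tokens, 128) matrix per document
doc_embeddings = checkpoint.docFromText(docs, keep_dims=False)
for emb in doc_embeddings:
    print(emb.shape)  # the first dimension differs per document
```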
If you need any more examples or some more context, let me know.