Different query weightings on properties inside one collection

Lawrence_Hope · February 13, 2024, 7:25pm

For example, I have title, description and review in one book collection. When do the near text query, is there any way to assign different weights of each field? like title is 5, description is 2, review is 1.

I know there is rerank, but the weighting results will get the CORRECT set of first query results. I think this is better than increase the limit range to 3 or 5 times then use rerank to focus back on the best query results.

Thanks,

sebawita · February 15, 2024, 11:11am

Hi @Lawrence_Hope,

Yes, Weights for keyword-based queries

You can add weights to a query that uses keyword search – bm25, or hybrid (which combines keyword and vector search).

Here is a python example from docs - hybrid search:

jeopardy = client.collections.get("JeopardyQuestion")
    response = jeopardy.query.hybrid(
        query="food",
        query_properties=["question^2", "answer"],
        alpha=0.25,
        limit=3
    )

The key element is the ^2 part, which tells Weaviat to 2x the score for a match on that property. You can add more of these values:

query_properties=["title^3", "genre", "author^2"],

No weights for vector search (yet… see below)

However, for vector search like near_text, you cannot give extra weights for matching to specific properties.
This is because, each object has one vector embedding. When you run a vector search, there is no way to distinguish between a match on a specific field, as we search on the whole embedding.

NamedVectors coming soon

As an FYI, we are working to add support for multiple vector embeddings per object, so you could have a separate embedding for a title, and another one for description, and a third one that combines multiple properties.

Note, in the first release, we will only allow you to search on one named vector, so you won’t be able to add weights to it just yet. But we are planning to add mixed vector search next.

I hope this helps

Lawrence_Hope · February 22, 2024, 6:00pm

Great to know this feature is in plan. This will help Weaviate jumps out from other vector DB.

Topic		Replies	Views
Keyword Weighting Explanation Support	1	188	May 31, 2024
Changing keyword weight of only one out of 30 properties Support python	4	189	November 29, 2024
How can we make hybrid search results more predictable? Support	8	1187	November 4, 2023
How do I improve hybrid search on Weaviate? Been poking at this for too long but haven't made much headway General	2	845	April 23, 2024
How to manage the merging of an hybrid query on a property and a BM25 on another General	2	271	May 15, 2024

Different query weightings on properties inside one collection

Yes, Weights for keyword-based queries

No weights for vector search (yet… see below)

NamedVectors coming soon

Related topics