URGENT: Filtering Retrieval Search in Weaviate Based on Tenant-Specific Uploaded Files

AmanAda · January 16, 2024, 5:35am

I am currently working on a critical project that involves implementing a multi-tenant system using Weaviate for RAG, and I’m faced with a challenge related to filtering retrieval searches based on certain uploaded files for each tenant.

Specifically, I would like to understand how I can configure Weaviate to allow retrieval searches that are filtered to specific files uploaded by a tenant. Are there specific query parameters or configurations that I need to consider to work with langchain?

For example, a tenant has 10 files and I want to search only 5 of them, how can I do it?

Any guidance or examples on how to achieve this would be greatly appreciated. I want to ensure that the retrieval search results are scoped to the relevant files associated with the selected tenant in the system.

sebawita · January 16, 2024, 11:45am

Hi @AmanAda,
I am not an expert in langchain, but I can speak from a Weaviate level.

Overall, you don’t need to configure Weaviate in any special way. You only need a property with a file_name (or some other identificator), and you can filter using contains_any.

Get the tenant

First, as part of your query, you need to specify the tenant you are searching on. With the Python client, this looks like this:

my_collection = client.collections.get("MyCollectionName")

# Get the specific tenant's version of the collection
my_tenant = my_collection.with_tenant("tenant_name")

See more here.

Filter (contains_any) on files

Then, you need to run a query on the tenant with a filter using contains_any, which should contain all the files you want to search on. Here is an example with a Weaviate Python client:

import weaviate.classes as wvc

response = my_tenant.query.near_text(
    query="search term here",
    filters=wvc.Filter("file_name").contains_any(["file1", "file2", "file3"]),
    limit=4
)

Topic		Replies	Views
Need help combining weaviate with langchain Support	8	3051	April 5, 2024
Can we add multiple tenants in a vector similarity search Support	2	570	January 22, 2024
Langchain WeaviateHybridSearchRetriever with filters? Support	7	1150	August 13, 2024
Issue regarding collections in weaviate Support python	3	411	March 3, 2025
Searching across multiple tenancies Support	2	510	September 13, 2024

URGENT: Filtering Retrieval Search in Weaviate Based on Tenant-Specific Uploaded Files

Get the tenant

Filter (contains_any) on files

Related topics