Data model feasibility with extensive array filtering

Hello, I'm currently in the process of selecting a vector DB for my use case:

Tens of millions of dense + sparse vectors for hybrid search with extensive metadata filtering.

I expect to have a primary int array field with ~300 numbers on average, plus a few other indexed scalar fields that would be used together.

I tried Qdrant and it took more than a week on a 32-core machine to index the data. Is this something Weaviate could potentially do better? Is there any way to estimate the indexing/recall performance and memory usage?

Hi @Fogapod!!

Welcome to our community :hugs:

We do have some calculations for how much memory your vectors will use:

https://docs.weaviate.io/weaviate/concepts/resources
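As a quick sketch of the kind of estimate that page describes: the rule of thumb there is roughly 2× the raw vector size to account for the HNSW index and runtime overhead (treat the factor as an assumption to validate, not a guarantee; the function name and the 768-dimension example below are illustrative, not from your setup):

```python
def estimate_vector_memory_gib(num_vectors: int,
                               dimensions: int,
                               bytes_per_float: int = 4,
                               overhead_factor: float = 2.0) -> float:
    """Rough in-memory footprint of an HNSW vector index.

    overhead_factor=2.0 follows the ~2x rule of thumb from the
    resource planning docs; raw float32 vectors are 4 bytes/dim.
    """
    raw_bytes = num_vectors * dimensions * bytes_per_float
    return raw_bytes * overhead_factor / 1024**3

# Example: 50 million 768-dimensional float32 vectors
print(f"{estimate_vector_memory_gib(50_000_000, 768):.0f} GiB")  # → 286 GiB
```

Note this only covers the dense vectors themselves; the inverted indexes for your int array field and other filterable scalars add to this, which is harder to estimate up front.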

However, indexing performance will depend on the data types, replication, and sharding, so I believe it would be better to run a test first :thinking:

So some experimentation will be necessary to map out the changes you can make to improve performance.

Let me know if this helps!