Why did enabling PQ significantly reduce recall? (version 1.23.7)

fairymane · February 27, 2024, 2:57am

Description

Hey Weaviate community, I am benchmarking vector search recall on a dataset of 10M vectors, each with 768 dimensions, on Weaviate server version 1.23.7.
I compared recall with Product Quantization (PQ) enabled and disabled. The only config difference when PQ is enabled is:

'pq': {'enabled': True, 'bitCompression': False, 'segments': 96, 'centroids': 256, 'trainingLimit': 100000, 'encoder': {'type': 'kmeans', 'distribution': 'log-normal'}}},

Or, when PQ is disabled:

'pq': {'enabled': False, …}

However, recall dropped significantly with PQ enabled; see the attached screenshot for reference. This result is inconsistent with the Weaviate blog post on PQ.
Could anyone provide some insight into why enabling PQ has such a significant impact?
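
For readers who want to reproduce this kind of comparison, here is a minimal sketch of how recall@k is commonly computed. It is not the script used in this thread. It assumes you already have, for each query, the exact top-k neighbour IDs (for example from a brute-force search) and the IDs returned by the index; the function name `recall_at_k` is made up for illustration.

```python
def recall_at_k(ground_truth_ids, retrieved_ids, k):
    """Mean fraction of the true top-k neighbours found in the retrieved top-k.

    ground_truth_ids: one list of exact neighbour IDs per query
    retrieved_ids:    one list of IDs returned by the index per query
    """
    if len(ground_truth_ids) != len(retrieved_ids):
        raise ValueError("need one result list per query in both inputs")
    hits = 0
    for truth, found in zip(ground_truth_ids, retrieved_ids):
        # Count how many of the true neighbours the index actually returned.
        hits += len(set(truth[:k]) & set(found[:k]))
    return hits / (k * len(ground_truth_ids))


# Two queries: the first is answered perfectly, the second misses everything.
print(recall_at_k([[1, 2], [3, 4]], [[1, 2], [5, 6]], k=2))
```

Running the same queries against the PQ and non-PQ collections and comparing the two numbers gives a like-for-like recall figure.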

Server Setup Information

Weaviate Server Version: 1.23.7
Deployment Method: k8s through helm chart
Multi Node? Number of Running Nodes: 1
Client Language and Version: Python client 3.21.0

Any additional Information

DudaNogueira · February 28, 2024, 5:51pm

Hi @fairymane! Welcome to our community

This is very interesting. I will relay that to our team.

Have you tried with a higher segments parameter?

Thanks!
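
To see what a higher segments value means for the numbers in this thread, here is a rough back-of-the-envelope sketch. It assumes float32 source vectors and that each segment is stored as a whole number of bytes (one byte for 256 centroids); those storage details are assumptions for illustration, and `pq_footprint` is a hypothetical helper, not a Weaviate API.

```python
def pq_footprint(dimensions, segments, centroids=256, float_bytes=4):
    """Estimate per-vector size before and after product quantization."""
    if dimensions % segments != 0:
        raise ValueError("segments should divide the number of dimensions evenly")
    # Bits needed to address one centroid, rounded up to whole bytes per segment.
    bits_per_code = (centroids - 1).bit_length()
    bytes_per_code = (bits_per_code + 7) // 8
    raw_bytes = dimensions * float_bytes
    code_bytes = segments * bytes_per_code
    return {
        "dims_per_segment": dimensions // segments,
        "raw_bytes": raw_bytes,
        "code_bytes": code_bytes,
        "compression_ratio": raw_bytes / code_bytes,
    }


# The configuration from the first post: 768 dimensions, 96 segments.
print(pq_footprint(768, 96))
# One segment per dimension, for comparison.
print(pq_footprint(768, 768))
```

With 96 segments, each code summarises 8 dimensions and a vector shrinks from 3072 bytes to 96 bytes under these assumptions. More segments means fewer dimensions per code, so less compression but typically less quantization error.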

fairymane · February 28, 2024, 6:20pm

@DudaNogueira
With “segments”: 0 and PQ enabled, the result is actually not bad, as shown in the attached screenshot.

The only issue is that changing this configuration requires migrating the data to a new collection with the new settings, which is time-consuming.

DudaNogueira · February 28, 2024, 7:54pm

Yes, that’s AutoPQ kicking in: it evaluates your vectors and works out the best number of segments to train your data with.

What was the resulting segment value in that case?

Thanks!

fairymane · March 1, 2024, 12:56am

I just configured “segments”: 0 at collection creation time.

How do I find out the resulting segment value?
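
One place to look is the collection's schema as returned by the server. The sketch below pulls `vectorIndexConfig.pq.segments` out of a class definition dict. Whether the server writes the resolved value back into the schema after `segments: 0` is an assumption that has not been confirmed in this thread. The helper `pq_segments_from_class`, the URL, and the class name `MyCollection` are placeholders.

```python
def pq_segments_from_class(class_definition):
    """Return vectorIndexConfig.pq.segments from a class definition, or None."""
    pq = class_definition.get("vectorIndexConfig", {}).get("pq", {})
    return pq.get("segments")


# Possible usage with the v3 Python client (URL and class name are placeholders):
#   import weaviate
#   client = weaviate.Client("http://localhost:8080")
#   print(pq_segments_from_class(client.schema.get("MyCollection")))

# Offline example using a hand-written class definition.
example = {
    "class": "MyCollection",
    "vectorIndexConfig": {"pq": {"enabled": True, "segments": 96}},
}
print(pq_segments_from_class(example))
```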
