Description
Hey Weaviate community, I am benchmarking the recall of vector search for a dataset of 10M, with each vector dimension as 768
, Weaviate server version 1.23.7
I compared the impact on recall with Product Quantization(PQ) enabled/ disabled, the only different config change when pq is enabled:
‘pq’: {‘enabled’: True, ‘bitCompression’: False, ‘segments’: 96, ‘centroids’: 256, ‘trainingLimit’: 100000, ‘encoder’: {‘type’: ‘kmeans’, ‘distribution’: ‘log-normal’}}},
Or when pq is disabled,
‘pq’: {‘enabled’: False, …}
However, the recall result reduced significantly when the PQ enabled, see attached screen for reference. The result is inconsistent with this weaviate blog on PQ.
Could anyone provide some insight on why such significantly impact when the PQ enabled?
Server Setup Information
- Weaviate Server Version: 1.23.7
- Deployment Method: k8s through helm chart
- Multi Node? Number of Running Nodes: 1
- Client Language and Version: Python client 3.21.0