How to plan the HNSW index parameters ef, efConstruction and maxConnections with PQ?

DevelMyCry · January 6, 2025, 9:53am

Weaviate's search performance drops when product quantization (PQ) is enabled. I need to choose HNSW index parameters that keep the loss of accuracy minimal while giving higher performance than HNSW without compression and with the current (unmodified) parameters.

Weaviate Server Version: 1.26.0
Deployment Method: Docker-compose
Standalone

Current HNSW parameters for two named vectors:
Vector A:
ef = 320, efConstruction = 320, maxConnections = 100.
Vector Length = 768.

Vector B:
ef = 480, efConstruction = 480, maxConnections = 120

Average search QPS without PQ: 145 obj/s
Average search QPS with PQ (segments=6): 75 obj/s. Other values of segments do not give a higher QPS, and often give a lower one. The dataset had 5 million vectors and the PQ training set had 100,000 to 150,000 objects. We expect to store 20+ million vectors.
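To put segments=6 in perspective, here is a rough sketch of the arithmetic for Vector A (768 dimensions). It assumes float32 source vectors and 256 centroids per segment (one byte per code), which I believe is the Weaviate default but have not verified for this setup. It also assumes that segments must divide the vector length evenly. The function name `pq_footprint` is made up for illustration, and the length of Vector B is not given above, so only Vector A is shown.

```python
def pq_footprint(dimensions: int, segments: int, centroids: int = 256) -> dict:
    """Estimate per-vector storage before and after PQ.

    Assumptions (not confirmed for this deployment): float32 inputs,
    one code per segment, and `centroids` centroids per segment.
    """
    if dimensions % segments != 0:
        raise ValueError("segments must divide dimensions evenly")

    # Bits needed to address one centroid, rounded up to whole bytes.
    bits_per_code = max(1, (centroids - 1).bit_length())
    bytes_per_code = (bits_per_code + 7) // 8

    raw_bytes = dimensions * 4                 # float32 = 4 bytes per dimension
    compressed_bytes = segments * bytes_per_code

    return {
        "dims_per_segment": dimensions // segments,
        "raw_bytes": raw_bytes,
        "compressed_bytes": compressed_bytes,
        "ratio": raw_bytes / compressed_bytes,
    }


if __name__ == "__main__":
    # Vector A from the post: 768 dimensions, segments=6.
    print(pq_footprint(768, 6))
    # A less aggressive setting for comparison.
    print(pq_footprint(768, 96))
```

Under these assumptions each segment has to summarize 128 dimensions in a single byte, so segments=6 is a very aggressive setting. How that translates into recall and QPS still has to be measured on the real data.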

Why does vector compression degrade performance so much? I suspect that the HNSW parameters need to be adjusted, but it is not yet clear how to keep the balance between accuracy and speed, or how much the parameters should change for PQ…
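One way to compare parameter sets is to measure recall and QPS together for each configuration, against exact (brute-force) results for the same queries. Below is a minimal sketch in plain Python. `search_fn` is a hypothetical placeholder for whatever function runs a query against the collection and returns a list of object IDs; it is not a Weaviate API, and the ground truth has to be computed separately.

```python
import time
from typing import Callable, Hashable, Sequence


def recall_at_k(approx_ids: Sequence[Hashable],
                exact_ids: Sequence[Hashable],
                k: int) -> float:
    """Fraction of the exact top-k that also appears in the approximate top-k."""
    if k <= 0:
        raise ValueError("k must be positive")
    return len(set(approx_ids[:k]) & set(exact_ids[:k])) / k


def benchmark(search_fn: Callable[[object, int], Sequence[Hashable]],
              queries: Sequence[object],
              ground_truth: Sequence[Sequence[Hashable]],
              k: int = 10) -> dict:
    """Run all queries sequentially; report QPS and mean recall@k.

    `search_fn(query, k)` is a placeholder supplied by the caller.
    """
    if not queries or len(queries) != len(ground_truth):
        raise ValueError("queries and ground_truth must be non-empty and equal in length")

    start = time.perf_counter()
    results = [search_fn(q, k) for q in queries]
    elapsed = time.perf_counter() - start

    recalls = [recall_at_k(r, gt, k) for r, gt in zip(results, ground_truth)]
    return {
        "qps": len(queries) / elapsed if elapsed > 0 else float("inf"),
        "mean_recall": sum(recalls) / len(recalls),
    }
```

Running this once per combination of ef, maxConnections and segments (with and without PQ) gives a table of recall against QPS, which makes the trade-off visible instead of guessing how far each parameter should move.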

DudaNogueira · January 6, 2025, 6:54pm

Hi @DevelMyCry !!

First, any specific reason to use 1.26.0?

The latest release on this branch is 1.26.13, and those 13 patch releases bring all the fixes we backported from 1.28.

Of course, 1.28 will probably get you even better results. For example, you could try ACORN, which can help with specific queries.

By the way, not sure if you saw this article in our academy:

there is some interesting information there.

Thanks!
