Rotational Quantization moderate compression

Hello,

I recently read the excellent blog post on 8-bit Rotational Quantization, which showcases its effectiveness. The article includes a table comparing the key metrics across the low, moderate, and high compression levels.

However, I noticed in the documentation that the 4-bit “moderate compression” level does not currently seem to be available in Weaviate; only the low (8-bit) and high (1-bit) levels are implemented. I would love to be able to use it in my setup, as it looks like a great sweet spot between space usage and recall.
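For context, here is a minimal NumPy sketch of the general idea behind rotational quantization: apply a random orthogonal rotation to spread information evenly across dimensions, then scalar-quantize each rotated component. This is only an illustration of the technique, not Weaviate's actual implementation; all function names and the per-vector min/max scaling are my own assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

def random_rotation(dim, rng):
    # QR decomposition of a Gaussian matrix gives a random orthogonal matrix;
    # fixing the signs via diag(R) makes the distribution uniform (Haar).
    q, r = np.linalg.qr(rng.normal(size=(dim, dim)))
    return q * np.sign(np.diag(r))

def rq_encode(v, rot, bits=8):
    # Rotate, then quantize each component to 2**bits levels over the
    # vector's own [min, max] range (illustrative choice of scaling).
    levels = 2**bits - 1
    r = rot @ v
    lo, hi = r.min(), r.max()
    codes = np.round((r - lo) / (hi - lo) * levels).astype(np.uint8)
    return codes, lo, hi

def rq_decode(codes, lo, hi, rot, bits=8):
    # De-quantize, then undo the rotation (orthogonal: inverse = transpose).
    levels = 2**bits - 1
    r = codes.astype(np.float64) / levels * (hi - lo) + lo
    return rot.T @ r

dim = 128
rot = random_rotation(dim, rng)
v = rng.normal(size=dim)
codes, lo, hi = rq_encode(v, rot)   # 8-bit codes: 1 byte per dimension
v_hat = rq_decode(codes, lo, hi, rot)
print(np.abs(v - v_hat).max())      # small reconstruction error
```

A 4-bit variant would simply pass `bits=4` (16 levels), halving storage again at the cost of larger reconstruction error, which is why the moderate level looks like such a nice middle ground.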

Is the implementation of moderate compression on the roadmap, or are there current technical limitations preventing it?

Thanks in advance for your time and insights!


Hi @Maxence_Oden !!

Welcome back!

I will raise this with our research team, and circle back here.

Thanks for using Weaviate and pushing it :wink:

Happy coding!

I am just learning about this type of quantization - looks super cool - this might be interesting too (by Google Research):
TurboQuant: Redefining AI efficiency with extreme compression

Just gonna plug why I’m into quantization - I’m working on a way to democratize huge OSS LLMs for us GPUPoor folk haha - 4Bit-Forge (Compression: Quantization and Sparsity + CUDA Hella)


hi @fraulty !!

Welcome to our community :hugs:

Our team was just discussing it internally :wink:


@DudaNogueira oh that’s super cool to know! Thank you :blush:- hope to learn a lot more from the community!
