Server specs and setup for production

olaf-ho · May 25, 2023, 9:04am

Hi Weaviate community,

I want to set up a Weaviate vector DB for a small production environment. What server specs and setup would you recommend to handle peak usage of up to 100 queries per second?
Object count will probably stay below 1M for a while.
Would a single docker setup be sufficient for that or should I be looking at Kubernets (seems overkill)? What about CPU and RAM recommendations?

These posts give a good overview, but I’m looking for answers on how many requests a single node can handle with which hardware.

zainhas · May 25, 2023, 1:57pm

Hi @olaf-ho

What is the dimensionality of your data? For now, let’s assume its in the ~100d ball park. In general, pretty much any setup should be able to handle 100 queries per second(qps) on 1M 100d objects.

For reference see the figure below where every single AWS and GCP machine was able to achieve over 100qps on the SIFT1M dataset (which has 1M objects each being 128 dimensional) single-threaded even.

olaf-ho · May 25, 2023, 3:38pm

Hi @zainhas, I’m using OpenAI‘s text-embedding-ada-002 which has 1536 dimensions if I understand it correctly.

What would you recommend in terms of CPU and RAM? I’ll probably choose GCP for hosting.

etiennedi · May 25, 2023, 4:30pm

n2-standard-4 or n2-standard-8 for best performance/cost ratio

Kavali_Kranthi_Kumar · February 2, 2024, 12:48pm

HI @etiennedi @zainhas any suggestions for azure vms.

DudaNogueira · February 5, 2024, 1:59pm

Hi @Kavali_Kranthi_Kumar !

One nice way to calculate the resource usage is to just ask Verba:

https://verba.weaviate.io/

That will give you an estimate on the memory consumption. With that in hands, you can properly select the VM size in your cloud provider.

Topic		Replies	Views
Infra Configuration for Docker Setup of Weaviate Support	1	75	June 5, 2024
HIgh Cpu Usage Support technical	3	192	May 28, 2025
Minimum system requirements for local setup Forum feedback	1	438	March 6, 2025
Weaviate AWS instace class size flavour and does it support multi region capabilities Support	1	156	April 17, 2024
Sizing disk storage for Weaviate Support	3	222	May 2, 2025

Server specs and setup for production

Related topics