Setting up Weaviate on EC2 instances with 3 nodes

I am seeking detailed information and documentation on installing and configuring Weaviate DB on AWS EC2 instances with three replication nodes. Specifically, I need the recommended configuration methods and the minimum requirements (OS, CPU, RAM, and volume size). Additionally, I would like to know the available options for monitoring the cluster and for performing backup/restore administrative activities. Once I have these details, I will begin preparing to implement them on AWS.

Hi @RamuA !!

Welcome to our community :hugs:

The best, and recommended, way to deploy a Weaviate multi-node cluster is with our official Helm chart on Kubernetes.

OS, CPU, RAM, and volume size will depend on your usage, especially on how many vectors you store and how many dimensions they have. For this, we have comprehensive resource planning documentation.
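To give a feel for how vector count and dimensions drive RAM, here is a rough sketch in Python. It assumes float32 vectors (4 bytes per dimension) and a commonly quoted rule of thumb of about twice the raw vector footprint; the function name and the `overhead_factor` default are illustrative assumptions, not official sizing guidance, so please check the resource planning docs for your actual workload.

```python
def estimate_vector_memory_gib(
    num_objects: int,
    dimensions: int,
    bytes_per_dim: int = 4,        # assumption: float32 vectors
    overhead_factor: float = 2.0,  # assumption: rule-of-thumb headroom for index and GC
) -> float:
    """Return a rough RAM estimate in GiB for holding the vectors in memory."""
    raw_bytes = num_objects * dimensions * bytes_per_dim
    return raw_bytes * overhead_factor / (1024 ** 3)


if __name__ == "__main__":
    # Example: 1 million objects with 384-dimensional embeddings.
    print(f"{estimate_vector_memory_gib(1_000_000, 384):.2f} GiB")
```

With replication, each node that holds a full replica needs roughly this amount on its own, so a three-replica setup does not divide the estimate by three.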

For observing your cluster, you can use Prometheus and Grafana, as explained in our blog post on monitoring.
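Before wiring up Grafana, it can help to confirm that a node is actually exposing metrics. The sketch below fetches a Prometheus text endpoint and parses it into a dictionary. The URL (`http://localhost:2112/metrics`) is an assumption about a default setup with Prometheus monitoring enabled, and the metric name in the sample is made up for illustration; adjust both to match your deployment.

```python
from typing import Dict
from urllib.request import urlopen


def parse_prometheus_text(text: str) -> Dict[str, float]:
    """Parse 'name{labels} value' lines from Prometheus text output.

    Simplification: assumes samples carry no trailing timestamp.
    """
    samples: Dict[str, float] = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):  # skip blanks, HELP and TYPE lines
            continue
        series, _, value = line.rpartition(" ")
        try:
            samples[series] = float(value)
        except ValueError:
            continue  # ignore lines that do not end in a number
    return samples


def fetch_metrics(url: str = "http://localhost:2112/metrics",
                  timeout: float = 5.0) -> Dict[str, float]:
    """Fetch and parse metrics; the default URL is an assumed setup."""
    with urlopen(url, timeout=timeout) as resp:
        return parse_prometheus_text(resp.read().decode("utf-8"))


if __name__ == "__main__":
    # Hypothetical sample payload, not real Weaviate output.
    sample = '# HELP example_metric_total An example.\nexample_metric_total{node="node-1"} 42\n'
    print(parse_prometheus_text(sample))
```

In a real cluster, Prometheus would scrape each node itself; a script like this is only useful as a quick sanity check from a shell on the instance.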

Let me know if this answers your questions!

Thanks!

Hello @DudaNogueira,

Thank you for your response. Could you please share the minimum requirements for instances to build a Weaviate development cluster with replication configuration? Additionally, should we use Kubernetes or Docker containers for this setup?

I also have a few more questions:

  1. How will it integrate with an IdP (Azure)?
  2. What are the steps to restore from an AWS S3 bucket, and how can we encrypt the backups stored in S3?
  3. How can we block outgoing traffic?
  4. How can we restrict client-based access behind a VPN and use Nginx as a proxy?
  5. What metrics are needed to measure the database performance?

Thank you for your assistance.