Resource Usage

Dharanish · January 31, 2024, 4:41am

Hi Team , I have a schema with the configuration
class_schema = {
“class”:“class_”, # class name
“vectorizer”: “none”,
“properties”: [{
“name”: “record_id”,
“dataType”: [“int”]
}],
“vectorIndexType”: “hnsw”,
“vectorIndexConfig”: {
“skip”: False,
“ef”: 256,
“efConstruction”: 256,
“maxConnections”: 64,
“vectorCacheMaxObjects”:300000,
“distance”: “l2-squared”
},
“shardingConfig”: {
“virtualPerPhysical”: 128,
“desiredCount”: 2,
“actualCount”: 2,
“key”: “_id”,
“strategy”: “hash”,
“function”: “murmur3”
},
“replicationConfig”:{
“factor”:1
}
}

I currently have 50K objects with 128 dimension vector. I recognize that memory consumption is more than double the disk space occupied.
disk = 53.2M,memory =110M.
queries:

is that memory consumption always greater than disk occupied for the above vectorIndexConfig?
does any thing have to done to reduce memory usage without comprising search accuracy?
What if replica to 2, does memory consumption also doubled ?

DudaNogueira · February 6, 2024, 7:01pm

Hi! Not sure I can answer all questions.

But if you set replica to 2, it will store your objects twice. So if you have only one node, it should double.

Regarding 2, there are some research on using DiskAnn. Of course, PQ and BQ are other options, but it will looose some accuracy. I believe you can take a step back, and consider the ammount of dimensions you are using. Maybe a lower dimension can get you the results you want, while consuming less memory.

For 1, it will read the data from disk into memory, as well as the connections. usually, right after start, Weaviate will load all those and use a lot of memory. So after a while, garbage collection should kick in and do some cleaning.

Let me know if that helps

Topic		Replies	Views
Documentation - Maximum index size, disk paging Support	3	900	December 6, 2023
Why does search speed suffer (and RAM consumption increases) when there are a large number of vectors in Weaviate? General	3	191	January 14, 2025
[Clarification on resource planning] General	1	190	March 6, 2024
Resource usage on multiple replica General developer-experience	1	257	February 2, 2024
How much resource is needed for 30M 1536d vector records index with bm25 index? Support	6	939	March 19, 2024

Resource Usage

Related topics