[Clarification on resource planning]

fairymane · March 5, 2024, 11:07am

After reading this resource planning doc and this post, the calculation seems to give the total storage size needed for an example dataset rather than the memory needed to hold the index.

To clarify: based on the calculation, does this mean the complete vectors are stored in memory as part of the index (together with other overhead such as maxConnections)?

If that is the case, what data is kept in storage? Does most of the data in memory and in storage overlap?

DudaNogueira · March 6, 2024, 9:05pm

Hi!

That’s the total memory used. The HNSW index will be stored in memory.

The calculations in the resource planning doc show the RAM needed to hold those vectors in READY shards.

If your class is compressed, for example, it will load the compressed HNSW index into memory (reducing memory usage), and it will store both the original vectors and the compressed/trained ones on disk.

Depending on your index configuration, you can expect the storage size to be between 0.1 and 0.6 times the memory size. Because different configurations affect the graph in different ways, it’s hard to give an exact formula.
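As a rough illustration of how these numbers fit together, here is a minimal Python sketch. It is not an official formula: the function names are hypothetical, 4 bytes per dimension assumes float32 vectors, and the 10 bytes per graph connection is an assumed ballpark for the maxConnections overhead. Only the 0.1 to 0.6 storage ratio comes from the paragraph above.

```python
def estimate_index_memory_bytes(
    num_objects: int,
    dimensions: int,
    max_connections: int,
    bytes_per_dimension: int = 4,    # assumption: float32 vectors
    bytes_per_connection: int = 10,  # assumption: rough cost of one graph link
) -> int:
    """Rough RAM estimate: raw vectors plus HNSW graph connections."""
    vector_bytes = num_objects * dimensions * bytes_per_dimension
    graph_bytes = num_objects * max_connections * bytes_per_connection
    return vector_bytes + graph_bytes


def estimate_storage_range_bytes(memory_bytes: int) -> tuple[float, float]:
    """Apply the 0.1 to 0.6 ratio quoted in this thread to a memory estimate."""
    return 0.1 * memory_bytes, 0.6 * memory_bytes


# Example values (hypothetical dataset): 1M objects, 384 dimensions, maxConnections = 64.
memory = estimate_index_memory_bytes(1_000_000, 384, 64)
low, high = estimate_storage_range_bytes(memory)
print(f"memory  ~ {memory / 1e9:.2f} GB")
print(f"storage ~ {low / 1e9:.2f} to {high / 1e9:.2f} GB")
```

Treat the output as an order-of-magnitude guide only; compression and other index settings change both numbers.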

Again, thanks for those awesome questions!
