Here is the log; shows first file has 18 chucks loaded; 2nd has 10. When final counts the object, it shows 20 2024-11-13 16:32:09,935 - INFO -
=== file_path: /Users/connie.wang/Desktop/connie/inspiration_azure/fastapi_onazure/app/rag/with_weaviate/data/constitution.pdf
chunking_recursiveCharacterTextSplitter.py: file is being chunked: /Users/connie.wang/Desktop/connie/inspiration_azure/fastapi_onazure/app/rag/with_weaviate/data/constitution.pdf
{“build_git_commit”:“4258bdfc2”,“build_go_version”:“go1.23.3”,“build_image_tag”:“HEAD”,“build_wv_version”:“1.27.3”,“level”:“warning”,“msg”:“prop len tracker file weaviate_data/pdf_collection/Ib2JkZKYAJpm/proplengths does not exist, creating new tracker”,“time”:“2024-11-13T16:32:10-06:00”}
{“action”:“hnsw_prefill_cache_async”,“build_git_commit”:“4258bdfc2”,“build_go_version”:“go1.23.3”,“build_image_tag”:“HEAD”,“build_wv_version”:“1.27.3”,“level”:“info”,“msg”:“not waiting for vector cache prefill, running in background”,“time”:“2024-11-13T16:32:10-06:00”,“wait_for_cache_prefill”:false}
{“build_git_commit”:“4258bdfc2”,“build_go_version”:“go1.23.3”,“build_image_tag”:“HEAD”,“build_wv_version”:“1.27.3”,“level”:“info”,“msg”:“Created shard pdf_collection_Ib2JkZKYAJpm in 3.017834ms”,“time”:“2024-11-13T16:32:10-06:00”}
2024-11-13 16:32:10,833 - INFO - HTTP Request: POST http://localhost:8079/v1/objects “HTTP/1.1 200 OK”
Inserted: Page 0 - Chunk 0
2024-11-13 16:32:11,310 - INFO - HTTP Request: POST http://localhost:8079/v1/objects “HTTP/1.1 200 OK”
Inserted: Page 1 - Chunk 1
2024-11-13 16:32:11,640 - INFO - HTTP Request: POST http://localhost:8079/v1/objects “HTTP/1.1 200 OK”
Inserted: Page 2 - Chunk 2
2024-11-13 16:32:12,396 - INFO - HTTP Request: POST http://localhost:8079/v1/objects “HTTP/1.1 200 OK”
Inserted: Page 3 - Chunk 3
2024-11-13 16:32:12,842 - INFO - HTTP Request: POST http://localhost:8079/v1/objects “HTTP/1.1 200 OK”
Inserted: Page 4 - Chunk 4
2024-11-13 16:32:13,188 - INFO - HTTP Request: POST http://localhost:8079/v1/objects “HTTP/1.1 200 OK”
Inserted: Page 5 - Chunk 5
2024-11-13 16:32:13,701 - INFO - HTTP Request: POST http://localhost:8079/v1/objects “HTTP/1.1 200 OK”
Inserted: Page 6 - Chunk 6
2024-11-13 16:32:14,206 - INFO - HTTP Request: POST http://localhost:8079/v1/objects “HTTP/1.1 200 OK”
Inserted: Page 7 - Chunk 7
2024-11-13 16:32:14,969 - INFO - HTTP Request: POST http://localhost:8079/v1/objects “HTTP/1.1 200 OK”
Inserted: Page 8 - Chunk 8
{“action”:“hnsw_compressed_vector_cache_prefill_progress”,“build_git_commit”:“4258bdfc2”,“build_go_version”:“go1.23.3”,“build_image_tag”:“HEAD”,“build_wv_version”:“1.27.3”,“elapsed_total”:5000034500,“level”:“info”,“loaded”:0,“msg”:“loaded 0 vectors in 5s, current rate is 0 vectors/s, total rate is 0 vectors/s”,“rate_per_second”:0,“time”:“2024-11-13T16:32:15-06:00”,“total_rate_per_second”:0}
2024-11-13 16:32:15,531 - INFO - HTTP Request: POST http://localhost:8079/v1/objects “HTTP/1.1 200 OK”
Inserted: Page 9 - Chunk 9
2024-11-13 16:32:15,951 - INFO - HTTP Request: POST http://localhost:8079/v1/objects “HTTP/1.1 200 OK”
Inserted: Page 10 - Chunk 10
2024-11-13 16:32:16,424 - INFO - HTTP Request: POST http://localhost:8079/v1/objects “HTTP/1.1 200 OK”
Inserted: Page 11 - Chunk 11
2024-11-13 16:32:16,754 - INFO - HTTP Request: POST http://localhost:8079/v1/objects “HTTP/1.1 200 OK”
Inserted: Page 12 - Chunk 12
2024-11-13 16:32:16,976 - INFO - HTTP Request: POST http://localhost:8079/v1/objects “HTTP/1.1 200 OK”
Inserted: Page 13 - Chunk 13
2024-11-13 16:32:17,388 - INFO - HTTP Request: POST http://localhost:8079/v1/objects “HTTP/1.1 200 OK”
Inserted: Page 14 - Chunk 14
2024-11-13 16:32:17,832 - INFO - HTTP Request: POST http://localhost:8079/v1/objects “HTTP/1.1 200 OK”
Inserted: Page 15 - Chunk 15
2024-11-13 16:32:18,164 - INFO - HTTP Request: POST http://localhost:8079/v1/objects “HTTP/1.1 200 OK”
Inserted: Page 16 - Chunk 16
2024-11-13 16:32:19,974 - INFO - HTTP Request: POST http://localhost:8079/v1/objects “HTTP/1.1 200 OK”
Inserted: Page 17 - Chunk 17
{“action”:“hnsw_compressed_vector_cache_prefill_progress”,“build_git_commit”:“4258bdfc2”,“build_go_version”:“go1.23.3”,“build_image_tag”:“HEAD”,“build_wv_version”:“1.27.3”,“elapsed_total”:10000003084,“level”:“info”,“loaded”:0,“msg”:“loaded 0 vectors in 10s, current rate is 0 vectors/s, total rate is 0 vectors/s”,“rate_per_second”:0,“time”:“2024-11-13T16:32:20-06:00”,“total_rate_per_second”:0}
2024-11-13 16:32:20,372 - INFO - HTTP Request: POST http://localhost:8079/v1/objects “HTTP/1.1 200 OK”
Inserted: Page 18 - Chunk 18
2024-11-13 16:32:20 - All chunks inserted for /Users/connie.wang/Desktop/connie/inspiration_azure/fastapi_onazure/app/rag/with_weaviate/data/constitution.pdf
2024-11-13 16:32:20,373 - INFO -
Document /Users/connie.wang/Desktop/connie/inspiration_azure/fastapi_onazure/app/rag/with_weaviate/data/constitution.pdf Processing Status:
{
“status”: true,
“message”: ,
“error”:
}
2024-11-13 16:32:20,373 - INFO -
=== file_path: /Users/connie.wang/Desktop/connie/inspiration_azure/fastapi_onazure/app/rag/with_weaviate/data/what_is_a_constitution.pdf
chunking_recursiveCharacterTextSplitter.py: file is being chunked: /Users/connie.wang/Desktop/connie/inspiration_azure/fastapi_onazure/app/rag/with_weaviate/data/what_is_a_constitution.pdf
2024-11-13 16:32:20,745 - INFO - HTTP Request: POST http://localhost:8079/v1/objects “HTTP/1.1 200 OK”
Inserted: Page 0 - Chunk 0
2024-11-13 16:32:21,027 - INFO - HTTP Request: POST http://localhost:8079/v1/objects “HTTP/1.1 200 OK”
Inserted: Page 1 - Chunk 1
2024-11-13 16:32:21,489 - INFO - HTTP Request: POST http://localhost:8079/v1/objects “HTTP/1.1 200 OK”
Inserted: Page 2 - Chunk 2
2024-11-13 16:32:22,146 - INFO - HTTP Request: POST http://localhost:8079/v1/objects “HTTP/1.1 200 OK”
Inserted: Page 3 - Chunk 3
2024-11-13 16:32:22,590 - INFO - HTTP Request: POST http://localhost:8079/v1/objects “HTTP/1.1 200 OK”
Inserted: Page 4 - Chunk 4
2024-11-13 16:32:22,965 - INFO - HTTP Request: POST http://localhost:8079/v1/objects “HTTP/1.1 200 OK”
Inserted: Page 5 - Chunk 5
2024-11-13 16:32:23,338 - INFO - HTTP Request: POST http://localhost:8079/v1/objects “HTTP/1.1 200 OK”
Inserted: Page 6 - Chunk 6
2024-11-13 16:32:23,568 - INFO - HTTP Request: POST http://localhost:8079/v1/objects “HTTP/1.1 200 OK”
Inserted: Page 7 - Chunk 7
2024-11-13 16:32:23,940 - INFO - HTTP Request: POST http://localhost:8079/v1/objects “HTTP/1.1 200 OK”
Inserted: Page 8 - Chunk 8
2024-11-13 16:32:24,390 - INFO - HTTP Request: POST http://localhost:8079/v1/objects “HTTP/1.1 200 OK”
Inserted: Page 9 - Chunk 9
2024-11-13 16:32:24,672 - INFO - HTTP Request: POST http://localhost:8079/v1/objects “HTTP/1.1 200 OK”
Inserted: Page 10 - Chunk 10
2024-11-13 16:32:24,971 - INFO - HTTP Request: POST http://localhost:8079/v1/objects “HTTP/1.1 200 OK”
Inserted: Page 11 - Chunk 11
2024-11-13 16:32:24 - All chunks inserted for /Users/connie.wang/Desktop/connie/inspiration_azure/fastapi_onazure/app/rag/with_weaviate/data/what_is_a_constitution.pdf
2024-11-13 16:32:24,972 - INFO -
Document /Users/connie.wang/Desktop/connie/inspiration_azure/fastapi_onazure/app/rag/with_weaviate/data/what_is_a_constitution.pdf Processing Status:
{
“status”: true,
“message”: ,
“error”:
}
2024-11-13 16:32:24,972 - INFO - === utils.py url: http://localhost:8079/v1/objects/
2024-11-13 16:32:24,978 - INFO - === utils.py
{‘classes’: [{‘class’: ‘PDF_COLLECTION’, ‘invertedIndexConfig’: {‘bm25’: {‘b’: 0.75, ‘k1’: 1.2}, ‘cleanupIntervalSeconds’: 60, ‘indexNullState’: True, ‘indexPropertyLength’: True, ‘indexTimestamps’: True, ‘stopwords’: {‘additions’: None, ‘preset’: ‘en’, ‘removals’: None}}, ‘moduleConfig’: {‘generative-cohere’: {}, ‘text2vec-openai’: {‘baseURL’: ‘https://api.openai.com’, ‘model’: ‘text-embedding-3-small’, ‘vectorizeClassName’: True}}, ‘multiTenancyConfig’: {‘autoTenantActivation’: False, ‘autoTenantCreation’: False, ‘enabled’: False}, ‘properties’: [{‘dataType’: [‘text’], ‘indexFilterable’: True, ‘indexRangeFilters’: False, ‘indexSearchable’: True, ‘moduleConfig’: {‘text2vec-openai’: {‘skip’: False, ‘vectorizePropertyName’: True}}, ‘name’: ‘page_content’, ‘tokenization’: ‘word’}, {‘dataType’: [‘int’], ‘indexFilterable’: True, ‘indexRangeFilters’: False, ‘indexSearchable’: False, ‘moduleConfig’: {‘text2vec-openai’: {‘skip’: False, ‘vectorizePropertyName’: True}}, ‘name’: ‘page_number’}, {‘dataType’: [‘text’], ‘indexFilterable’: True, ‘indexRangeFilters’: False, ‘indexSearchable’: True, ‘moduleConfig’: {‘text2vec-openai’: {‘skip’: False, ‘vectorizePropertyName’: True}}, ‘name’: ‘source’, ‘tokenization’: ‘word’}], ‘replicationConfig’: {‘asyncEnabled’: False, ‘deletionStrategy’: ‘DeleteOnConflict’, ‘factor’: 1}, ‘shardingConfig’: {‘actualCount’: 1, ‘actualVirtualCount’: 128, ‘desiredCount’: 1, ‘desiredVirtualCount’: 128, ‘function’: ‘murmur3’, ‘key’: ‘_id’, ‘strategy’: ‘hash’, ‘virtualPerPhysical’: 128}, ‘vectorIndexConfig’: {‘bq’: {‘enabled’: True}, ‘cleanupIntervalSeconds’: 300, ‘distance’: ‘cosine’, ‘dynamicEfFactor’: 8, ‘dynamicEfMax’: 500, ‘dynamicEfMin’: 100, ‘ef’: -1, ‘efConstruction’: 128, ‘filterStrategy’: ‘sweeping’, ‘flatSearchCutoff’: 40000, ‘maxConnections’: 32, ‘pq’: {‘bitCompression’: False, ‘centroids’: 256, ‘enabled’: False, ‘encoder’: {‘distribution’: ‘log-normal’, ‘type’: ‘kmeans’}, ‘segments’: 0, ‘trainingLimit’: 100000}, ‘skip’: False, ‘sq’: {‘enabled’: False, ‘rescoreLimit’: 20, ‘trainingLimit’: 100000}, ‘vectorCacheMaxObjects’: 1000000000000}, ‘vectorIndexType’: ‘hnsw’, ‘vectorizer’: ‘text2vec-openai’}]}
2024-11-13 16:32:24,984 - INFO -
=== utils.py total objects 20 in PDF_COLLECTION
2024-11-13 16:32:24,984 - INFO - === utils.py counts per file
{
“/Users/connie.wang/Desktop/connie/inspiration_azure/fastapi_onazure/app/rag/with_weaviate/data/constitution.pdf”: 11,
“/Users/connie.wang/Desktop/connie/inspiration_azure/fastapi_onazure/app/rag/with_weaviate/data/what_is_a_constitution.pdf”: 9
}
{‘status’: True, ‘message’: [‘20 already in http://localhost:8079/v1/objects/’], ‘error’: }
2024-11-13 16:32:24,985 - INFO - === *created.py - url: http://localhost:8079/v1/objects/
2024-11-13 16:32:24,985 - INFO - === *created.py - object_count: 20
2024-11-13 16:32:24,985 - INFO -
Document Processing Status: for /Users/connie.wang/Desktop/connie/inspiration_azure/fastapi_onazure/app/rag/with_weaviate/data
{
“status”: true,
“message”: [
“20 already in http://localhost:8079/v1/objects/”
],
“error”:
}
{“action”:“restapi_management”,“build_git_commit”:“4258bdfc2”,“build_go_version”:“go1.23.3”,“build_image_tag”:“HEAD”,“build_wv_version”:“1.27.3”,“level”:“info”,“msg”:“Shutting down… “,“time”:“2024-11-13T16:32:24-06:00”,“version”:“1.27.3”}
{“action”:“restapi_management”,“build_git_commit”:“4258bdfc2”,“build_go_version”:“go1.23.3”,“build_image_tag”:“HEAD”,“build_wv_version”:“1.27.3”,“level”:“info”,“msg”:“Stopped serving weaviate at http://127.0.0.1:8079”,“time”:“2024-11-13T16:32:24-06:00”,“version”:“1.27.3”}
{“action”:“hnsw_compressed_vector_cache_prefill_progress”,“build_git_commit”:“4258bdfc2”,“build_go_version”:“go1.23.3”,“build_image_tag”:“HEAD”,“build_wv_version”:“1.27.3”,“elapsed_total”:15000027792,“level”:“info”,“loaded”:0,“msg”:“loaded 0 vectors in 15s, current rate is 0 vectors/s, total rate is 0 vectors/s”,“rate_per_second”:0,“time”:“2024-11-13T16:32:25-06:00”,“total_rate_per_second”:0}
{“action”:“telemetry_push”,“build_git_commit”:“4258bdfc2”,“build_go_version”:“go1.23.3”,“build_image_tag”:“HEAD”,“build_wv_version”:“1.27.3”,“level”:“info”,“msg”:“telemetry terminated”,“payload”:”\u0026{MachineID:857f2ecf-d343-4a0c-8023-8363be41eafc Type:TERMINATE Version:1.27.3 NumObjects:0 OS:darwin Arch:arm64 UsedModules:[generative-cohere text2vec-openai]}”,“time”:“2024-11-13T16:32:25-06:00”}
{“build_git_commit”:“4258bdfc2”,“build_go_version”:“go1.23.3”,“build_image_tag”:“HEAD”,“build_wv_version”:“1.27.3”,“level”:“info”,“msg”:“closing raft FSM store …”,“time”:“2024-11-13T16:32:25-06:00”}
{“build_git_commit”:“4258bdfc2”,“build_go_version”:“go1.23.3”,“build_image_tag”:“HEAD”,“build_wv_version”:“1.27.3”,“level”:“info”,“msg”:“shutting down raft sub-system …”,“time”:“2024-11-13T16:32:25-06:00”}
{“build_git_commit”:“4258bdfc2”,“build_go_version”:“go1.23.3”,“build_image_tag”:“HEAD”,“build_wv_version”:“1.27.3”,“level”:“info”,“msg”:“transferring leadership to another server”,“time”:“2024-11-13T16:32:25-06:00”}
{“build_git_commit”:“4258bdfc2”,“build_go_version”:“go1.23.3”,“build_image_tag”:“HEAD”,“build_wv_version”:“1.27.3”,“error”:“cannot find peer”,“level”:“error”,“msg”:“transferring leadership”,“time”:“2024-11-13T16:32:25-06:00”}
{“build_git_commit”:“4258bdfc2”,“build_go_version”:“go1.23.3”,“build_image_tag”:“HEAD”,“build_wv_version”:“1.27.3”,“level”:“info”,“msg”:“closing raft-net …”,“time”:“2024-11-13T16:32:25-06:00”}
{“build_git_commit”:“4258bdfc2”,“build_go_version”:“go1.23.3”,“build_image_tag”:“HEAD”,“build_wv_version”:“1.27.3”,“level”:“info”,“msg”:“closing log store …”,“time”:“2024-11-13T16:32:25-06:00”}
{“build_git_commit”:“4258bdfc2”,“build_go_version”:“go1.23.3”,“build_image_tag”:“HEAD”,“build_wv_version”:“1.27.3”,“level”:“info”,“msg”:“closing data store …”,“time”:“2024-11-13T16:32:25-06:00”}
{“build_git_commit”:“4258bdfc2”,“build_go_version”:“go1.23.3”,“build_image_tag”:“HEAD”,“build_wv_version”:“1.27.3”,“level”:“info”,“msg”:“closing loaded database …”,“time”:“2024-11-13T16:32:25-06:00”}
{“build_git_commit”:“4258bdfc2”,“build_go_version”:“go1.23.3”,“build_image_tag”:“HEAD”,“build_wv_version”:“1.27.3”,“level”:“info”,“msg”:“closing raft-rpc client …”,“time”:“2024-11-13T16:32:25-06:00”}
{“build_git_commit”:“4258bdfc2”,“build_go_version”:“go1.23.3”,“build_image_tag”:“HEAD”,“build_wv_version”:“1.27.3”,“level”:“info”,“msg”:“closing raft-rpc server …”,“time”:“2024-11-13T16:32:25-06:00”}
(.venv) connie.wang@Connies-MacBook-Pro-M3 fastapi_onazure % python app/rag/with_weaviate/utils/utils.py
2024-11-13 16:32:40,122 - INFO - === configs.py - blob_name for azure: rag/data/constitution.pdf
2024-11-13 16:32:40,122 - INFO - === configs.py - pdf_file_path : /Users/connie.wang/Desktop/connie/inspiration_azure/fastapi_onazure/app/rag/with_weaviate/data
2024-11-13 16:32:40,127 - INFO - Started /Users/connie.wang/.cache/weaviate-embedded: process ID 17425
{“action”:“startup”,“build_git_commit”:“4258bdfc2”,“build_go_version”:“go1.23.3”,“build_image_tag”:“HEAD”,“build_wv_version”:“1.27.3”,“default_vectorizer_module”:“none”,“level”:“info”,“msg”:“the default vectorizer modules is set to "none", as a result all new schema classes without an explicit vectorizer setting, will use this vectorizer”,“time”:“2024-11-13T16:32:40-06:00”}
{“action”:“startup”,“auto_schema_enabled”:true,“build_git_commit”:“4258bdfc2”,“build_go_version”:“go1.23.3”,“build_image_tag”:“HEAD”,“build_wv_version”:“1.27.3”,“level”:“info”,“msg”:“auto schema enabled setting is set to "true"”,“time”:“2024-11-13T16:32:40-06:00”}
{“build_git_commit”:“4258bdfc2”,“build_go_version”:“go1.23.3”,“build_image_tag”:“HEAD”,“build_wv_version”:“1.27.3”,“level”:“info”,“msg”:“No resource limits set, weaviate will use all available memory and CPU. To limit resources, set LIMIT_RESOURCES=true”,“time”:“2024-11-13T16:32:40-06:00”}
{“build_git_commit”:“4258bdfc2”,“build_go_version”:“go1.23.3”,“build_image_tag”:“HEAD”,“build_wv_version”:“1.27.3”,“level”:“info”,“msg”:“module offload-s3 is enabled”,“time”:“2024-11-13T16:32:40-06:00”}
{“build_git_commit”:“4258bdfc2”,“build_go_version”:“go1.23.3”,“build_image_tag”:“HEAD”,“build_wv_version”:“1.27.3”,“level”:“warning”,“msg”:“Multiple vector spaces are present, GraphQL Explore and REST API list objects endpoint module include params has been disabled as a result.”,“time”:“2024-11-13T16:32:40-06:00”}
{“build_git_commit”:“4258bdfc2”,“build_go_version”:“go1.23.3”,“build_image_tag”:“HEAD”,“build_wv_version”:“1.27.3”,“level”:“info”,“msg”:“open cluster service”,“servers”:{“Embedded_at_8079”:51416},“time”:“2024-11-13T16:32:40-06:00”}
{“address”:“192.168.1.44:51417”,“build_git_commit”:“4258bdfc2”,“build_go_version”:“go1.23.3”,“build_image_tag”:“HEAD”,“build_wv_version”:“1.27.3”,“level”:“info”,“msg”:“starting cloud rpc server …”,“time”:“2024-11-13T16:32:40-06:00”}
{“build_git_commit”:“4258bdfc2”,“build_go_version”:“go1.23.3”,“build_image_tag”:“HEAD”,“build_wv_version”:“1.27.3”,“level”:“info”,“msg”:“starting raft sub-system …”,“time”:“2024-11-13T16:32:40-06:00”}
{“address”:“192.168.1.44:51416”,“build_git_commit”:“4258bdfc2”,“build_go_version”:“go1.23.3”,“build_image_tag”:“HEAD”,“build_wv_version”:“1.27.3”,“level”:“info”,“msg”:“tcp transport”,“tcpMaxPool”:3,“tcpTimeout”:10000000000,“time”:“2024-11-13T16:32:40-06:00”}
{“build_git_commit”:“4258bdfc2”,“build_go_version”:“go1.23.3”,“build_image_tag”:“HEAD”,“build_wv_version”:“1.27.3”,“level”:“info”,“msg”:“loading local db”,“time”:“2024-11-13T16:32:40-06:00”}
{“build_git_commit”:“4258bdfc2”,“build_go_version”:“go1.23.3”,“build_image_tag”:“HEAD”,“build_wv_version”:“1.27.3”,“level”:“info”,“msg”:“local DB successfully loaded”,“time”:“2024-11-13T16:32:40-06:00”}
{“build_git_commit”:“4258bdfc2”,“build_go_version”:“go1.23.3”,“build_image_tag”:“HEAD”,“build_wv_version”:“1.27.3”,“level”:“info”,“msg”:“schema manager loaded”,“n”:0,“time”:“2024-11-13T16:32:40-06:00”}
{“build_git_commit”:“4258bdfc2”,“build_go_version”:“go1.23.3”,“build_image_tag”:“HEAD”,“build_wv_version”:“1.27.3”,“level”:“info”,“metadata_only_voters”:false,“msg”:“construct a new raft node”,“name”:“Embedded_at_8079”,“time”:“2024-11-13T16:32:40-06:00”}
{“action”:“raft”,“build_git_commit”:“4258bdfc2”,“build_go_version”:“go1.23.3”,“build_image_tag”:“HEAD”,“build_wv_version”:“1.27.3”,“index”:61,“level”:“info”,“msg”:“initial configuration”,“servers”:“[[{Suffrage:Voter ID:Embedded_at_8079 Address:192.168.1.44:51277}]]”,“time”:“2024-11-13T16:32:40-06:00”}
{“build_git_commit”:“4258bdfc2”,“build_go_version”:“go1.23.3”,“build_image_tag”:“HEAD”,“build_wv_version”:“1.27.3”,“last_snapshot_index”:0,“last_store_applied_index_on_start”:62,“level”:“info”,“msg”:“raft node constructed”,“raft_applied_index”:0,“raft_last_index”:62,“time”:“2024-11-13T16:32:40-06:00”}
{“build_git_commit”:“4258bdfc2”,“build_go_version”:“go1.23.3”,“build_image_tag”:“HEAD”,“build_wv_version”:“1.27.3”,“hasState”:true,“level”:“info”,“msg”:“raft init”,“time”:“2024-11-13T16:32:40-06:00”}
{“action”:“raft”,“build_git_commit”:“4258bdfc2”,“build_go_version”:“go1.23.3”,“build_image_tag”:“HEAD”,“build_wv_version”:“1.27.3”,“follower”:{},“index”:61,“leader-address”:“”,“leader-id”:“”,“level”:“info”,“msg”:“entering follower state”,“servers”:“[[{Suffrage:Voter ID:Embedded_at_8079 Address:192.168.1.44:51277}]]”,“time”:“2024-11-13T16:32:40-06:00”}
{“build_git_commit”:“4258bdfc2”,“build_go_version”:“go1.23.3”,“build_image_tag”:“HEAD”,“build_wv_version”:“1.27.3”,“level”:“info”,“msg”:“attempting to join”,“remoteNodes”:[“192.168.1.44:51416”],“time”:“2024-11-13T16:32:40-06:00”}
{“build_git_commit”:“4258bdfc2”,“build_go_version”:“go1.23.3”,“build_image_tag”:“HEAD”,“build_wv_version”:“1.27.3”,“level”:“info”,“msg”:“attempted to join and failed”,“remoteNode”:“192.168.1.44:51416”,“status”:8,“time”:“2024-11-13T16:32:40-06:00”}
{“build_git_commit”:“4258bdfc2”,“build_go_version”:“go1.23.3”,“build_image_tag”:“HEAD”,“build_wv_version”:“1.27.3”,“level”:“info”,“msg”:“attempting to join”,“remoteNodes”:[“192.168.1.44:51416”],“time”:“2024-11-13T16:32:41-06:00”}
{“build_git_commit”:“4258bdfc2”,“build_go_version”:“go1.23.3”,“build_image_tag”:“HEAD”,“build_wv_version”:“1.27.3”,“level”:“info”,“msg”:“attempted to join and failed”,“remoteNode”:“192.168.1.44:51416”,“status”:8,“time”:“2024-11-13T16:32:41-06:00”}
{“action”:“raft”,“build_git_commit”:“4258bdfc2”,“build_go_version”:“go1.23.3”,“build_image_tag”:“HEAD”,“build_wv_version”:“1.27.3”,“follower”:{},“index”:61,“last-leader-addr”:“”,“last-leader-id”:“”,“leader-address”:“”,“leader-id”:“”,“level”:“warning”,“msg”:“heartbeat timeout reached, starting election”,“servers”:“[[{Suffrage:Voter ID:Embedded_at_8079 Address:192.168.1.44:51277}]]”,“time”:“2024-11-13T16:32:41-06:00”}
{“action”:“raft”,“build_git_commit”:“4258bdfc2”,“build_go_version”:“go1.23.3”,“build_image_tag”:“HEAD”,“build_wv_version”:“1.27.3”,“follower”:{},“index”:61,“last-leader-addr”:“”,“last-leader-id”:“”,“leader-address”:“”,“leader-id”:“”,“level”:“info”,“msg”:“entering candidate state”,“node”:{},“servers”:“[[{Suffrage:Voter ID:Embedded_at_8079 Address:192.168.1.44:51277}]]”,“term”:29,“time”:“2024-11-13T16:32:41-06:00”}
{“action”:“raft”,“build_git_commit”:“4258bdfc2”,“build_go_version”:“go1.23.3”,“build_image_tag”:“HEAD”,“build_wv_version”:“1.27.3”,“follower”:{},“from”:“Embedded_at_8079”,“id”:“Embedded_at_8079”,“index”:61,“last-leader-addr”:“”,“last-leader-id”:“”,“leader-address”:“”,“leader-id”:“”,“level”:“info”,“msg”:“pre-vote successful, starting election”,“needed”:1,“node”:{},“refused”:0,“servers”:“[[{Suffrage:Voter ID:Embedded_at_8079 Address:192.168.1.44:51277}]]”,“tally”:1,“term”:29,“time”:“2024-11-13T16:32:41-06:00”,“votesNeeded”:1}
{“action”:“raft”,“build_git_commit”:“4258bdfc2”,“build_go_version”:“go1.23.3”,“build_image_tag”:“HEAD”,“build_wv_version”:“1.27.3”,“follower”:{},“from”:“Embedded_at_8079”,“id”:“Embedded_at_8079”,“index”:61,“last-leader-addr”:“”,“last-leader-id”:“”,“leader-address”:“”,“leader-id”:“”,“level”:“info”,“msg”:“election won”,“needed”:1,“node”:{},“refused”:0,“servers”:“[[{Suffrage:Voter ID:Embedded_at_8079 Address:192.168.1.44:51277}]]”,“tally”:1,“term”:29,“time”:“2024-11-13T16:32:41-06:00”,“votesNeeded”:1}
{“action”:“raft”,“build_git_commit”:“4258bdfc2”,“build_go_version”:“go1.23.3”,“build_image_tag”:“HEAD”,“build_wv_version”:“1.27.3”,“follower”:{},“from”:“Embedded_at_8079”,“id”:“Embedded_at_8079”,“index”:61,“last-leader-addr”:“”,“last-leader-id”:“”,“leader”:{},“leader-address”:“”,“leader-id”:“”,“level”:“info”,“msg”:“entering leader state”,“needed”:1,“node”:{},“refused”:0,“servers”:“[[{Suffrage:Voter ID:Embedded_at_8079 Address:192.168.1.44:51277}]]”,“tally”:1,“term”:29,“time”:“2024-11-13T16:32:41-06:00”,“votesNeeded”:1}
{“build_git_commit”:“4258bdfc2”,“build_go_version”:“go1.23.3”,“build_image_tag”:“HEAD”,“build_wv_version”:“1.27.3”,“level”:“info”,“msg”:“Schema catching up: applying log entry: [7/62]”,“time”:“2024-11-13T16:32:41-06:00”}
{“build_git_commit”:“4258bdfc2”,“build_go_version”:“go1.23.3”,“build_image_tag”:“HEAD”,“build_wv_version”:“1.27.3”,“level”:“info”,“msg”:“Schema catching up: applying log entry: [8/62]”,“time”:“2024-11-13T16:32:41-06:00”}
{“build_git_commit”:“4258bdfc2”,“build_go_version”:“go1.23.3”,“build_image_tag”:“HEAD”,“build_wv_version”:“1.27.3”,“level”:“info”,“msg”:“Schema catching up: applying log entry: [19/62]”,“time”:“2024-11-13T16:32:41-06:00”}
{“build_git_commit”:“4258bdfc2”,“build_go_version”:“go1.23.3”,“build_image_tag”:“HEAD”,“build_wv_version”:“1.27.3”,“level”:“info”,“msg”:“Schema catching up: applying log entry: [22/62]”,“time”:“2024-11-13T16:32:41-06:00”}
{“build_git_commit”:“4258bdfc2”,“build_go_version”:“go1.23.3”,“build_image_tag”:“HEAD”,“build_wv_version”:“1.27.3”,“level”:“info”,“msg”:“Schema catching up: applying log entry: [23/62]”,“time”:“2024-11-13T16:32:41-06:00”}
{“build_git_commit”:“4258bdfc2”,“build_go_version”:“go1.23.3”,“build_image_tag”:“HEAD”,“build_wv_version”:“1.27.3”,“level”:“info”,“msg”:“Schema catching up: applying log entry: [24/62]”,“time”:“2024-11-13T16:32:41-06:00”}
{“build_git_commit”:“4258bdfc2”,“build_go_version”:“go1.23.3”,“build_image_tag”:“HEAD”,“build_wv_version”:“1.27.3”,“level”:“info”,“msg”:“Schema catching up: applying log entry: [57/62]”,“time”:“2024-11-13T16:32:41-06:00”}
{“build_git_commit”:“4258bdfc2”,“build_go_version”:“go1.23.3”,“build_image_tag”:“HEAD”,“build_wv_version”:“1.27.3”,“level”:“info”,“msg”:“Schema catching up: applying log entry: [62/62]”,“time”:“2024-11-13T16:32:41-06:00”}
{“build_git_commit”:“4258bdfc2”,“build_go_version”:“go1.23.3”,“build_image_tag”:“HEAD”,“build_wv_version”:“1.27.3”,“last_store_log_applied_index”:62,“level”:“info”,“log_index”:62,“log_name”:“LogCommand”,“log_type”:0,“msg”:“reloading local DB as RAFT and local DB are now caught up”,“time”:“2024-11-13T16:32:41-06:00”}
{“build_git_commit”:“4258bdfc2”,“build_go_version”:“go1.23.3”,“build_image_tag”:“HEAD”,“build_wv_version”:“1.27.3”,“level”:“info”,“msg”:“reload local db: update schema …”,“time”:“2024-11-13T16:32:41-06:00”}
{“build_git_commit”:“4258bdfc2”,“build_go_version”:“go1.23.3”,“build_image_tag”:“HEAD”,“build_wv_version”:“1.27.3”,“index”:“PDF_COLLECTION”,“level”:“info”,“msg”:“reload local index”,“time”:“2024-11-13T16:32:41-06:00”}
{“build_git_commit”:“4258bdfc2”,“build_go_version”:“go1.23.3”,“build_image_tag”:“HEAD”,“build_wv_version”:“1.27.3”,“level”:“info”,“msg”:“configured versions”,“server_version”:“1.27.3”,“time”:“2024-11-13T16:32:42-06:00”,“version”:“1.27.3”}
{“action”:“grpc_startup”,“build_git_commit”:“4258bdfc2”,“build_go_version”:“go1.23.3”,“build_image_tag”:“HEAD”,“build_wv_version”:“1.27.3”,“level”:“info”,“msg”:“grpc server listening at [::]:50050”,“time”:“2024-11-13T16:32:42-06:00”}
{“address”:“192.168.1.44:51416”,“build_git_commit”:“4258bdfc2”,“build_go_version”:“go1.23.3”,“build_image_tag”:“HEAD”,“build_wv_version”:“1.27.3”,“level”:“info”,“msg”:“current Leader”,“time”:“2024-11-13T16:32:42-06:00”}
{“build_git_commit”:“4258bdfc2”,“build_go_version”:“go1.23.3”,“build_image_tag”:“HEAD”,“build_wv_version”:“1.27.3”,“level”:“info”,“msg”:“attempting to join”,“remoteNodes”:[“192.168.1.44:51416”],“time”:“2024-11-13T16:32:42-06:00”}
{“action”:“raft”,“build_git_commit”:“4258bdfc2”,“build_go_version”:“go1.23.3”,“build_image_tag”:“HEAD”,“build_wv_version”:“1.27.3”,“command”:0,“follower”:{},“from”:“Embedded_at_8079”,“id”:“Embedded_at_8079”,“index”:61,“last-leader-addr”:“”,“last-leader-id”:“”,“leader”:{},“leader-address”:“”,“leader-id”:“”,“level”:“info”,“msg”:“updating configuration”,“needed”:1,“node”:{},“refused”:0,“server-addr”:“192.168.1.44:51416”,“server-id”:“Embedded_at_8079”,“servers”:“[[{Suffrage:Voter ID:Embedded_at_8079 Address:192.168.1.44:51416}]]”,“tally”:1,“term”:29,“time”:“2024-11-13T16:32:42-06:00”,“votesNeeded”:1}
{“action”:“restapi_management”,“build_git_commit”:“4258bdfc2”,“build_go_version”:“go1.23.3”,“build_image_tag”:“HEAD”,“build_wv_version”:“1.27.3”,“level”:“info”,“msg”:“Serving weaviate at http://127.0.0.1:8079”,“time”:“2024-11-13T16:32:42-06:00”,“version”:“1.27.3”}
2024-11-13 16:32:42,325 - INFO - HTTP Request: GET http://localhost:8079/v1/.well-known/openid-configuration “HTTP/1.1 404 Not Found”
2024-11-13 16:32:42,336 - INFO - HTTP Request: GET http://localhost:8079/v1/meta “HTTP/1.1 200 OK”
2024-11-13 16:32:42,337 - INFO - HTTP Request: GET http://localhost:8079/v1/.well-known/ready “HTTP/1.1 200 OK”
2024-11-13 16:32:42,440 - INFO - HTTP Request: GET https://pypi.org/pypi/weaviate-client/json “HTTP/1.1 200 OK”
2024-11-13 16:32:42,468 - INFO - === vectore_stores.py - embeded client initated <weaviate.client.WeaviateClient object at 0x1040461e0>
2024-11-13 16:32:42,468 - INFO - === utils.py url: http://localhost:8079/v1/objects/
2024-11-13 16:32:42,472 - INFO - === utils.py
{‘classes’: [{‘class’: ‘PDF_COLLECTION’, ‘invertedIndexConfig’: {‘bm25’: {‘b’: 0.75, ‘k1’: 1.2}, ‘cleanupIntervalSeconds’: 60, ‘indexNullState’: True, ‘indexPropertyLength’: True, ‘indexTimestamps’: True, ‘stopwords’: {‘additions’: None, ‘preset’: ‘en’, ‘removals’: None}}, ‘moduleConfig’: {‘generative-cohere’: {}, ‘text2vec-openai’: {‘baseURL’: ‘https://api.openai.com’, ‘model’: ‘text-embedding-3-small’, ‘vectorizeClassName’: True}}, ‘multiTenancyConfig’: {‘autoTenantActivation’: False, ‘autoTenantCreation’: False, ‘enabled’: False}, ‘properties’: [{‘dataType’: [‘text’], ‘indexFilterable’: True, ‘indexRangeFilters’: False, ‘indexSearchable’: True, ‘moduleConfig’: {‘text2vec-openai’: {‘skip’: False, ‘vectorizePropertyName’: True}}, ‘name’: ‘page_content’, ‘tokenization’: ‘word’}, {‘dataType’: [‘int’], ‘indexFilterable’: True, ‘indexRangeFilters’: False, ‘indexSearchable’: False, ‘moduleConfig’: {‘text2vec-openai’: {‘skip’: False, ‘vectorizePropertyName’: True}}, ‘name’: ‘page_number’}, {‘dataType’: [‘text’], ‘indexFilterable’: True, ‘indexRangeFilters’: False, ‘indexSearchable’: True, ‘moduleConfig’: {‘text2vec-openai’: {‘skip’: False, ‘vectorizePropertyName’: True}}, ‘name’: ‘source’, ‘tokenization’: ‘word’}], ‘replicationConfig’: {‘asyncEnabled’: False, ‘deletionStrategy’: ‘DeleteOnConflict’, ‘factor’: 1}, ‘shardingConfig’: {‘actualCount’: 1, ‘actualVirtualCount’: 128, ‘desiredCount’: 1, ‘desiredVirtualCount’: 128, ‘function’: ‘murmur3’, ‘key’: ‘_id’, ‘strategy’: ‘hash’, ‘virtualPerPhysical’: 128}, ‘vectorIndexConfig’: {‘bq’: {‘enabled’: True}, ‘cleanupIntervalSeconds’: 300, ‘distance’: ‘cosine’, ‘dynamicEfFactor’: 8, ‘dynamicEfMax’: 500, ‘dynamicEfMin’: 100, ‘ef’: -1, ‘efConstruction’: 128, ‘filterStrategy’: ‘sweeping’, ‘flatSearchCutoff’: 40000, ‘maxConnections’: 32, ‘pq’: {‘bitCompression’: False, ‘centroids’: 256, ‘enabled’: False, ‘encoder’: {‘distribution’: ‘log-normal’, ‘type’: ‘kmeans’}, ‘segments’: 0, ‘trainingLimit’: 100000}, ‘skip’: False, ‘sq’: {‘enabled’: False, ‘rescoreLimit’: 20, ‘trainingLimit’: 100000}, ‘vectorCacheMaxObjects’: 1000000000000}, ‘vectorIndexType’: ‘hnsw’, ‘vectorizer’: ‘text2vec-openai’}]}
{“action”:“hnsw_prefill_cache_async”,“build_git_commit”:“4258bdfc2”,“build_go_version”:“go1.23.3”,“build_image_tag”:“HEAD”,“build_wv_version”:“1.27.3”,“level”:“info”,“msg”:“not waiting for vector cache prefill, running in background”,“time”:“2024-11-13T16:32:42-06:00”,“wait_for_cache_prefill”:false}
{“build_git_commit”:“4258bdfc2”,“build_go_version”:“go1.23.3”,“build_image_tag”:“HEAD”,“build_wv_version”:“1.27.3”,“level”:“info”,“msg”:“Completed loading shard pdf_collection_Ib2JkZKYAJpm in 5.526791ms”,“time”:“2024-11-13T16:32:42-06:00”}
{“action”:“hnsw_compressed_vector_cache_prefill”,“build_git_commit”:“4258bdfc2”,“build_go_version”:“go1.23.3”,“build_image_tag”:“HEAD”,“build_wv_version”:“1.27.3”,“count”:31,“level”:“info”,“maxID”:30,“msg”:“prefilled compressed vector cache”,“time”:“2024-11-13T16:32:42-06:00”,“took”:344667}
2024-11-13 16:32:42,480 - INFO -
=== utils.py total objects 20 in PDF_COLLECTION
2024-11-13 16:32:42,481 - INFO - === utils.py counts per file
{
“/Users/connie.wang/Desktop/connie/inspiration_azure/fastapi_onazure/app/rag/with_weaviate/data/constitution.pdf”: 11,
“/Users/connie.wang/Desktop/connie/inspiration_azure/fastapi_onazure/app/rag/with_weaviate/data/what_is_a_constitution.pdf”: 9
}