Local Hugging Face vector embedding

I'm trying to use the Docker image of Weaviate 1.26.0 (on Windows) to generate vector embeddings locally on my computer, but when vectorization should occur I get the error `failed with status: 429 error: Rate limit reached. Please log in or use a HF access token`.

This is the docker-compose.yml:

```yaml
version: '3.4'

services:
  weaviate:
    image: semitechnologies/weaviate:1.26.0
    ports:
      - "9090:8080"
    environment:
      QUERY_DEFAULTS_LIMIT: 100
      AUTHENTICATION_ANONYMOUS_ACCESS_ENABLED: 'true'
      ENABLE_MODULES: 'text2vec-huggingface'  # Enable Hugging Face module
      HUGGINGFACE_INFERENCE_MODEL: 'sentence-transformers/all-MiniLM-L6-v2'
      DEFAULT_VECTORIZER_MODULE: text2vec-huggingface
      HUGGINGFACE_INFERENCE_API: 'local'
      PERSISTENCE_DATA_PATH: "/var/lib/weaviate"  # Inside the container
```

It seems that it is not possible to use local vectorization with this module, only remote vectorization (which implies sending text to Hugging Face). Is that correct? And what is the meaning of `HUGGINGFACE_INFERENCE_API: 'local'`? Thanks!

hi @fabriziof64 !!

Welcome to our community :hugs:

This message usually comes from the vectorizer service: the text2vec-huggingface module sends your text to the hosted Hugging Face Inference API, and without an access token you hit its rate limit.

Changing `HUGGINGFACE_INFERENCE_API` sets the URL where Weaviate will look for the Hugging Face inference service; `'local'` is not a special value that makes inference run on your machine.

If you want to run transformers locally, check the text2vec-transformers module, which runs the model in its own container next to Weaviate:
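A minimal sketch of that setup, assuming the stock `transformers-inference` image with the same all-MiniLM-L6-v2 model you were targeting (swap in whichever model image you need):

```yaml
version: '3.4'

services:
  weaviate:
    image: semitechnologies/weaviate:1.26.0
    ports:
      - "9090:8080"
    environment:
      AUTHENTICATION_ANONYMOUS_ACCESS_ENABLED: 'true'
      ENABLE_MODULES: 'text2vec-transformers'        # local inference, no Hugging Face API calls
      DEFAULT_VECTORIZER_MODULE: 'text2vec-transformers'
      TRANSFORMERS_INFERENCE_API: 'http://t2v-transformers:8080'  # points at the container below
      PERSISTENCE_DATA_PATH: '/var/lib/weaviate'

  t2v-transformers:
    image: semitechnologies/transformers-inference:sentence-transformers-all-MiniLM-L6-v2
    environment:
      ENABLE_CUDA: '0'  # set to '1' if a GPU is available
```

With this, the model is downloaded once into the `t2v-transformers` container image and all vectorization happens on your machine, so no text leaves your computer.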

Also, you can run models locally using Ollama with the text2vec-ollama module:
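A sketch of that variant, assuming Ollama is running on the host and Docker Desktop's `host.docker.internal` alias is available (as it is on Windows):

```yaml
version: '3.4'

services:
  weaviate:
    image: semitechnologies/weaviate:1.26.0
    ports:
      - "9090:8080"
    environment:
      AUTHENTICATION_ANONYMOUS_ACCESS_ENABLED: 'true'
      ENABLE_MODULES: 'text2vec-ollama'
      DEFAULT_VECTORIZER_MODULE: 'text2vec-ollama'
      PERSISTENCE_DATA_PATH: '/var/lib/weaviate'
```

With text2vec-ollama, the Ollama endpoint (e.g. `http://host.docker.internal:11434`) and the embedding model are set per collection in the schema rather than via environment variables, so you configure them from your client when creating the collection.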

Let me know if this helps!

Thanks!