How to use Generative Search with a locally hosted LLM (within the firewall) fronted by a REST API with basic authentication?

Thanks for all your help; I am at the final step of getting generative search working. Going through the document at Generative Search - OpenAI | Weaviate - Vector Database, I understand that Weaviate will only work with OpenAI and an OPENAI_API_KEY. Is that right?

We have a locally hosted LLM (within the firewall) fronted by a REST API with basic authentication.

For example, the following API call works:

prompt='{"inputs": "What is Docker?"}'

curl --silent -X POST "$URL" -H "Authorization: Basic $BASIC_AUTH" -H "Content-Type: application/json" --data "$prompt"

where $URL is https:///gpt/api/v1/models/Llama-2-70b-chat-hf/generate
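For completeness, $BASIC_AUTH is the base64-encoded user:password pair. A minimal sketch, with placeholder credentials:

```bash
# Build the Basic auth token from a user:password pair (placeholder credentials).
BASIC_AUTH=$(printf '%s' 'user:password' | base64)
```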

Is there a way in the Weaviate configuration to set the internal REST API URL and BASIC_AUTH when using generative search?

So far, I have been successful in hosting Weaviate on OpenShift, loading data using text2vec-contextionary, and querying the data (a sample query is sketched below). Now I am trying to call the internally hosted REST API (LLM) with basic auth to send the prompt along with the Weaviate query results.
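As a sketch, the querying step currently looks like this (the Document class and content field are placeholder names for our schema):

```bash
# A nearText query via text2vec-contextionary; class and field names are placeholders.
curl --silent -X POST "http://localhost:8080/v1/graphql" \
  -H "Content-Type: application/json" \
  --data '{"query": "{ Get { Document(nearText: {concepts: [\"docker\"]}, limit: 3) { content } } }"}'
```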

Please let me know how to accomplish this last step with Weaviate.

Thanks.

hi @pon_raj !!

I believe this is a case of crafting a custom module :thinking:

You can override the base URL for the OpenAI modules (check the code here), but I am not sure about the basic auth part.
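For the URL part, here is a minimal sketch of a class-level override, assuming a Weaviate version where generative-openai accepts a baseURL in the class moduleConfig (the Document class name and internal host are placeholders). Note that the module will still call OpenAI-style paths such as /v1/chat/completions under that host, and I don't know of a setting for an Authorization: Basic header:

```bash
# Sketch: override the generative-openai base URL at the class level.
# Assumes a Weaviate version whose moduleConfig accepts "baseURL";
# "Document" and the host are placeholders.
curl --silent -X POST "http://localhost:8080/v1/schema" \
  -H "Content-Type: application/json" \
  --data '{
    "class": "Document",
    "vectorizer": "text2vec-contextionary",
    "moduleConfig": {
      "generative-openai": {
        "baseURL": "https://llm.internal.example"
      }
    }
  }'
```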

Also, note that initializing the client with a different base URL would replace it for both text2vec-openai and generative-openai.
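For illustration, a request-level override might look like this sketch, assuming a Weaviate version that honors the X-OpenAI-BaseURL header (the class, field, and host are placeholders); the same headers would apply to every OpenAI-backed module used by the request:

```bash
# Sketch: pass the OpenAI key and base URL per request.
# X-OpenAI-BaseURL support is an assumption about the Weaviate version;
# "Document", "content", and the host are placeholders.
curl --silent -X POST "http://localhost:8080/v1/graphql" \
  -H "Content-Type: application/json" \
  -H "X-OpenAI-Api-Key: $OPENAI_API_KEY" \
  -H "X-OpenAI-BaseURL: https://llm.internal.example" \
  --data '{"query": "{ Get { Document(limit: 2) { content _additional { generate(singleResult: {prompt: \"Summarize: {content}\"}) { singleResult error } } } } }"}'
```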

Thank you for the helpful advice. I will give it a try.
