Would like support for using the nvidia mig driver for gpu

djjeffr · April 12, 2024, 8:44pm

in values yaml
muti2vec-clip:
…
enable_cuda: true
…
#added this line to make it work
cuda_visible_devices: 0
…
nvidia.com/mig-1g.5gb: 1

Also had to modiify file transformersInferenceDeployment.yaml to allow the line cuda_visible_devices
-helm chart 16.4.0

Was expecting it to work without adding cuda_visible_devices line, the same as it worked when using nvidia.com/gpu: 1

If cuda_visible_device is not there it fails to start with error cuda_visible_device must be there to use nvidia mig driver.

No response

1.20.5

DudaNogueira · April 15, 2024, 10:18am

Hi @djjeffr ! Welcome to our community!

Thanks you for sharing this.

Just to make it clear: now with those changes, it will work as expected, right?

I believe it’s worth investigating if this is possible to be defined in the docker image itself:

And the changes can be done in the helm repo:

So we could open the PR or issue request there so our team can check it out.

What do you think?

Thanks!

djjeffr · April 17, 2024, 8:39pm

Sound good to raise a PR, would be nice if in image/helm chart so I don’t need to modify the file after every new helm chart.

Topic		Replies	Views
Version of pytorch used in modules doesn't support Nvidia sm90 driver Support	2	184	January 17, 2025
[Question] reranker image seems to fail when use gpu? Support bug , integration	3	276	October 4, 2024
Unable to deploy python app with weaviate using a single docker-compose.yml file Support	6	2365	September 26, 2023
Self Hosting 5 min Timeout? Support	3	656	August 11, 2023
Weaviate Helm Chart Failing General	5	558	April 5, 2024