My Ray head node fails to start with the following error:
Warning Failed 5m4s kubelet Error: failed to start container "ray-node": Error response from daemon: OCI runtime create failed: container_linux.go:346: starting container process caused "process_linux.go:449: container init caused \"process_linux.go:432: running prestart hook 0 caused \\\"error running hook: exit status 1, stdout: , stderr: nvidia-container-cli: device error: unknown device id: no-gpu-has-9MiB-to-run\\\\n\\\"\"": unknown
After some searching, I believe this is caused by a CUDA version mismatch.
The CUDA version on my server is 11.4:
nvidia-smi
Tue Jul 19 15:04:11 2022
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 470.57.02 Driver Version: 470.57.02 CUDA Version: 11.4
I don't know how to find a Ray Docker image built with CUDA 11.4.
The newest one I can find on Docker Hub is rayproject/ray:nightly-py38-cu113.
Can anybody help with this?
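
For reference, here is a minimal sketch of how the two versions mentioned above could be compared. It assumes that the "CUDA Version" field printed by nvidia-smi is the highest CUDA runtime version the installed driver supports, and that a tag suffix such as "cu113" encodes CUDA 11.3 (two-digit major, one-digit minor). The function names are hypothetical and are not part of Ray or any NVIDIA tool.

```python
import re


def cuda_from_nvidia_smi(text):
    """Return (major, minor) from the 'CUDA Version: X.Y' field of nvidia-smi output."""
    match = re.search(r"CUDA Version:\s*(\d+)\.(\d+)", text)
    if match is None:
        raise ValueError("no 'CUDA Version' field found in nvidia-smi output")
    return int(match.group(1)), int(match.group(2))


def cuda_from_image_tag(tag):
    """Return (major, minor) from a tag ending in 'cuXYZ'.

    Assumption: 'cu113' means CUDA 11.3 (two-digit major, one-digit minor).
    """
    match = re.search(r"cu(\d{2})(\d)$", tag)
    if match is None:
        raise ValueError("tag does not end in a 'cuXYZ' suffix")
    return int(match.group(1)), int(match.group(2))


def image_not_newer_than_driver(smi_text, tag):
    """True if the image's CUDA version is <= the version reported by nvidia-smi."""
    return cuda_from_image_tag(tag) <= cuda_from_nvidia_smi(smi_text)


if __name__ == "__main__":
    smi_line = "| NVIDIA-SMI 470.57.02 Driver Version: 470.57.02 CUDA Version: 11.4"
    image = "rayproject/ray:nightly-py38-cu113"
    print("driver reports CUDA:", cuda_from_nvidia_smi(smi_line))
    print("image built for CUDA:", cuda_from_image_tag(image))
    print("image not newer than driver:", image_not_newer_than_driver(smi_line, image))
```

This only compares version numbers; it does not check whether the container runtime can actually see a GPU.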