Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
[Release Test] Update cuda version in gpu docker cluster launcher ima…
…ge to 12.1 (#42246) After the Ray 2.9 release, the release test for the GPU Docker example cluster YAML file started failing with 2023-12-23 03:00:43,078 VINFO command_runner.py:371 -- Running `docker run --rm --name ray_nvidia_docker -d -it -e LC_ALL=C.UTF-8 -e LANG=C.UTF-8 --shm-size='2301055426.56b' --runtime=nvidia --net=host rayproject/ray:latest-gpu bash` 24897079968c098daccf1ed65a0bea5d3d9e3df84de201ea20f1a34b0363975c docker: Error response from daemon: failed to create shim task: OCI runtime create failed: runc create failed: unable to start container process: error during container init: error running hook #0: error running hook: exit status 1, stdout: , stderr: Auto-detected mode as 'legacy' nvidia-container-cli: requirement error: unsatisfied condition: cuda>=11.8, please update your driver to a newer version, or use an earlier cuda container: unknown. The likely cause is Ray 2.9 increased the required CUDA version to 11.8. This PR updates the CUDA version used in the GCP VM image in the example cluster YAML file from 11.3 to 12.1. The test passes after this change. Related issue number Closes #42134 --------- Signed-off-by: Archit Kulkarni <[email protected]>
- Loading branch information