GPU configuration with Cluster Launcher + On-premise Cluster

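Quick update: I solved the issue by adding --gpus all to the run_options list in the docker section of the cluster config file. Those options are passed through to docker run, so the flag exposes the host's GPUs to the Ray container. For example: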

```yaml
docker:
  container_name: ray_container
  image: rayproject/ray-ml:latest-gpu
  pull_before_run: true
  run_options:
  - --ulimit nofile=65536:65536
  - --gpus all
```
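
If only some of the machines have GPUs, the flag could probably be scoped to those nodes instead of being applied everywhere. Here is an untested sketch, assuming the workers have GPUs (and the NVIDIA container runtime) while the head node does not. It relies on the cluster launcher's worker_run_options key; everything else is carried over from the config above.

```yaml
docker:
  container_name: ray_container
  image: rayproject/ray-ml:latest-gpu
  pull_before_run: true
  run_options:
    # Applied to every node, head and workers alike
    - --ulimit nofile=65536:65536
  worker_run_options:
    # Assumption: only the worker machines have GPUs,
    # so the flag is passed to docker run on workers only
    - --gpus all
```

There is a matching head_run_options key for the opposite case. If every node has a GPU, the plain run_options version above is simpler.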
