I have a local server with 8 GPUs.
No matter which configuration I use, Ray always ignores the first GPU:
ray.init(local_mode=False, num_cpus=10, num_gpus=8)
Ideas?
The Ray scheduler doesn’t bind a GPU to each worker; the GPU count is only used for scheduling. My guess is that your application code is not using the first GPU for some reason. Are you using Tune or RLlib?
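One way to narrow this down is to log which GPU ids each worker process can actually see. As far as I know, Ray sets the CUDA_VISIBLE_DEVICES environment variable for workers that request GPUs, so printing it from inside the training function should show whether GPU 0 is ever handed out. Below is a minimal sketch that uses only the standard library; the helper names `visible_gpu_ids` and `describe_gpu_visibility` are made up for illustration and are not part of Ray.

```python
import os


def visible_gpu_ids(env=None):
    """Return the ids listed in CUDA_VISIBLE_DEVICES, or None if it is unset.

    `env` is a hypothetical hook for testing; it defaults to os.environ.
    """
    env = os.environ if env is None else env
    raw = env.get("CUDA_VISIBLE_DEVICES")
    if raw is None:
        # Variable unset: CUDA does not restrict which GPUs the process sees.
        return None
    # Split the comma-separated list and drop empty entries.
    return [part.strip() for part in raw.split(",") if part.strip()]


def describe_gpu_visibility(env=None):
    """Build a one-line, human-readable summary for logging."""
    ids = visible_gpu_ids(env)
    if ids is None:
        return "CUDA_VISIBLE_DEVICES is unset; all GPUs are visible"
    if not ids:
        return "CUDA_VISIBLE_DEVICES is empty; no GPUs are visible"
    return "visible GPU ids: " + ", ".join(ids)


if __name__ == "__main__":
    print(describe_gpu_visibility())
```

Calling `print(describe_gpu_visibility())` at the start of the Tune trainable, once per trial, should make it clear whether id 0 never appears in any worker. `ray.get_gpu_ids()` called inside the worker should report the same assignment from Ray's side.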
Thanks for the response. I’m using Tune. When running in local mode, Ray uses the first GPU. When I’m not using Ray at all, the first GPU is selected and runs properly.
cc @rliaw Do you know what’s the cause of this issue?
No, unfortunately I don’t. Maybe you can look at the dashboard to see what’s going on?