Hi,
I see here that it seems possible to allocate a fraction of a GPU to a Ray actor with something like `@ray.remote(num_gpus=0.25)`.
If N actors are on the same GPU, how do they run?
- A) They run in parallel on different CUDA cores?
- B) The GPU is time-sliced: the CUDA kernels requested by each actor run in sequence (what I believe is NVIDIA's default behavior: if two processes talk to the GPU, their kernels execute one after the other; see "Running more than one CUDA applications on one GPU" on Stack Overflow).
If it's A, that would be pretty revolutionary, as I think only MPS, CUDA streams, or MIG enable true concurrency on NVIDIA GPUs. If it's B, then I'd encourage putting (A) on the roadmap to make Ray even more appealing.