Creation of multiple actors on a single GPU in Ray leads to multiple Cuda context loading, causing increased memory usage and slower speed

Went-Liang · June 25, 2023, 11:21am

Dear maintainers of the Ray open-source project,

We have recently discovered that when multiple Actors are created on a single GPU in Ray, it uses multiple processes to implement them, which causes multiple loading of Cuda contexts. This leads to significant increase in GPU memory usage, slower launch speed of Cuda kernels, and more time for communication due to additional memcpy. We have also found a similar issue reported by other users on Does ray load the CUDA context multiple times?.

We would like to inquire if there are any possible solutions to mitigate this issue, or if there are any plans to address this problem in the future. We appreciate your help and advice with resolving this issue.

Thank you for your attention to our issue.

Topic		Replies	Views
Does ray load the CUDA context multiple times? Ray Core	3	561	October 16, 2022
How do Ray actors share a GPU? Ray Core	2	2326	December 15, 2021
Why increasing the number of parallel GPU tasks make it faster Ray Core	2	330	April 5, 2023
Single GPU multiprocessing Ray Core	1	663	October 15, 2022
[Ray Core] RuntimeError: No CUDA GPUs are available Ray Core	5	4987	October 15, 2022

Creation of multiple actors on a single GPU in Ray leads to multiple Cuda context loading, causing increased memory usage and slower speed

Related topics