Question about resource management in Ray

javigm98 · April 23, 2021, 9:19am

Okay, @sangcho, that was what I was looking for. In fact, I saw that only one thread per worker was executing at any given time, but why does Ray create so many threads for each worker? I mean, I always see one thread in the Running state for each worker (in the image you can see three threads running, one of them belonging to the htop process itself), but why are so many threads created for each worker, and why do they take turns executing? Maybe this is more a question about how RLlib and PPO work, and it would be more appropriate to move it to the RLlib section…

Anyway, thanks in advance for your answers.

sangcho · April 24, 2021, 5:33am

A Ray worker is not just a Python process; it also has C++ code attached (for Ray-related operations), and that code is multithreaded to optimize performance!
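To make the "many threads, few running" observation concrete, here is a minimal sketch in plain Python. It does not use Ray and says nothing about Ray's actual internals; `spawn_idle_threads` and `os_thread_count` are hypothetical helper names, and the `/proc/self/task` lookup is assumed to work only on Linux. It simply shows that a process can own several threads that spend nearly all their time waiting, which is why htop lists them without showing them as Running.

```python
import os
import threading


def os_thread_count():
    """Return the number of OS-level threads in this process (Linux only), else None."""
    task_dir = "/proc/self/task"
    if os.path.isdir(task_dir):
        return len(os.listdir(task_dir))
    return None


def spawn_idle_threads(n, stop_event):
    """Start n background threads that only block on an event (hypothetical helper)."""
    threads = [threading.Thread(target=stop_event.wait, daemon=True) for _ in range(n)]
    for t in threads:
        t.start()
    return threads


stop = threading.Event()
before = threading.active_count()
helpers = spawn_idle_threads(4, stop)
after = threading.active_count()

# The helper threads exist but are blocked, so they consume almost no CPU.
print("Python threads before/after:", before, after)
print("OS threads (Linux only):", os_thread_count())

# Release the helpers and wait for them to exit.
stop.set()
for t in helpers:
    t.join()
```

Running this while watching htop (with thread display enabled) should show the extra threads appear in a sleeping state rather than Running.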

sangcho · April 24, 2021, 5:34am

Also, there are other Ray components on each Ray node (e.g., the Raylet, a scheduler, a dashboard agent, etc.), which also use some CPU. They shouldn't use much, but they all use some (and they can run on any CPU in your machine, since Ray doesn't do resource isolation the way Docker does).
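The "no resource isolation" point can be checked from inside any process with the standard library. This is a sketch, not Ray code: `allowed_cpus` is a hypothetical helper, and `os.sched_getaffinity` is assumed to be available only on Linux-like systems. Unless something outside the process (for example `taskset` or a container runtime) has restricted it, the set printed below normally covers every CPU, so the operating system is free to place the process's threads on any core.

```python
import os


def allowed_cpus():
    """Return the set of CPU indices this process may run on, or None if unavailable."""
    if hasattr(os, "sched_getaffinity"):
        return os.sched_getaffinity(0)  # 0 means "the calling process"
    return None


cpus = allowed_cpus()
total = os.cpu_count()

print("CPUs visible to the OS:", total)
print(
    "CPUs this process may be scheduled on:",
    sorted(cpus) if cpus is not None else "unknown on this platform",
)
```

If the two values match, nothing is pinning the process to specific cores, which is consistent with seeing worker and system threads move between CPUs in htop.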

