Okay, @sangcho, that was what I was looking for. In fact, I saw that only one thread per worker was executing at any given time, but why does Ray create so many threads for each worker? I always see one thread in the Running state for each worker (in the image you can see three threads running, one of them corresponding to the htop process), but why are many threads created for each worker, and why do they execute alternately? Maybe this is more a question of understanding how RLlib and PPO work, and it would be more appropriate to move the question to the RLlib section…
Anyway, thanks in advance for your answers.