Hyperparameters of PPO on Ray Cluster

Ali_Shehper · July 18, 2023, 10:43pm

Hey guys,

This is more of an RL question, but I thought I might be able to get some help here.

I have a manual implementation of PPO and a custom environment. I previously tuned the PPO hyperparameters for my problem using random search and obtained very good performance. In particular, the best-performing number of parallel actors in that search was 4.

Now I am trying to speed up my algorithm using Ray and Ray Clusters. As I will have access to more cores, can I expect the same kind of performance if I use a different (higher) number of parallel actors, keeping all the other hyperparameters the same?
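
To make the question concrete, here is a minimal sketch of the bookkeeping I have in mind. It assumes every actor collects a fixed number of environment steps per PPO update; the function names and the numbers (128 steps, 16 actors) are hypothetical and not taken from my actual code.

```python
def rollout_batch_size(num_actors: int, steps_per_actor: int) -> int:
    """Transitions gathered per PPO update if every actor contributes equally."""
    return num_actors * steps_per_actor


def steps_to_keep_batch(target_batch: int, num_actors: int) -> int:
    """Per-actor rollout length that keeps the total batch size unchanged."""
    if target_batch % num_actors != 0:
        raise ValueError("target_batch must be divisible by num_actors")
    return target_batch // num_actors


# Hypothetical tuned setup: 4 actors, 128 steps each.
baseline = rollout_batch_size(num_actors=4, steps_per_actor=128)

# Same per-actor rollout length on 16 actors: 4x the data per update.
scaled = rollout_batch_size(num_actors=16, steps_per_actor=128)

# Alternative: shorten each rollout so the batch size matches the tuned setup.
adjusted_steps = steps_to_keep_batch(baseline, num_actors=16)

print(baseline, scaled, adjusted_steps)
```

Under that assumption, raising the actor count with everything else fixed changes the amount of data per update, and therefore the number of minibatches per epoch. Shortening each rollout restores the batch size, but it also shortens the trajectory segment available for advantage estimation.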

I am aware that I could use RLlib if I want faster performance, but I thought it would be a good exercise to speed up my PPO using Ray and Ray Clusters.

Thank you!
