Ray Tune and Ray RLLIB

Archana_R · February 7, 2023, 3:20pm

Hi ,
I have finally managed to run the PPOTrainer.train() without much issues ( Thanks to @mannyv ) .
However, i see that my agent is not learning when i look into the rewards.

Should i first use Ray.tune for hyperparamter tuning and then use PPOTrainer.train() ?
How can i display the results from train() better ?

Thank you in advance!

arturn · April 14, 2023, 12:06am

Always visualize your results to understand what your agent is doing.
A good starting point for the work you are facing is http://joschu.net/docs/nuts-and-bolts.pdf

Topic		Replies	Views
A little help for a novice RLlib	1	429	October 26, 2022
Compute/display actions from ray.tune RLlib	10	1677	March 30, 2021
Agent.train() vs ray.tune.run RLlib	1	770	September 4, 2022
Some questions about tune	0	377	April 19, 2023
Learning curves Ray Tune	5	788	March 15, 2022

Ray Tune and Ray RLLIB

Related topics