Episode Reward Drops Without Recovery

max_ronda · November 9, 2023, 7:20pm

How severely does this issue affect your experience of using Ray?: High

Hello all,

I am running into an issue where training stalls completely: the reward drops and never recovers. I am using the PPO algorithm on a custom environment with Ray 2.7.1. This is what the episode reward mean looks like for all my trials:

Has anyone run into this issue, and if so, how did you configure your run to make it work?

Thanks so much!
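
To make the symptom concrete when sharing logs, here is a minimal sketch of how a "drop without recovery" could be flagged from a list of per-iteration episode reward means. The function name `find_unrecovered_drop`, its `drop_fraction` and `patience` parameters, and the sample numbers are all hypothetical; none of this is part of RLlib, and the threshold rule is only one possible definition of a collapse.

```python
from typing import Optional, Sequence


def find_unrecovered_drop(
    rewards: Sequence[float],
    drop_fraction: float = 0.5,  # hypothetical threshold: fraction of the best value lost
    patience: int = 5,           # hypothetical: minimum length of the collapsed tail
) -> Optional[int]:
    """Return the index where the reward collapses and stays collapsed, else None.

    A value counts as collapsed when it is below
    best_so_far - drop_fraction * abs(best_so_far).
    A collapse is reported only if it lasts until the end of the series
    and covers at least `patience` entries.
    """
    best = float("-inf")
    drop_start: Optional[int] = None
    for i, r in enumerate(rewards):
        collapsed = best != float("-inf") and r < best - drop_fraction * abs(best)
        if collapsed:
            if drop_start is None:
                drop_start = i  # first iteration of the current drop
        else:
            drop_start = None   # the run recovered (or never dropped)
            best = max(best, r)
    if drop_start is not None and len(rewards) - drop_start >= patience:
        return drop_start
    return None


# Made-up reward means, for illustration only (not taken from the trials above).
sample = [10, 50, 120, 200, 210, 30, 25, 20, 22, 21]
print(find_unrecovered_drop(sample))  # prints 5
```

Reporting the iteration at which the drop starts, together with the learning rate and clipping settings in use at that point, may make it easier for others to compare against their own runs.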
