Callback on_episode_end is not triggered

zoe_tsekas · May 31, 2023, 5:14pm

When I run PPO training, I see the callback method "on_episode_start" called once,
but "on_episode_end" is never called.

When are "on_episode_start" and "on_episode_end" called?

Do they depend on a particular combination of algorithm config parameters, e.g. count_steps_by="env_steps" vs. count_steps_by="agent_steps"?

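# `algo` is the already-built PPO algorithm; `env_cfg` is my environment config dict.
# Note: each algo.train() call is one training iteration, not one episode.
for i in range(env_cfg['max_episode_steps']):
    print(f"Iteration: {i}")
    result = algo.train()
    # Save a checkpoint after every training iteration.
    checkpoint_dir = algo.save()
# Release the algorithm's resources once training is done.
algo.stop()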
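One possible explanation, offered as a toy model and not as RLlib's actual sampling code: an end-of-episode callback can only fire once an episode actually finishes (the environment reports it as terminated or truncated). If each training iteration samples fewer steps than one episode lasts, the start callback fires once and the end callback never does. The sketch below uses pure Python; `CountingCallbacks`, `sample`, `steps_to_sample`, and `episode_length` are hypothetical names chosen for illustration only.

```python
class CountingCallbacks:
    """Hypothetical stand-in for a callbacks class; this is NOT RLlib's API."""

    def __init__(self):
        self.starts = 0
        self.ends = 0

    def on_episode_start(self):
        self.starts += 1

    def on_episode_end(self):
        self.ends += 1


def sample(callbacks, steps_to_sample, episode_length, state=None):
    """Collect a fixed number of env steps, firing callbacks at episode boundaries.

    `state` is the step count of an unfinished episode carried over from the
    previous call, or None if no episode is currently in progress.
    """
    steps_in_episode = state
    for _ in range(steps_to_sample):
        if steps_in_episode is None:
            # A new episode begins on the first step after a reset.
            callbacks.on_episode_start()
            steps_in_episode = 0
        steps_in_episode += 1
        if steps_in_episode >= episode_length:
            # The toy environment reports "done" only here.
            callbacks.on_episode_end()
            steps_in_episode = None
    return steps_in_episode


# Case 1: the episode is longer than everything sampled in three iterations.
long_cb = CountingCallbacks()
state = None
for _ in range(3):
    state = sample(long_cb, steps_to_sample=100, episode_length=1000, state=state)

# Case 2: episodes are short, so they finish within each iteration.
short_cb = CountingCallbacks()
state = None
for _ in range(3):
    state = sample(short_cb, steps_to_sample=100, episode_length=50, state=state)

print("long episodes: ", long_cb.starts, "starts,", long_cb.ends, "ends")
print("short episodes:", short_cb.starts, "starts,", short_cb.ends, "ends")
```

In the first case the counts match the symptom described above (one start, no end). Whether this applies here depends on whether the environment ever signals the end of an episode within the sampled steps, which is worth checking first.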
