Train on episode end

Souphis · December 8, 2021, 10:36pm

Hello!

Is there possible to perform off-policy agent training at the episode end? My case is the following: episode can last up to 50 iterations and for example, agent terminates episode after 30 iterations. During interaction with the environment, there is no training. The training is performed at the end of the episode with 30 train iterations. I know that I can set “batch_mode” to “complete_episode”, however, I do not how to dynamically set train iteration. Thanks in advance for the help!

Greg

rusu24edward · December 28, 2021, 8:08pm

The configuration contains a batch_mode argument, which indicates whether the trainer should truncate episodes to generate the rollouts or whether it should wait for completed episodes. This may be what you’re looking for.

Topic		Replies	Views
Is there a mix between truncate_episodes and complete_episodes? RLlib	0	238	July 20, 2022
How to tell RLLIB trainer (Not Tune) to run that many number of episodes RLlib	7	1173	June 9, 2023
Delayed Learning Due To Long Episode Lengths RLlib	9	1295	September 10, 2021
Does training_iteration correspond to number of episodes? RLlib	1	1054	February 19, 2022
[RLlib] Batch size for complete_episodes issue RLlib	6	2146	February 3, 2022

Train on episode end

Related topics