Does training_iteration correspond to number of episodes?

carlorop · February 18, 2022, 3:38pm

According to the documentation, ‘training_iteration’ counts the number of times tune.report() has been called. Would that always be equivalent to the number of training episodes when training a RL agent on RLLIB?

Lars_Simon_Zehnder · February 19, 2022, 10:41am

Hi @carlorop ,

training_iteration does count the number of iterations in which a training step has been made. This is however not identical with the number of episodes in RLlib. The reason for this is that whenever the RolloutWorkers in RLlib collect new experiences from the environment they can do so either, by using a predefined number of steps in the environment or by stepping for as long as an episode takes. We define one or the other by setting batch_mode to either truncate_episodes (the default) or complete_episodes. These settings define what data gets collected into a training batch.

Note, a training batch can then contain multiple episodes for both cases, however, complete_episodes ensures that there are always complete episodes in the training batch (as long as there is no horizon set).

Coming back to your question now: A single training batch usually contains not a single episode and as the Trainer trains on a batch training_iteration and number of episodes stepped in the environment are not the same.

For the configuration setting take a look into the Trainer configuration.

Hope this helps

Topic		Replies	Views
What does the 'training_iteration' parameter relate to in the RLlib? RLlib	3	701	May 4, 2021
Does the agent train per episode or per iteration RLlib	1	564	November 1, 2021
[Tune] [RLlib] Episodes vs iterations vs trials vs experiments RLlib	1	2333	June 3, 2021
How to tell RLLIB trainer (Not Tune) to run that many number of episodes RLlib	7	1173	June 9, 2023
Get the number of training steps when loading a trained agent RLlib	2	601	March 16, 2021

Does training_iteration correspond to number of episodes?

Related topics