I train and save my RL agents (using PPO) and later load them again for inference using agent = restore(path)
.
Is there any way to get the number of steps the loaded agent was trained? I’d like to print/show them in my application.
I saw there is agent.training_iteration
and agent.config
, which holds a ton of config settings, but I can’t seem to find the number of training steps.
I also don’t think I can easily derive it from the number of training iterations. In my example, the agent was trained for 90 iterations with a batch size defaulting to 4000. Still, my results show that the agent was trained for 5600 steps per iteration, resulting in 504000 training steps.
I don’t understand where the 5600 comes from and also don’t find it anywhere in the configuration.