How to estimate the total number of timesteps?

XavierM · June 22, 2021, 9:35am

When using RLlib with configuration parameters unchanged, I see in the reports that timesteps_total=4000 per iteration when using ‘PPO’ or ‘A2C’ and timesteps_total=10000 per iteration when using ‘TD3’.

Is there a way to programatically get this timesteps_total value before a “run”? (I mean the expected timesteps_total value, as of course, the real value will be obtained when the “run” is done.)

stefanbschneider · June 28, 2021, 5:50pm

For PPO, the number of time steps per iteration depend on config["train_batch_size"], which defaults to 4000: RLlib Algorithms — Ray v1.4.0
So you can use the config dict to programmatically change or read the time steps per iteration.

For A2C and TD3, I’m not so sure how the time steps are determined. Here are the default config values:
https://docs.ray.io/en/latest/rllib-algorithms.html#advantage-actor-critic-a2c-a3c
https://docs.ray.io/en/latest/rllib-algorithms.html#deep-deterministic-policy-gradients-ddpg-td3

Topic		Replies	Views
'timesteps_per_iteration' parameter RLlib	1	801	July 21, 2021
How to change default agent_timesteps_total in rllib_trainer.train() RLlib	3	477	June 29, 2021
How to get timesteps_total from environment RLlib	0	321	July 17, 2022
[RLlib] Timesteps total gets reset everytime 'num_healthy_workers' goes down RLlib	1	259	December 30, 2020
Understanding agent_timesteps_total RLlib	2	576	February 3, 2023

How to estimate the total number of timesteps?

Related topics