Hi
Could someone please tell me how to change the default agent_timesteps_total in rllib_trainer.train()? The default is 4000 steps. How can we change this? Thanks
Hi Arif,
agent_timesteps_total is a metric that reports exactly what its name suggests: the total number of timesteps collected across all agents so far.
In the Tune documentation you will find multiple ways to stop a training run; here is the relevant tune.run() parameter:
- stop (dict | callable | Stopper): Stopping criteria. If dict, the keys may be any field in the return result of train(), whichever is reached first. If function, it must take (trial_id, result) as arguments and return a boolean (True if trial should be stopped, False otherwise). This can also be a subclass of ray.tune.Stopper, which allows users to implement custom experiment-wide stopping (i.e., stopping an entire Tune run based on some time constraint).
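For example, the function form could look like this (a minimal sketch; the 4000-step threshold just mirrors your case):

# Function-form stopper: Tune passes the trial id and the result dict
# returned by train(); returning True stops that trial.
def stop_fn(trial_id, result):
    return result["timesteps_total"] >= 4000

You would then pass it as tune.run(..., stop=stop_fn).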
So in your case, you could call tune.run() like this:
tune.run(stop={"timesteps_total": 4000}).
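Putting it together, a minimal sketch (the "PPO" trainable and the CartPole-v1 environment are illustrative assumptions, not something from your setup):

from ray import tune

tune.run(
    "PPO",                           # illustrative RLlib trainable
    config={"env": "CartPole-v1"},   # illustrative environment
    stop={"timesteps_total": 4000},  # stop once 4000 total timesteps are sampled
)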
Hope this helps
Thanks a lot, @arturn!
Hi @Arif_Jahangir,
There is also a key in the config called "timesteps_per_iteration" that controls how many new timesteps of experience are collected for each call to train(). For PPO the effective default is 4000 (it comes from PPO's train_batch_size), but you can adjust that if you want.
The intended usage in RLlib is that the train() function is called many times in a loop, either by you or automatically by Tune. You can use the stopping criteria @arturn mentioned in combination with Tune to determine when training should stop.
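A minimal sketch of that manual loop (assuming the older ray.rllib.agents PPOTrainer API; CartPole-v1 and the 1000-step setting are illustrative):

import ray
from ray.rllib.agents.ppo import PPOTrainer

ray.init()
trainer = PPOTrainer(config={
    "env": "CartPole-v1",
    # Collect at least this many new timesteps per call to train():
    "timesteps_per_iteration": 1000,
})

for i in range(5):
    result = trainer.train()             # one iteration: sample experience, then update
    print(i, result["timesteps_total"])  # cumulative timesteps across iterations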