Hi there, I’m wondering whether there is a parameter to control when the RL agent starts training, i.e. only after a certain number of steps, in trainer.train() and tune.run(), like nb_steps_warmup in the keras-rl package (see a use case and the internal details).

Thanks in advance.
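For reference, this is the kind of warmup I mean in keras-rl (a minimal sketch based on the standard DQNAgent CartPole example; the model, policy, and values here are just placeholders):

```python
import gym
from keras.models import Sequential
from keras.layers import Dense, Flatten
from keras.optimizers import Adam
from rl.agents.dqn import DQNAgent
from rl.memory import SequentialMemory
from rl.policy import BoltzmannQPolicy

env = gym.make("CartPole-v0")
nb_actions = env.action_space.n

# Tiny Q-network, just enough to make the example runnable.
model = Sequential()
model.add(Flatten(input_shape=(1,) + env.observation_space.shape))
model.add(Dense(16, activation="relu"))
model.add(Dense(nb_actions, activation="linear"))

dqn = DQNAgent(
    model=model,
    nb_actions=nb_actions,
    memory=SequentialMemory(limit=50000, window_length=1),
    policy=BoltzmannQPolicy(),
    nb_steps_warmup=1000,  # no training updates during the first 1000 env steps
    target_model_update=1e-2,
)
dqn.compile(Adam(lr=1e-3), metrics=["mae"])
dqn.fit(env, nb_steps=50000, verbose=1)
```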
Some algos have a learning_starts parameter, namely those that use a replay buffer. For on-policy algos, such a setting wouldn’t really make sense, since samples from timesteps earlier than learning_starts would simply be discarded without any effect on training.
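For example, something like this (a sketch assuming an older RLlib config-dict API where learning_starts is a top-level config key for DQN; in newer versions it has moved into the replay buffer config):

```python
import ray
from ray import tune

ray.init()

tune.run(
    "DQN",
    stop={"timesteps_total": 100000},
    config={
        "env": "CartPole-v0",
        # Fill the replay buffer with this many env steps before the first
        # gradient update, analogous to nb_steps_warmup in keras-rl.
        "learning_starts": 1000,
    },
)
```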
Ohh… Thanks! That makes sense. Originally I intended to let the agent explore more and see more of the environment, because recently I found that PPO on my custom env (which has a large action and state space) gets stuck in a local optimum and only rarely escapes it to reach a better result. Maybe I should first try increasing the train_batch_size from the default configuration (I’ve just started learning RL and am quite confused, lol)?
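Something like this is what I had in mind (just an illustrative sketch, not a tuned config; train_batch_size, sgd_minibatch_size, and entropy_coeff are the standard PPO config keys, and the values are guesses on my part):

```python
import ray
from ray import tune

ray.init()

tune.run(
    "PPO",
    stop={"timesteps_total": 1000000},
    config={
        "env": "CartPole-v0",        # would be my custom env here
        "train_batch_size": 16000,   # default is 4000; more samples per update
        "sgd_minibatch_size": 1024,  # default is 128
        "entropy_coeff": 0.01,       # default is 0.0; bonus that encourages exploration
    },
)
```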