Is there a way to set num_env_steps_sampled?

radillus · March 31, 2023, 2:51pm

Low: It annoys or frustrates me for a moment.

I’m now trying to train a PPO agent in custom env, the step() will cost about 3min.

algo = PPOConfig()
algo = algo.environment(env =CustomEnv)
algo = algo.framework('torch').build().train()

I notice that in result of train there has num_env_steps_sampled

...
num_agent_steps_sampled: 4000
num_agent_steps_trained: 4000
num_env_steps_sampled: 4000
num_env_steps_trained: 4000
num_env_steps_sampled_this_iter: 4000
num_env_steps_trained_this_iter: 4000
timesteps_total: 4000
num_steps_trained_this_iter: 4000
...

Is there a way to set num_env_steps_sampled?(for PPO and other built-in algorithms)
What’s the best practice of training in expensive step() env?

arturn · June 23, 2023, 7:45pm

Hi @radillus ,

The amount of steps sampled at minimum is mainly determined by the rollout_fragment_length, the number of envs per rollout worker and the number of workers. RLlib does not check if a maximum is reached on every step. The rollout workers just collect the fragments and once send them back to the main Algorithm instance.

Then, the number of samples that are sampled on each iteration is also bound by the batch size.
RLlib will collect at least as many samples as the batch size dictates.

Play around with these numbers to see what happens if you are not clear about it

Topic		Replies	Views
Num_env & agent_steps_trained 0 even though steps sampled? RLlib	7	862	April 25, 2024
Num_agent_steps_trained: 0 Configure Algorithm, Training, Evaluation, Scaling	2	242	May 4, 2024
Num_agent_steps less than num_env_steps RLlib	0	221	July 15, 2021
Get the number of training steps when loading a trained agent RLlib	2	594	March 16, 2021
Is the NUM_ENV_STEPS_TRAINED logged incorrectly, if not how to interpret it compared to NUM_MODULE_STEPS_TRAINED? Configure Algorithm, Training, Evaluation, Scaling	0	14	June 7, 2025

Is there a way to set num_env_steps_sampled?

Related topics