Why auto-adjust `rollout_fragment_length` by a floor division instead of ceiling operation?

klausk55 · June 8, 2022, 1:20pm

How severely does this issue affect your experience of using Ray?

Low: It annoys or frustrates me for a moment.

The PPOTrainer class auto-adjusts rollout_fragment_length by a floor division if
train_batch_size % (num_workers * num_envs_per_worker * rollout_fragment_length) != 0.
Is there a reason why rollout_fragment_length isn’t auto-adjusted by a ceiling operation instead?

An example:
train_batch_size = 4000
rollout_fragment_length = 200
num_workers = 7
num_envs_per_worker = 1

RLlib auto-adjusts rollout_fragment_length to 571 (the result of the floor division 4000 // 7) and ends up collecting a train batch of size 7994: one sampling round yields only 7 * 571 = 3997 steps, which is below train_batch_size, so a second round is needed.
A ceiling operation instead, i.e. math.ceil(train_batch_size / (num_workers * num_envs_per_worker)), would yield a new rollout_fragment_length of 572, and the collected train batch would have a size of only 7 * 572 = 4004.

Sample collection per train step would be cut roughly in half (4004 instead of 7994 steps).
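
For reference, here is a minimal sketch of the arithmetic in the example. It is not RLlib's actual code: the helper `collected_batch_size` is a hypothetical name, and it assumes that every sampling round gathers num_workers * num_envs_per_worker * rollout_fragment_length steps and that rounds repeat until at least train_batch_size steps have been collected.

```python
import math


def collected_batch_size(train_batch_size, num_workers,
                         num_envs_per_worker, rollout_fragment_length):
    """Hypothetical helper: total steps collected for one train batch.

    Assumes each sampling round yields one fragment per env on every
    worker, and that rounds repeat until train_batch_size is reached.
    """
    per_round = num_workers * num_envs_per_worker * rollout_fragment_length
    rounds = math.ceil(train_batch_size / per_round)
    return rounds * per_round


train_batch_size = 4000
num_workers = 7
num_envs_per_worker = 1
num_envs_total = num_workers * num_envs_per_worker

# Floor division: the adjustment described in the question.
floor_len = train_batch_size // num_envs_total
floor_total = collected_batch_size(
    train_batch_size, num_workers, num_envs_per_worker, floor_len)

# Ceiling: the proposed alternative.
ceil_len = math.ceil(train_batch_size / num_envs_total)
ceil_total = collected_batch_size(
    train_batch_size, num_workers, num_envs_per_worker, ceil_len)

print(f"floor: fragment length {floor_len}, collected {floor_total}")
print(f"ceil:  fragment length {ceil_len}, collected {ceil_total}")
```

With the floor, a single round always falls short of train_batch_size whenever the division is not exact, so a second full round is sampled. With the ceiling, a single round is always enough.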

arturn · June 9, 2022, 12:02pm

Hi klausk55! Thanks for raising this.
That’s indeed a logical flaw. I’ll write a PR.

arturn · June 9, 2022, 12:48pm

The PR: [RLlib] PPO automatic train batch size calculation fix by ArturNiederfahrenhorst · Pull Request #25621 · ray-project/ray · GitHub
