Hi there! I know that under the hood, RLlib algorithms with multiple workers usually perform episode rollouts, with each worker doing multiple rollouts until the specified training batch size is reached. I was wondering whether it's possible to specify instead that each worker should collect X episodes? That way it wouldn't matter if some workers end up with shorter episodes than others; they would all collect an even number of episodes.
I think the gist of this question lies in understanding the relationship between RLlib rollout batches and episodes. Hints lie in the explanation of the batch_mode parameter in AlgorithmConfig().env_runners().
With the complete_episodes setting, you can ensure that no episodes are truncated, so each rollout batch contains only whole episodes and its configured size acts as a minimum. Together with the train_batch_size parameter, you should be able to tweak batches with a consistent number of episodes.
This is easier if every episode has the same length, but that of course depends on your environment design.
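As a rough sketch of what such a config could look like (the algorithm, environment, and numeric values here are just illustrative choices, not from your setup, and the exact keyword names can vary between Ray versions):

```python
# Hypothetical example: collect only complete episodes per worker,
# then fill the train batch with whole episodes.
from ray.rllib.algorithms.ppo import PPOConfig

config = (
    PPOConfig()
    .environment("CartPole-v1")  # placeholder env
    .env_runners(
        num_env_runners=4,
        # Never truncate episodes mid-rollout: each sampled
        # batch consists only of complete episodes.
        batch_mode="complete_episodes",
    )
    .training(
        # Each training step gathers at least this many env steps;
        # with complete_episodes, whole episodes are appended until
        # this threshold is reached (so the batch may overshoot it).
        train_batch_size=4000,
    )
)
algo = config.build()
```

Note that with variable-length episodes the batch can overshoot train_batch_size, since the last episode is never cut off.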