I’m using Ray 2.8.1 (I’ve also tried the latest 2.38.0), and RLlib’s wall-clock training performance seems to get worse when I add rollout workers (env_runners).
The results above were produced with 2.8.1 using the following script:
```python
from ray import tune, train
from ray.rllib.algorithms.dqn import DQNConfig

config: DQNConfig = (
    DQNConfig()
    .environment("CartPole-v1")
    .rollouts(num_rollout_workers=0, num_envs_per_worker=8)
    .resources(num_gpus=1)
)

tuner = tune.Tuner(
    "DQN",
    param_space=config.to_dict(),
    run_config=train.RunConfig(
        name="CartPole_Env_Parallel",
        checkpoint_config=train.CheckpointConfig(checkpoint_at_end=True),
        stop={"episode_reward_mean": 300},
    ),
)
results = tuner.fit()
print(results.get_best_result())
```
- The black line was trained with `num_rollout_workers=0, num_envs_per_worker=8`; it reached a mean reward of 300 in about 3 minutes.
- The blue line was trained with `num_rollout_workers=8, num_envs_per_worker=1` (the only change from the script above; see the sketch below); it reached the same mean reward in about 9 minutes.
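For reference, a minimal sketch of the blue-line configuration; only the `.rollouts()` call differs from the script above, and `config_blue` is just an illustrative name:

```python
from ray.rllib.algorithms.dqn import DQNConfig

# Blue-line run: 8 remote rollout workers, each stepping a single env
# (everything else identical to the script above).
config_blue: DQNConfig = (
    DQNConfig()
    .environment("CartPole-v1")
    .rollouts(num_rollout_workers=8, num_envs_per_worker=1)
    .resources(num_gpus=1)
)
```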
Is this expected?
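For the 2.38.0 test mentioned above, I assume the same settings map to the renamed env_runners API roughly like this (a sketch only; the exact argument names may differ between versions):

```python
from ray.rllib.algorithms.dqn import DQNConfig

# Rough 2.38.0 equivalent of the black-line settings: .rollouts() is replaced
# by .env_runners(), with the renamed worker/env-count arguments (names may
# differ slightly depending on the version).
config_new: DQNConfig = (
    DQNConfig()
    .environment("CartPole-v1")
    .env_runners(num_env_runners=0, num_envs_per_env_runner=8)
    .resources(num_gpus=1)
)
```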