How to drop rollouts when the environment raises an error?

sogartar · February 23, 2022, 5:24pm

I have a pretty minimalistic setup of training PPO+PBT.

from gym import Env
from gym.spaces import Discrete
import ray
from ray.tune.schedulers import PopulationBasedTraining
from ray.tune import run
from ray.tune.registry import register_env


class CrashingEnv(Env):
    action_space = Discrete(2)
    observation_space = Discrete(3)

    def reset(self):
        return self.observation_space.sample()

    def step(self, action):
        raise RuntimeError()


def make_crashing_env(env_config) -> CrashingEnv:
    del env_config
    return CrashingEnv()


def test_train_crashing_env_ray_pbt_ppo():
    ray.init(local_mode=True)

    pbt = PopulationBasedTraining(metric="episode_reward_mean", mode="max")
    register_env("crashing_env", make_crashing_env)
    config = {
        "env": "crashing_env",
        "num_workers": 0,
    }
    run(
        run_or_experiment="PPO",
        name="crashing_env_ray_pbt_ppo",
        scheduler=pbt,
        config=config,
        )


if __name__ == "__main__":
    test_train_crashing_env_ray_pbt_ppo()

The environment deliberately raises an error on step(...). This would cause the rollout collection to fail and then the trial to fail.
How can I configure the training to drop/ignore the faulty rollout, remake the crashed environment in the rollout worker and continue training?

Topic		Replies	Views
How to throw the fail trials Ray Tune	1	388	January 7, 2022
Unable to train on PettingZoo Atari "double_dunk" environment RLlib	0	25	May 5, 2025
Custom Gymnasium environment keeps crashing RLlib	3	261	February 10, 2024
Register a custom environment and runing PPOTrainer on that environment not working RLlib	7	2871	September 24, 2023
Error "AttributeError: 'RolloutWorker' object has no attribute 'config' " in custom environment Configure Algorithm, Training, Evaluation, Scaling	2	239	January 27, 2024

How to drop rollouts when the environment raises an error?

Related topics