PBT Replay with RLlib

Ciro_NA · September 6, 2023, 4:30pm

Hi everyone,

I’m having trouble replaying a PBT training run. I used the following code for the tuning:

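from ray import air, tune

# `pbt` (the PBT scheduler) and `config_PPO` (the PPO config that carries
# the custom environment and its env_config) are defined earlier.
tuner = tune.Tuner(
    "PPO",
    tune_config=tune.TuneConfig(
        metric="episode_reward_mean",
        mode="max",
        scheduler=pbt,
        num_samples=100,
    ),
    param_space=config_PPO,
    run_config=air.RunConfig(
        # Stop every trial after 200 training iterations.
        stop={"training_iteration": 200},
    ),
)
results = tuner.fit()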

I’m using a custom environment that I passed directly as a class to PPOConfig, together with an env_config. PopulationBasedTrainingReplay needs a trainable class, so passing only the string “PPO” as in the code above is not an option. There are some examples of how to replay PBT with a generic PyTorch model, but I haven’t found any for RLlib algorithms. Is there an example of how to define a trainable class for this purpose? I would really appreciate any recommendations.
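
For context, here is a rough sketch of the kind of replay setup being asked about. It is untested and rests on several assumptions: that RLlib's PPO algorithm class can be handed to the Tuner as the trainable, that the policy log is named pbt_policy_<trial_id>.txt inside the experiment directory, and that the logged config can restore the custom environment (which may not hold when the environment was passed as a class). The helper names policy_file_path and build_replay_tuner are hypothetical.

```python
import os


def policy_file_path(experiment_dir, trial_id):
    """Build the path of the PBT policy log for one trial.

    Assumption: the log is named pbt_policy_<trial_id>.txt and lives
    directly in the experiment directory. Check your ray_results folder.
    """
    return os.path.join(
        os.path.expanduser(experiment_dir), f"pbt_policy_{trial_id}.txt"
    )


def build_replay_tuner(policy_file, param_space, stop_iters=200):
    """Return a Tuner that replays one trial's logged PBT schedule."""
    # Ray is imported lazily so the path helper works without it installed.
    from ray import air, tune
    from ray.rllib.algorithms.ppo import PPO
    from ray.tune.schedulers import PopulationBasedTrainingReplay

    replay = PopulationBasedTrainingReplay(policy_file)
    return tune.Tuner(
        PPO,  # assumption: the algorithm class serves as the trainable
        tune_config=tune.TuneConfig(scheduler=replay),
        param_space=param_space,
        run_config=air.RunConfig(stop={"training_iteration": stop_iters}),
    )
```

With this sketch, build_replay_tuner(policy_file_path(experiment_dir, trial_id), config_PPO).fit() would start the replay, where the experiment directory and trial ID come from the original PBT run.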
