Hi there,
I want to understand how to restore agents from checkpoints. I'm training with:
analysis = tune.run(...)
With this I can restore the best agent this way:
best_trial = analysis.get_best_trial('episode_reward_mean', mode='max')
checkpoints = analysis.get_trial_checkpoints_paths(trial=best_trial, metric='episode_reward_mean')
checkpoint_path = checkpoints[0][0]
agent = PPOTrainer(config=my_config, env=my_env)
agent.restore(checkpoint_path)
This works if the whole process happens in one go, but if something happens to the Python session, the analysis variable is lost, and with it all the trials and checkpoints.
So I understood that I can just pass an absolute path as checkpoint_path instead. It's not the most convenient, but I am able to restore agents this way by giving a path like so:
/Users/or/ray_results/myPPOrun/PPO_TradingEnv_01ac9_00000_0_2021-01-22_18-00-54/checkpoint_9/checkpoint-9
But if there are, say, 100 checkpoints, how can I find the best agent? There would be tons of directories to go through manually.
There must be a better way to do this without the analysis object.
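One thing I've sketched as a workaround is a small stdlib-only scanner. It is a hypothetical helper, not part of the Ray API, and it assumes Tune's usual on-disk layout: one sub-directory per trial under the experiment folder, each with a result.json (one JSON line per training iteration) and checkpoint_<n> directories containing a checkpoint-<n> file. I'm not sure this is the intended approach:

```python
# Hypothetical helper (not a Ray API): walk an experiment directory
# and return the path of the checkpoint whose training iteration had
# the highest value for a given metric.
import json
import os

def find_best_checkpoint(experiment_dir, metric="episode_reward_mean"):
    best_score, best_path = float("-inf"), None
    for trial in os.listdir(experiment_dir):
        trial_dir = os.path.join(experiment_dir, trial)
        results_file = os.path.join(trial_dir, "result.json")
        if not os.path.isfile(results_file):
            continue  # not a trial directory
        with open(results_file) as f:
            for line in f:  # one JSON result per training iteration
                result = json.loads(line)
                it = result.get("training_iteration")
                ckpt = os.path.join(
                    trial_dir, f"checkpoint_{it}", f"checkpoint-{it}")
                # Only iterations that actually saved a checkpoint count.
                if os.path.isfile(ckpt) and result.get(metric, float("-inf")) > best_score:
                    best_score = result[metric]
                    best_path = ckpt
    return best_path
```

Then I could do agent.restore(find_best_checkpoint("/Users/or/ray_results/myPPOrun")) after a crash, but it feels like reimplementing something Tune already tracks.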
Thanks for any advice!