If I want to continue training my algorithm after saving it to a checkpoint, how can I do that?
My idea is shown below, but it does not work as written: an Algorithm instance (algo) cannot be passed directly to tune.Tuner:
algo = config.build()
algo.restore(user_checkpoint_dir)
results = tune.Tuner(
    algo,
    param_space=config,
    run_config=air.RunConfig(
        stop=stop,
        verbose=2,
        checkpoint_config=air.CheckpointConfig(checkpoint_at_end=True),
    ),
).fit()
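One alternative I can think of, but have not verified, is to wrap the restore call and the training loop in a plain function trainable, so that Tuner only sees a callable (rough sketch, assuming Ray 2.x; config, user_checkpoint_dir and stop are the same objects as above):

from ray import air, tune
from ray.air import session

def continue_training(_tune_config):
    # build a fresh Algorithm, load the old checkpoint, then keep training
    algo = config.build()
    algo.restore(user_checkpoint_dir)
    while True:
        result = algo.train()
        session.report(result)  # hand the metrics back to Tune every iteration

results = tune.Tuner(
    continue_training,
    run_config=air.RunConfig(stop=stop, verbose=2),
).fit()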
Hi,
This issue has been raised multiple times and it seems there is no clear solution.
I have been trying to link all the GitHub issues related to this in my own issue here: Fails restoring weights · Issue #41508 · ray-project/ray · GitHub.
Basically, there is a way to resume a failed tuning run with Tuner.restore,
but you cannot pass a new configuration, so it is useless for this.
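For reference, resuming that way looks roughly like this (a minimal sketch, assuming Ray 2.x; the experiment path and trainable name are placeholders, and the original configuration is reused as-is):

from ray import tune

tuner = tune.Tuner.restore(
    path="~/ray_results/my_experiment",  # directory of the interrupted run (placeholder)
    trainable="PPO",                     # must match the trainable used in the original run
    resume_errored=True,                 # also retry trials that errored out
)
results = tuner.fit()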
ray/rllib/examples/restore_1_of_n_agents_from_checkpoint.py at master · ray-project/ray · GitHub shows a way to restore the weights of your policy, but I have found it to be failing.
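As far as I understand, the gist of that example is to load only the policy weights from an Algorithm checkpoint and copy them into a freshly built Algorithm (sketch only; the checkpoint path is a placeholder and config stands for your AlgorithmConfig):

from ray.rllib.policy.policy import Policy

# returns a dict mapping policy ids to restored Policy objects
restored_policies = Policy.from_checkpoint("/path/to/checkpoint")
new_algo = config.build()  # freshly built Algorithm from the same config
new_algo.set_weights(
    {"default_policy": restored_policies["default_policy"].get_weights()}
)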
Any working example of how to do this would be welcome, as many people seem to have this problem.
Thanks for your reply, I will keep looking for a way to solve this problem. I think the issue lies in the Tune startup process, but I don't currently have the time or ability to go through the code bit by bit. I will get back to you if there is any progress.