Hi @davidADSP
Regarding multi-agent, I suggest you take a look at this issue:
It should let you get something out of get_policy() by specifying which policy you want (via its policy ID).
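As a rough sketch of the idea (not tested against your setup; the example env import path varies between Ray versions, and the policy IDs "policy_0"/"policy_1" are just illustrative):

```python
from ray.rllib.algorithms.ppo import PPOConfig
# NOTE: the import path of this example env differs between Ray versions.
from ray.rllib.examples.env.multi_agent import MultiAgentCartPole

config = (
    PPOConfig()
    .environment(MultiAgentCartPole, env_config={"num_agents": 2})
    .multi_agent(
        # "policy_0"/"policy_1" are made-up IDs for this sketch.
        policies={"policy_0", "policy_1"},
        policy_mapping_fn=lambda agent_id, *args, **kwargs: f"policy_{agent_id}",
    )
)
algo = config.build()
algo.train()  # one iteration, just so there is something trained

# get_policy() takes the policy ID you want to inspect/export.
pol = algo.get_policy("policy_0")
weights = pol.get_weights()
```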
Well, I don’t think what you would love is possible (tuner = ray.tune.Tuner(agent, …)). But there are ways around it: change the config and then do a new Tuner.fit() with the new config, starting from a previously saved checkpoint from the first tuner, without losing anything.
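Very roughly, the pattern looks like this (a minimal sketch assuming Ray ~2.x and the old API stack; the env, stop criteria and metric name are placeholders, and for brevity the second phase here uses the Algorithm API directly instead of a second Tuner.fit(), which is what my toy example does):

```python
from ray import air, tune
from ray.rllib.algorithms.algorithm import Algorithm
from ray.rllib.algorithms.ppo import PPOConfig

# Phase 1: initial tuning run.
config = (
    PPOConfig()
    .environment("CartPole-v1")
    .rollouts(num_rollout_workers=0)  # keeps weight handling local/simple
    .training(lr=5e-5)
)
tuner = tune.Tuner(
    "PPO",
    param_space=config.to_dict(),
    run_config=air.RunConfig(stop={"training_iteration": 10}),
)
results = tuner.fit()
best_ckpt = results.get_best_result(
    metric="episode_reward_mean", mode="max"  # metric name depends on Ray version
).checkpoint

# Phase 2: pull the trained weights out of the best checkpoint and continue
# training with a *changed* config (lr=0.0 here, see the note further down).
weights = Algorithm.from_checkpoint(best_ckpt).get_policy().get_weights()

new_config = (
    PPOConfig()
    .environment("CartPole-v1")
    .rollouts(num_rollout_workers=0)
    .training(lr=0.0)  # changed config
)
algo = new_config.build()
algo.get_policy().set_weights(weights)  # carry the trained weights over
for _ in range(5):
    algo.train()
```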
I’m not proficient in Torch, but I think Torch models have similar functions for saving and loading weights as TensorFlow. This topic is sparse and fragmented in the official Ray documentation in general. It’s a pity, because it deters potential new users from getting started with this great RL framework.

I’ve made a toy example on my GitHub that does what you want (tuning twice with different configs) and takes it all the way to production without the overhead of Ray at the end, relying only on the ML framework (TensorFlow). At the end of the day, that is probably what most users need.
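The production part is essentially just exporting the trained TF policy as a SavedModel and loading it back with plain TensorFlow. A hedged sketch, continuing from the `algo` in the sketch above ("exported_policy" is a placeholder path, and the exact serving signature depends on your RLlib/TF versions):

```python
import tensorflow as tf

# Export the trained (TF) policy of the Algorithm as a SavedModel.
algo.export_policy_model("exported_policy")  # exports "default_policy" by default

# From here on, no Ray is needed: plain TensorFlow only.
loaded = tf.saved_model.load("exported_policy")
print(list(loaded.signatures.keys()))  # inspect the available serving signatures
```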
Note that during the second Tuner.fit(), “lr” is changed as well as “training_iteration”. Setting lr=0.0 is there to prove that the weights loaded initially are the previously trained ones saved in the best checkpoint: they do not change when lr=0.0.
Take a look here:
In terms of your last point, I’m not sure, but my feeling is no, sorry.
BR
Jorgen