Run tune.Tuner with a given policy

dbk80 · October 18, 2024, 7:40am

Hi,

I would like to set up a tune.Tuner object and run tuner.fit() for an experiment using a Policy object restored from a checkpoint. Could you please guide me on how to achieve this?

To clarify, here’s what I’m currently doing:

from ray.rllib.policy import Policy
from ray.tune.registry import get_trainable_cls

# Restore the policy from the checkpoint and key its weights by policy ID.
policy = Policy.from_checkpoint(<path>)['default_policy']
weights = {'default_policy': policy.get_weights()}

config = (
    get_trainable_cls("PPO")
    .get_default_config()
    .environment(..., env_config=env_config)
    .training(..., model={"custom_model_config": custom_model_config})
)
algo = config.build()
algo.set_weights(weights)

At this point, I can call algo.train(), but how can I achieve something similar using a tuner?

Specifically, I’m interested in configuring a new environment (a new env_config) and passing on a custom model config (a new custom_model_config) in this setup.
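
For reference, one direction that might work is sketched below; it is untested against my setup and rests on several assumptions. The idea is to load the weights inside an `on_algorithm_init` callback, so that every algorithm instance the tuner creates starts from the restored policy. The helper names (`wrap_policy_weights`, `make_restore_callbacks`, `build_tuner`) and the `stop_iters` stopping criterion are hypothetical, the import location of `RunConfig` differs between Ray versions, and the sketch assumes the older Policy-based API stack.

```python
def wrap_policy_weights(policy_weights, policy_id="default_policy"):
    """Key one policy's weights by policy ID, the format set_weights() takes."""
    return {policy_id: policy_weights}


def make_restore_callbacks(checkpoint_path, policy_id="default_policy"):
    """Build a callbacks class that loads checkpointed weights at algorithm init."""
    # Ray is imported lazily so this module can be imported without it.
    from ray.rllib.algorithms.callbacks import DefaultCallbacks
    from ray.rllib.policy import Policy

    class RestoreWeightsCallbacks(DefaultCallbacks):
        def on_algorithm_init(self, *, algorithm, **kwargs):
            # Runs once per trial, right after the algorithm is constructed.
            policy = Policy.from_checkpoint(checkpoint_path)[policy_id]
            algorithm.set_weights(
                wrap_policy_weights(policy.get_weights(), policy_id)
            )
            # Assumption: with remote rollout workers, the weights may also
            # need to be synced to them explicitly (not shown here).

    return RestoreWeightsCallbacks


def build_tuner(checkpoint_path, env, env_config, custom_model_config,
                stop_iters=10):
    """Assemble a Tuner whose PPO trials start from the checkpointed weights."""
    from ray import train, tune  # RunConfig's location varies by Ray version
    from ray.tune.registry import get_trainable_cls

    config = (
        get_trainable_cls("PPO")
        .get_default_config()
        .environment(env=env, env_config=env_config)
        # Any other training arguments would go here as well.
        .training(model={"custom_model_config": custom_model_config})
        .callbacks(callbacks_class=make_restore_callbacks(checkpoint_path))
    )
    return tune.Tuner(
        "PPO",
        param_space=config.to_dict(),
        run_config=train.RunConfig(stop={"training_iteration": stop_iters}),
    )
```

Calling `build_tuner(...).fit()` would then start the experiment. Presumably the new env_config and custom_model_config still have to produce the same observation space, action space, and network shapes as the checkpoint, otherwise the restored weights will not fit the new model.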

Thank you in advance for your help!
