How do I call my custom TorchPolicyV2 train with ray.tune?

ChiefAlu · March 20, 2024, 2:30pm

How severe does this issue affect your experience of using Ray?

High: It blocks me to complete my task.

Hi,
I implemented my custom TorchPolicyV2.
How do I now create an Algorithm out of it, and use it with ray.tune?
I found this code in the documentation: Key Concepts — Ray 2.9.3
However, I don't want to run it in a while loop; I want to run it with ray.tune.

  config = DQNConfig()
  env = gym.make("CartPole-v1")
  policy = MyCustomPolicy(observation_space=env.observation_space,
                                     action_space=env.action_space,
                                     config=config)  # implements TorchPolicyV2
  
  
  config = DQNConfig().environment(env="CartPole-v1").training(train_batch_size=8)
  algo = config.build()
  algo = algo.add_policy(policy=policy, policy_id="my_custom_policy")
  print(algo.train())

I get the following errors:
Can not figure out a durable policy name for <class 'my_custom_policy.MyCustomPolicy'>. You are probably trying to checkpoint a custom policy. Raw policy class may cause problems when the checkpoint needs to be loaded in the future. To fix this, make sure you add your custom policy in rllib.algorithms.registry.POLICIES.

AttributeError: 'MyCustomPolicy' object has no attribute 'train'

Thanks in advance for your help

Lars_Simon_Zehnder · March 21, 2024, 9:22am

@ChiefAlu it looks like you are trying to add a policy that is not registered (i.e. not one of the default policies). Afaik there is no way to register a custom policy. Instead you might want to take a look at this doc section here.

Also, keep in mind: adding a policy to an algorithm requires you to have a policy mapping function defined in the configuration. This mechanism comes from multi-agent RL (MARL), where the function decides which policy controls which agent.
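
To illustrate the policy mapping function mentioned above, here is a minimal sketch in plain Python. It assumes the Ray 2.9-style callback signature `(agent_id, episode, worker, **kwargs)` and reuses the policy ID `"my_custom_policy"` from the question. The commented-out config wiring is an untested assumption and not something confirmed in this thread.

```python
# Hypothetical policy ID, taken from the question above.
POLICY_ID = "my_custom_policy"


def policy_mapping_fn(agent_id, episode=None, worker=None, **kwargs):
    """Route every agent to the single custom policy.

    RLlib calls this with an agent ID and expects a policy ID back.
    With only one policy, the mapping is constant.
    """
    return POLICY_ID


# Assumed wiring (not run here, needs ray[rllib] and the user's MyCustomPolicy):
# from ray.rllib.policy.policy import PolicySpec
# config = (
#     DQNConfig()
#     .environment(env="CartPole-v1")
#     .multi_agent(
#         policies={POLICY_ID: PolicySpec(policy_class=MyCustomPolicy)},
#         policy_mapping_fn=policy_mapping_fn,
#     )
# )

if __name__ == "__main__":
    # Any agent ID resolves to the same policy.
    print(policy_mapping_fn("agent_0"))
```

As far as I can tell, the `AttributeError` in the question has a separate cause: `algo.add_policy(...)` returns the added policy, not the algorithm, so `algo = algo.add_policy(...)` rebinds `algo` to the policy object and the following `algo.train()` fails.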
