How to access the policy object in the on_train_result callback of Ray RLlib? The callback signature is on_train_result(self, *, algorithm, result, **kwargs)

ZanhaPeng · March 20, 2025, 9:45am

code is as followed:
@override(DefaultCallbacks)
def on_train_result(self, *, algorithm, result, **kwargs):
iteration = result.get(“training_iteration”, 0)
policy_map = algorithm.workers.local_worker().policy_map
policy = policy_map.get(“default_policy”)
config = (
PPOConfig()
.api_stack(enable_rl_module_and_learner=True, enable_env_runner_and_connector_v2=True)
.callbacks(IntrinsicRewardCallbacks)

Topic		Replies	Views
Getting the policy network on_trial_result RLlib	3	282	September 5, 2022
Extract and display policy RLlib	3	564	July 26, 2021
How to get and use a trained policy RLlib	0	589	September 8, 2024
Seeking recommendations for implementing Dual Curriculum Design in RLlib RLlib	13	801	April 11, 2023
RLLib: How to use policy learned in tune.run()? RLlib	6	1097	September 21, 2023

How to access the policy object in the on_train_result callback of Ray RLlib? The callback signature is on_train_result(self, *, algorithm, result, **kwargs)

Related topics