I am trying to plot some of the variables returned in my environment's info dictionary. I implemented a custom callback by subclassing the DefaultCallbacks
class and appended those variables to lists stored in the episode.user_data dictionary. However, none of them show up in TensorBoard.
My code snippet looks like this:
import custom_env as myenv
from ray.rllib.algorithms.ppo import PPOConfig
from ray.rllib.algorithms.callbacks import DefaultCallbacks
class customCB(DefaultCallbacks):
    def on_episode_start(self, *, worker, base_env, policies, episode, env_index, **kwargs):
        episode.user_data['action'] = []
        episode.user_data['battery SOC'] = []

    def on_episode_step(self, *, worker, base_env, policies, episode, env_index, **kwargs):
        env1 = base_env.get_sub_environments()[0]  # this step works
        episode.user_data['action'].append(env1.info['action'])
        episode.user_data['battery SOC'].append(env1.info['battery SOC'])
        episode.custom_metrics['test_data'] = 10  # does not show up in TensorBoard either!
config = PPOConfig()
config = config.training(gamma=0.9, lr=0.001, kl_coeff=0.3, use_gae=True, clip_param=0.1)
config = config.rollouts(num_rollout_workers=0)
config = config.framework('torch')
config = config.callbacks(customCB)
env_config = {'max_bat_cap': 50, 'charging_rate': 0.5}
config = config.environment(env=myenv.env_v3, env_config=env_config)
rllib_algo = config.build()
for i in range(1000):
    result = rllib_algo.train()
    # print(pretty_print(result))
    if i % 5 == 0:
        checkpoint_dir = rllib_algo.save()
        print(f"Checkpoint saved in directory {checkpoint_dir}")
print("End program")
TensorBoard doesn't show any of the metrics I added to episode.user_data, and the test_data value I set in episode.custom_metrics is missing as well.
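One check I can think of is printing part of the result dict inside the training loop to see whether the metrics reach the trainer at all. This is just a sketch, and it assumes aggregated callback metrics surface under result['custom_metrics'] (that key name is my reading of the docs, not something I've confirmed here):

# Sketch: inspect the result dict for aggregated callback metrics.
# Assumes they land under result["custom_metrics"]; .get() guards against the key being absent.
result = rllib_algo.train()
print(result.get("custom_metrics", {}))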
As a starting point, I also tried running rllib/examples/custom_metrics_and_callbacks.py,
but that fails with a separate error: AttributeError: 'EpisodeV2' object has no attribute 'last_observation_for'.
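For reference, my reading of that example is that the per-step values collected in episode.user_data are meant to be reduced into episode.custom_metrics inside on_episode_end, roughly like the sketch below. The aggregation and the metric names (mean_action, mean_battery_SOC) are my own adaptation of the example, not something I have been able to verify, since the example itself errors out for me:

import numpy as np

class customCB(DefaultCallbacks):
    # ... on_episode_start / on_episode_step as in my snippet above ...

    def on_episode_end(self, *, worker, base_env, policies, episode, env_index, **kwargs):
        # Reduce the per-step lists into per-episode scalars; as far as I understand,
        # values placed in custom_metrics here are what gets reported in the results.
        episode.custom_metrics['mean_action'] = float(np.mean(episode.user_data['action']))
        episode.custom_metrics['mean_battery_SOC'] = float(np.mean(episode.user_data['battery SOC']))

Is an on_episode_end hook like this the missing piece, or should the user_data values show up on their own?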