Logging custom arrays with RLlib+Tune

vakker00 · June 7, 2021, 9:58am

I’m trying to save a custom NumPy array at the end of each episode. I created a callback like this:

    def on_episode_end(self, *, worker, base_env, policies, episode, env_index,
                       **kwargs):
        envs = base_env.get_unwrapped()

        episode.custom_metrics["rollout_arrays"] = [
            env.gat_arr() for env in envs
        ]

and then in a custom logger:

    def log_trial_result(self, iteration, trial, result):
        # save rollout_arrays from result

But the data in custom_metrics seems to be processed before it reaches the logger rather than being passed through directly. What would be the best way to handle this?

stefanbschneider · June 14, 2021, 1:29pm

Hi, not sure I understand correctly. You want to save an entire array as a single metric? What do you want to do with the logged array in the end?

I always saved a single scalar value for each metric so that it can be displayed properly in TensorBoard. In the case of dicts or arrays, I simply created separate metrics for the different dict keys (e.g., measurements for different users).

Here’s an example of what I implemented for logging a custom metric: DeepCoMP/callbacks.py at master · CN-UPB/DeepCoMP · GitHub
Maybe it’s helpful.
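
The one-scalar-per-key approach described above can be sketched roughly as follows. The helper name `flatten_to_scalar_metrics` and the sample data are hypothetical and only illustrate the idea; they are not taken from DeepCoMP or RLlib.

```python
from collections.abc import Mapping


def flatten_to_scalar_metrics(prefix, values):
    """Turn a dict or sequence of numbers into one scalar metric per entry.

    Hypothetical helper: dict keys (or list indices) become metric-name suffixes.
    """
    if isinstance(values, Mapping):
        items = values.items()
    else:
        items = enumerate(values)
    return {f"{prefix}_{key}": float(value) for key, value in items}


# Made-up per-user measurements collected during one episode.
per_user_rate = {"ue1": 2.5, "ue2": 0.75}
scalar_metrics = flatten_to_scalar_metrics("data_rate", per_user_rate)
print(scalar_metrics)
```

In an `on_episode_end` callback, the resulting dict could then be merged into the episode with `episode.custom_metrics.update(scalar_metrics)`, so each entry shows up as its own scalar.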

vakker00 · June 16, 2021, 2:09pm

Thanks, but that’s not really what I’m trying to do.
Actually, you can save entire arrays, and TensorBoard will show them under the histograms, if you use episode.hist_data in the callback.
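
A minimal sketch of the hist_data route, assuming hist_data behaves like a dict that maps a key to a flat list of numbers. The helper `record_histogram` and the stand-in episode object are hypothetical and only there to make the snippet runnable outside RLlib.

```python
from types import SimpleNamespace


def record_histogram(episode, key, array_like):
    """Store a flat list of floats under episode.hist_data[key]."""
    # NumPy arrays expose ravel(); fall back to plain iteration otherwise.
    if hasattr(array_like, "ravel"):
        values = array_like.ravel().tolist()
    else:
        values = list(array_like)
    episode.hist_data[key] = [float(v) for v in values]


# Stand-in for the RLlib episode object (assumption: it has a hist_data dict).
fake_episode = SimpleNamespace(hist_data={})
record_histogram(fake_episode, "rollout_values", [0.1, 0.5, 0.9])
print(fake_episode.hist_data)
```

In a real callback the same call would go inside `on_episode_end`, with the `episode` argument passed in place of `fake_episode`.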

But in my case I want to do some post processing and visualization, so I need the arrays saved separately.

After some digging I realized there’s the episode.media property (which was introduced recently) that gets passed to the logger callback under episode_media, i.e.:

    def on_episode_end(self, *, worker, base_env, policies, episode, env_index,
                       **kwargs):
        envs = base_env.get_unwrapped()

        episode.media["rollout_arrays"] = [
            env.gat_arr() for env in envs
        ]

and then in the logger callback:

    def log_trial_result(self, iteration, trial, result):
        arrays = result['episode_media']['rollout_arrays']
        # save, render, whatevs with arrays
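
For the "save" part, something along these lines might work. It is only a sketch: the helper names, the file naming scheme, and the assumption that the value under rollout_arrays is a (possibly nested) list whose leaves are arrays are all mine, so check what actually arrives in `result` for your RLlib version.

```python
import os
import tempfile

import numpy as np


def iter_arrays(obj):
    """Yield every leaf of a possibly nested list/tuple as a NumPy array."""
    if isinstance(obj, (list, tuple)):
        for item in obj:
            yield from iter_arrays(item)
    else:
        yield np.asarray(obj)


def save_rollout_arrays(result, out_dir, iteration):
    """Write each array under result['episode_media']['rollout_arrays'] to a .npy file."""
    media = result.get("episode_media", {})
    paths = []
    for i, arr in enumerate(iter_arrays(media.get("rollout_arrays", []))):
        # Hypothetical naming scheme: one file per array per iteration.
        path = os.path.join(out_dir, f"rollout_iter{iteration:06d}_{i:03d}.npy")
        np.save(path, arr)
        paths.append(path)
    return paths


# Fake result dict standing in for what the logger callback receives.
fake_result = {
    "episode_media": {"rollout_arrays": [[np.zeros((2, 3))], [np.ones(4)]]}
}
out_dir = tempfile.mkdtemp()
saved_paths = save_rollout_arrays(fake_result, out_dir, iteration=1)
print(saved_paths)
```

Inside `log_trial_result`, this could be called as `save_rollout_arrays(result, some_dir, iteration)`, where `some_dir` is whatever output directory you choose. The files can then be loaded later with `np.load` for post-processing and visualization.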
