Logging stats side channels in custom Unity Environment using ML-Agents Academy.Instance.StatsRecorder

Philipp_D_Siedler · October 12, 2023, 8:11pm

How severe does this issue affect your experience of using Ray?

High: It blocks me to complete my task.

Heya, I have a custom environment built in Unity. In Unity, I use ML-Agents, which also provides an Academy.Instance.StatsRecorder to track custom data in the environment. This is how I was hoping this would work:

    from mlagents_envs.side_channel.environment_parameters_channel import EnvironmentParametersChannel
    from mlagents_envs.side_channel.stats_side_channel import StatsSideChannel
    
    env_setup_channel = EnvironmentParametersChannel()
    env_setup_channel .set_float_parameter("training", 1)

    # This is what I'm interested in:
    stats_channel = StatsSideChannel()

    tune.register_env(
        "unity3d",
        lambda c: CustomEnv(
            file_name=FILE_NAME,
            no_graphics=True,
            episode_horizon=EPISODE_HORIZON,
            side_channel=[env_setup_channel, stats_channel],
        ),
    )

The env_setup_channel is passing parameters to the Unity environment, which works well when I use RLLibs tune.Tuner to run training:

    results = tune.Tuner(
        "PPO",
        param_space=config.to_dict(),
        run_config=air.RunConfig(
            stop=stop,
            verbose=2,
            checkpoint_config=air.CheckpointConfig(
                checkpoint_frequency=5,
                checkpoint_at_end=True,
            ),
        ),
    ).fit()

However I was expecting for the stats that I’m adding to my Academy.Instance.StatsRecorder in the Unity environment, would show up when I open the training results in tensorboard. (this is the behaviour in Unitys ML-Agents)

Am I missing something? Maybe sven1977 has an idea? Thank you!

Philipp_D_Siedler · October 19, 2023, 10:31am

I just found out how to do it:

Create a custom callback:

from ray.rllib.algorithms.callbacks import DefaultCallbacks

class CustomMetricsCallback(DefaultCallbacks):
    def on_episode_end(self, worker, base_env, policies, episode, **kwargs):
        unity_env = base_env.get_sub_environments()[0]
        stats_side_channel = unity_env.stats_channel.stats

        for key, metric in stats_side_channel.items():
            episode.custom_metrics[key] = sum([value[0] for value in metric]) / len(
                metric
            )

Then add it to the config:

config = (
    PPOConfig()
    ...
    .callbacks(CustomMetricsCallback)
)

Topic		Replies	Views
Questions and Confusion: Getting started with RLlib Configure Algorithm, Training, Evaluation, Scaling	0	48	February 19, 2025
Trying to set up external RL environment and having trouble RLlib	14	1431	September 28, 2021
How does Ray RLLib handle individual agent EndEpisode calls in Unity 3D environments? RLlib	0	19	October 28, 2024
Logging stuff in a custom gym environment using RLlib and Tune RLlib	4	1409	June 1, 2022
Possible to access default logger from environment? RLlib	15	1461	April 27, 2021

Logging stats side channels in custom Unity Environment using ML-Agents Academy.Instance.StatsRecorder

Related topics