Add step info dictionary to MLflowLoggerCallback with Tune

fedetask · July 6, 2022, 11:25am

How severe does this issue affect your experience of using Ray?

Medium: It contributes to significant difficulty to complete my task, but I can work around it.

I am using RLlib with MLflow by passing the MLflowLoggerCallback to tune.run:

tune.run(
    **config,
    callbacks=[MLflowLoggerCallback(experiment_name="test")]
)

However, the callback then logs only the few metrics that RLlib generates itself.

In my environment, I return several metrics of interest in the info dictionary returned at each environment step. How can I have them logged by MLflow?

mannyv · July 6, 2022, 12:39pm

Hi @fedetask,

You need to add a callback that adds your custom metrics.

Here is an explanation:

https://docs.ray.io/en/latest/rllib/rllib-training.html#callbacks-and-custom-metrics

Here is an example:

https://github.com/ray-project/ray/blob/master/rllib/examples/custom_metrics_and_callbacks.py

"""Example of using RLlib's debug callbacks.

Here we use callbacks to track the average CartPole pole angle magnitude as a
custom metric.
"""

from typing import Dict, Tuple
import argparse
import numpy as np
import os

import ray
from ray import tune
from ray.rllib.algorithms.callbacks import DefaultCallbacks
from ray.rllib.env import BaseEnv
from ray.rllib.evaluation import Episode, RolloutWorker
from ray.rllib.policy import Policy
from ray.rllib.policy.sample_batch import SampleBatch

parser = argparse.ArgumentParser()


fedetask · July 6, 2022, 1:12pm

So I should have both a CustomCallback and the MLflowLoggerCallback, right? And the custom callback will add the info to the custom_metrics, so they should be passed to tune.run() in this order:

tune.run(callbacks=[CustomCallback(), MLflowLoggerCallback()])

am I correct?

mannyv · July 6, 2022, 1:25pm

@fedetask,

I do not think that is quite how it should work, but I could be wrong. In my understanding, the MLflowLoggerCallback is a Tune callback and the CustomCallback is an RLlib callback, so they are passed in different places. I think it should look something like this:

tune.run(config={...,  "callbacks": CustomCallback,},
         callbacks=[MLflowLoggerCallback()])
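
For illustration, the RLlib-side CustomCallback could look roughly like the sketch below. It is not taken from the linked example: the class name InfoMetricsCallback, the "info_values" key, and the choice to average each value over the episode are assumptions, and it targets the older callback API discussed in this thread (episode.last_info_for(), episode.user_data, episode.custom_metrics). The import falls back to a plain object only so that the sketch can be read and exercised without Ray installed.

```python
# Sketch only: names and the averaging choice are assumptions, not RLlib defaults.
try:
    # Location in Ray 2.x.
    from ray.rllib.algorithms.callbacks import DefaultCallbacks
except ImportError:
    try:
        # Location in Ray 1.x (for example 1.8.0).
        from ray.rllib.agents.callbacks import DefaultCallbacks
    except ImportError:
        DefaultCallbacks = object  # fallback so the sketch runs without Ray


class InfoMetricsCallback(DefaultCallbacks):
    """Average the numeric entries of the env `info` dict over each episode."""

    def on_episode_start(self, *, episode, **kwargs):
        # Per-episode scratch space: metric name -> list of observed values.
        episode.user_data["info_values"] = {}

    def on_episode_step(self, *, episode, **kwargs):
        # last_info_for() returns the most recent `info` dict (may be None).
        info = episode.last_info_for() or {}
        for key, value in info.items():
            # Keep plain numbers only; skip strings, bools, nested objects.
            if isinstance(value, (int, float)) and not isinstance(value, bool):
                episode.user_data["info_values"].setdefault(key, []).append(
                    float(value)
                )

    def on_episode_end(self, *, episode, **kwargs):
        # Publish one scalar per metric so RLlib can aggregate it.
        for key, values in episode.user_data["info_values"].items():
            episode.custom_metrics[key] = sum(values) / len(values)
```

The class itself (not an instance) would then be passed as config["callbacks"], as in the snippet above. RLlib should report the values under the custom_metrics key of each training result, which is what the Tune-side logger then receives.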

avnishn · July 6, 2022, 9:37pm

@xwjiang2010 can you please chime in here

xwjiang2010 · July 7, 2022, 3:55am

@mannyv’s understanding of the callbacks is correct. @fedetask did you get a chance to try it?

fedetask · July 7, 2022, 8:50am

Hello,
Unfortunately, I am using Ray version 1.8.0; I cannot upgrade it for now.

I did as @mannyv described, but it still doesn’t work. I think the reason is that in Ray 1.8.0, the MLflowLoggerCallback in mlflow.py, lines 148-160, logs results as follows:

148    def log_trial_result(self, iteration: int, trial: "Trial", result: Dict):
149        step = result.get(TIMESTEPS_TOTAL) or result[TRAINING_ITERATION]
150        run_id = self._trial_runs[trial]
151        for key, value in result.items():
152            try:
153                value = float(value)
154            except (ValueError, TypeError):
155                logger.debug("Cannot log key {} with value {} since the "
156                             "value cannot be converted to float.".format(
157                                 key, value))
158                continue
159            self.client.log_metric(
160                run_id=run_id, key=key, value=value, step=step)

and for key='custom_metrics', value is a dictionary that cannot be cast to float in line 153 and therefore isn’t logged.

I solved it by creating a new class that extends MLflowLoggerCallback and overrides log_trial_result so that the entries of the custom_metrics dictionary are logged as well.
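
The original override was not posted, so the following is only a minimal sketch of one way such an override could work. The helper names iter_numeric_metrics and log_result are hypothetical, as is the choice of "/" to join nested keys. The only thing assumed about the client is the log_metric(run_id=..., key=..., value=..., step=...) call already shown in the quoted Ray 1.8.0 code.

```python
from typing import Any, Dict, Iterator, Tuple


def iter_numeric_metrics(
    result: Dict[str, Any], prefix: str = ""
) -> Iterator[Tuple[str, float]]:
    """Yield (name, value) pairs for every float-convertible leaf in `result`.

    Nested dictionaries such as result["custom_metrics"] are walked
    recursively instead of being skipped; names are joined with "/".
    """
    for key, value in result.items():
        name = f"{prefix}/{key}" if prefix else str(key)
        if isinstance(value, dict):
            yield from iter_numeric_metrics(value, name)
            continue
        try:
            yield name, float(value)
        except (ValueError, TypeError):
            # Same policy as the quoted code: silently skip non-numeric values.
            continue


def log_result(client: Any, run_id: str, result: Dict[str, Any], step: int) -> None:
    """Log all numeric leaves of `result` through an MLflow-style client."""
    for name, value in iter_numeric_metrics(result):
        client.log_metric(run_id=run_id, key=name, value=value, step=step)
```

A subclass of MLflowLoggerCallback could then override log_trial_result, compute step and run_id exactly as in lines 149-150 above, and call log_result(self.client, run_id, result, step) in place of the original loop.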

