Saving checkpoints with good custom_metric using tune.run()

Glad it worked. I figured out the issue with the CSV logger: the very first time it logs data via its "on_result" method, it creates the file and determines the "fieldnames" (the flattened result keys) that it will log for the duration of the experiment.

In an example like yours, where evaluation only runs every n > 1 training iterations, the evaluation keys will not be in that first set of results, so they are ignored in all subsequent calls to "on_result". The same is true of your custom_metrics keys if you only add them during evaluation.
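Here is a minimal sketch of what goes wrong, assuming the logger uses something like a csv.DictWriter whose fieldnames are fixed at the first call (the metric key below is made up for illustration):

```python
import csv
import io

buf = io.StringIO()

# First result that reaches the logger: no evaluation/custom_metrics yet,
# so only these keys become CSV columns.
first = {"training_iteration": 1, "episode_reward_mean": 10.0}
writer = csv.DictWriter(buf, fieldnames=list(first.keys()), extrasaction="ignore")
writer.writeheader()
writer.writerow(first)

# A later result that does contain an evaluation custom metric.
later = {
    "training_iteration": 5,
    "episode_reward_mean": 12.0,
    "evaluation/custom_metrics/my_metric_mean": 0.7,  # hypothetical key
}
writer.writerow(later)  # the extra column is silently dropped

print(buf.getvalue())
# training_iteration,episode_reward_mean
# 1,10.0
# 5,12.0
```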

I think it will be hard to change the behavior of the CSV logger. Instead, it might be easier to have the ExperimentAnalysis class build the dataframe from the JSON log file.
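Until something like that lands, one workaround is to build the dataframe yourself from result.json, the JSON-lines file Tune writes into each trial directory. A minimal sketch; the helper name, the trial path, and the "/" separator are my assumptions:

```python
import json
import os

import pandas as pd


def trial_dataframe_from_json(trial_dir: str) -> pd.DataFrame:
    """Build a results dataframe from a trial's result.json, where each line
    is one JSON-encoded result dict. Keys that only appear in later
    iterations (evaluation, custom_metrics, ...) still become columns."""
    rows = []
    path = os.path.expanduser(os.path.join(trial_dir, "result.json"))
    with open(path) as f:
        for line in f:
            line = line.strip()
            if line:
                rows.append(json.loads(line))
    # Flatten nested dicts (e.g. evaluation -> custom_metrics) into
    # "evaluation/custom_metrics/..." style columns.
    return pd.json_normalize(rows, sep="/")


# Usage (path is an example):
# df = trial_dataframe_from_json("~/ray_results/my_experiment/PPO_CartPole-v1_00000")
```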