Store best checkpoints according to evaluation metrics

fedetask · June 19, 2023, 3:49pm

I want Ray to store the n best checkpoints according to an evaluation metric. I set the CheckpointConfig as

result = tune.Tuner(
    ...
    checkpoint_config=air.CheckpointConfig(
        checkpoint_frequency=10,
        checkpoint_at_end=True,
        num_to_keep=4,
        checkpoint_score_attribute='evaluation/custom_metrics/my_metric',
    )
)

but I get a "Result dict has no key" error. It seems the evaluation metrics are not present in the result dict. What am I missing? How can I set checkpoint_score_attribute so that it uses an evaluation metric?
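
To illustrate one possible cause (an assumption, not a confirmed diagnosis): if the trainable is an RLlib algorithm, the `evaluation` sub-dict may only be reported on iterations where evaluation actually ran, which RLlib controls through `evaluation_interval`. A slash-separated attribute such as `evaluation/custom_metrics/my_metric` is looked up against a flattened view of the nested result dict, so it would be missing on any checkpointed iteration that has no evaluation results. The sketch below uses plain Python only; `flatten` is a hypothetical stand-in for the flattening Tune does internally, and both result dicts are made-up examples.

```python
from typing import Any, Dict


def flatten(nested: Dict[str, Any], parent: str = "", sep: str = "/") -> Dict[str, Any]:
    """Flatten a nested dict into one level, joining keys with `sep`.

    Hypothetical helper that mimics how a nested result dict can be
    addressed with slash-separated keys.
    """
    flat: Dict[str, Any] = {}
    for key, value in nested.items():
        path = f"{parent}{sep}{key}" if parent else str(key)
        if isinstance(value, dict):
            flat.update(flatten(value, path, sep))
        else:
            flat[path] = value
    return flat


SCORE_KEY = "evaluation/custom_metrics/my_metric"

# Made-up result for an iteration on which no evaluation ran.
result_without_eval = {
    "training_iteration": 10,
    "custom_metrics": {"my_metric": 0.1},
}

# Made-up result for an iteration on which evaluation did run.
result_with_eval = {
    "training_iteration": 20,
    "evaluation": {"custom_metrics": {"my_metric": 0.7}},
}

has_key_without_eval = SCORE_KEY in flatten(result_without_eval)
has_key_with_eval = SCORE_KEY in flatten(result_with_eval)

print("key present without evaluation:", has_key_without_eval)
print("key present with evaluation:", has_key_with_eval)
```

If this is what is happening, one thing to check is whether `checkpoint_frequency` lines up with `evaluation_interval`, so that every checkpointed iteration also carries evaluation results.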
