How to use a Stopper based on an evaluation metric?

ihopethiswillfi · December 11, 2021, 5:42pm

Hi, how do I create a stopper that can work on an evaluation metric? I tried the following:

    def stopper(trial_id, result):
        stop = (
            (result['timesteps_total'] > 2e5 and result['evaluation/episode_reward_mean'] < -1.0)
            or (result['timesteps_total'] > 5e5 and result['evaluation/episode_reward_mean'] < -0.5)
            or (result['timesteps_total'] > 1e6 and result['evaluation/episode_reward_mean'] < -0.25)
            or (result['timesteps_total'] > 2e6 and result['evaluation/episode_reward_mean'] < 0)
        )
        return stop

When I then run tune.run(stop=stopper), I get the following error message:
KeyError: 'evaluation/episode_reward_mean'

My evaluation_interval is set to 1.
I’m using RLlib (not sure if this matters).

matthewdeng · December 17, 2021, 1:58am

Hey @ihopethiswillfi,

Can you share your code to reproduce this?

It may be that the metric is nested:

-result['evaluation/episode_reward_mean']
+result['evaluation']['episode_reward_mean']
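
For reference, here is a minimal sketch of the stopper from the question rewritten with the nested lookup. The thresholds are the ones from the original post; the `THRESHOLDS` name and the `.get` fallbacks are my own additions, on the assumption that some results may arrive without an evaluation entry.

```python
# (minimum timesteps, minimum evaluation reward) pairs taken from the question.
# The THRESHOLDS name itself is hypothetical, used only for this sketch.
THRESHOLDS = [
    (2e5, -1.0),
    (5e5, -0.5),
    (1e6, -0.25),
    (2e6, 0.0),
]


def stopper(trial_id, result):
    """Return True if the trial should be stopped."""
    # Evaluation metrics are nested under the "evaluation" key.
    evaluation = result.get("evaluation") or {}
    reward = evaluation.get("episode_reward_mean")
    if reward is None:
        # Assumption: no evaluation metric in this result, so keep running.
        return False
    timesteps = result.get("timesteps_total", 0)
    # Stop once any timestep budget is exceeded while the reward is still too low.
    return any(timesteps > t and reward < r for t, r in THRESHOLDS)
```

It is passed the same way as in the question, i.e. tune.run(..., stop=stopper).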

ihopethiswillfi · December 17, 2021, 7:01am

Your solution works! Many thanks.

It is counter-intuitive, however, because I can successfully pass metric="evaluation/episode_reward_mean" as an argument to tune.run().
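
A possible explanation, which is my assumption and not confirmed in this thread: Tune may flatten the nested result dict with "/" as a separator before resolving the metric argument, while the stop function receives the raw nested dict. The sketch below uses a hand-written `flatten` helper (hypothetical, not Ray's own code) and made-up result values to show how the same value can be reachable under both spellings.

```python
def flatten(d, parent_key="", sep="/"):
    """Flatten a nested dict into a single level, joining keys with `sep`."""
    flat = {}
    for key, value in d.items():
        full_key = f"{parent_key}{sep}{key}" if parent_key else str(key)
        if isinstance(value, dict):
            # Recurse into nested dicts, carrying the joined key prefix.
            flat.update(flatten(value, full_key, sep))
        else:
            flat[full_key] = value
    return flat


# Illustrative result dict; the values are invented for this example.
result = {"timesteps_total": 300000, "evaluation": {"episode_reward_mean": -0.7}}
flat = flatten(result)

nested_value = result["evaluation"]["episode_reward_mean"]  # nested access
flat_value = flat["evaluation/episode_reward_mean"]         # flattened access
```

Under that assumption, the "/" spelling only exists after flattening, which would explain the KeyError inside the stopper.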
