Using exploration during evaluation

Hi @Lars_Simon_Zehnder,

The evaluation_config is a configuration for overriding the base config during evaluations. You would want to put changes in there. Given your goal you would want it to look like this

"evaluation_config": {
     'exploration_config': {'epsilon_timesteps': 10000,
                        'final_epsilon': 0.05,
                        'initial_epsilon': 0.05,
                        'type': 'EpsilonGreedy'},
}
2 Likes