Hi Everyone! I would like some help with the DQN Trainer
When I use the RLlib DQN Trainer to train my Eclipse SUMO environment, training works fine: the agent explores and then, I assume, exploits once enough timesteps have elapsed to anneal the epsilon-greedy value down to its minimum of 0.2.
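For reference, the exploration settings I mean are roughly RLlib's built-in EpsilonGreedy schedule (the numbers here are illustrative rather than my exact values, apart from the 0.2 minimum):

```python
# Illustrative DQN exploration config using RLlib's built-in
# EpsilonGreedy schedule; only final_epsilon = 0.2 is from my setup.
config = {
    "exploration_config": {
        "type": "EpsilonGreedy",
        "initial_epsilon": 1.0,       # start fully exploring
        "final_epsilon": 0.2,         # epsilon stops decaying at this value
        "epsilon_timesteps": 10000,   # timesteps to anneal over (placeholder)
    },
}
```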
My issue comes when I try to evaluate the trained model, hoping to see what the data would look like without the exploration that happens at the start of training.
It produces very bad results and gets stuck repeating a single action throughout the simulation.
My best guess is that the restored policy doesn't properly reflect the observations and actions recorded during the training simulation.
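For context, here's a minimal sketch of the kind of restore-and-evaluate loop I mean (not my exact code, which is in the repo below; `make_sumo_env`, the `"sumo_env"` name, and the checkpoint path are all placeholders, and I'm assuming the pre-2.0 `DQNTrainer` API):

```python
import ray
from ray import tune
from ray.rllib.agents.dqn import DQNTrainer

from my_project.envs import make_sumo_env  # hypothetical helper that builds the sumo-rl env

ray.init()
tune.register_env("sumo_env", lambda cfg: make_sumo_env())  # placeholder registration

# This should match the training config, otherwise the restored
# weights won't line up with the model being rebuilt here.
config = {
    "env": "sumo_env",
    "explore": False,  # act greedily; no epsilon-greedy sampling at eval time
}

trainer = DQNTrainer(config=config)
trainer.restore("path/to/checkpoint")  # placeholder checkpoint path

env = make_sumo_env()
obs = env.reset()
done = False
total_reward = 0.0
while not done:
    # explore=False makes the policy pick the greedy action
    action = trainer.compute_single_action(obs, explore=False)
    obs, reward, done, info = env.step(action)
    total_reward += reward
print("episode reward:", total_reward)
```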
Here are the files I was using:
https://github.com/Sitting-Down/RLlib-DQN-Experiments
The main packages I used were RLlib for the DQN Trainer and sumo-rl as the environment.
Any help would be appreciated