Dueling and Distributional Q Learning on the top of Custom Policy

Moiz_Ahmad_Muhammad · February 2, 2022, 2:25am

I am wandering how the definition of the custom policy and the usage of the dueling and/or Distributional Q Learning will interact?
If I am using hyperparameter optimization on the parameters like “dueling” and “distributional”, with my custom policy and/or value function, then how the things will work. Will new layers be added and/or removed?

gjoliver · February 7, 2022, 1:32am

Hi Moiz, thanks for your question.
If your custom policy is extending the existing dqn policy class properly, I don’t see why it wouldn’t honor those config bits. I.e., if they return the same model class, things should work.
In case you find otherwise, file an issue with your script, and we are happy to take a look.

Topic		Replies	Views
Cannot understand how to create custom model for DQN RLlib	2	1494	April 29, 2022
[RLlib] Multi-headed DQN RLlib	5	1326	June 13, 2021
Why does DQN can have custom function? RLlib	1	251	January 9, 2023
Customize DQN policy in two-trainer multiagent example RLlib	4	388	September 20, 2022
Setting config["dueling"]=False still runs Dueling DQN RLlib	2	348	August 19, 2021

Dueling and Distributional Q Learning on the top of Custom Policy

Related topics