Is Rainbow/DQN really usable with parametric action spaces?

How severely does this issue affect your experience of using Ray?

  • High: It blocks me from completing my task.


I’m mostly asking these questions to make sure I understand the warning / the documentation properly.

  1. According to the parametric action spaces section of the documentation, DQN can only be used with hiddens: [] in the trainer config. So that basically means there are no hidden layers at all in the neural network used by DQN, correct? If so, is DQN really usable with parametric action spaces, given that we can’t define a “true” NN?

  2. Similarly, having to set dueling to False in the trainer config, as in the CartPole example with masked actions, means it is impossible to use the full Rainbow DQN with parametric action spaces, correct?
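
To make the two questions concrete, here is a minimal sketch of the settings I am referring to, written as a plain Python dict. The variable name dqn_config is just for illustration, and I am assuming these two keys are passed in the DQN trainer config alongside whatever env and model settings the example uses:

```python
# Illustrative sketch only: the two DQN config entries the questions are about.
# `dqn_config` is a hypothetical name; the rest of the trainer config
# (env, model, etc.) is omitted here.
dqn_config = {
    # Question 1: the docs say this must be an empty list
    # when using parametric action spaces.
    "hiddens": [],
    # Question 2: the masked-actions CartPole example sets this to False.
    "dueling": False,
}

print(dqn_config)
```

If I read the docs correctly, both of these entries are required for parametric actions, which is what prompts the questions above.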

Perhaps I am missing something, but I can’t gather more from what I’m reading in the docs. If I am mistaken, please tell me what I am misunderstanding!