Why can DQN have a custom function?

mingjunwang88 · October 27, 2022, 4:04pm

I am reading this file: ray/parametric_actions_model.py at master · ray-project/ray · GitHub. The class TorchParametricActionsModel(DQNTorchModel) inherits from DQNTorchModel. This is a bit confusing, since the custom function is supposed to be a policy function, but DQN does not have an explicit policy function. Does anyone know more about this? Thanks in advance.

kourosh · January 9, 2023, 2:53am

Hi @mingjunwang88, can you clarify your question a little? You mentioned

that the custom function is supposed to be policy function? But DQN does not have explicit policy function.

What do you mean by the custom function and policy function?

This example shows how you can extend a DQN algorithm / model to search over a large number of discrete actions (say 10000 actions). Instead of having an output head with 10000 logits, you predict an embedding of the observation and compute the inner product of that embedding with the embeddings of those 10000 actions to produce scores (logits) over the actions. This is what is shown in this file.
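A minimal sketch of that inner-product scoring, in plain Python rather than RLlib's actual model code. The function names (`score_actions`, `greedy_action`), the validity mask, and the toy numbers are all hypothetical, chosen only for illustration. With DQN the resulting scores would be treated as Q-values, and the policy stays implicit: the agent just picks the highest-scoring valid action (usually with some exploration on top).

```python
import math


def score_actions(obs_embedding, action_embeddings, action_mask):
    """Score each candidate action by the inner product of the observation
    embedding with that action's embedding; masked-out actions get -inf."""
    scores = []
    for emb, valid in zip(action_embeddings, action_mask):
        if valid:
            scores.append(sum(o * a for o, a in zip(obs_embedding, emb)))
        else:
            scores.append(-math.inf)
    return scores


def greedy_action(scores):
    """Implicit greedy policy: index of the highest-scoring action."""
    return max(range(len(scores)), key=scores.__getitem__)


# Hypothetical toy values: a 2-d embedding and 3 candidate actions,
# the last of which is marked invalid by the mask.
obs_embedding = [1.0, 2.0]
action_embeddings = [[1.0, 0.0], [0.0, 1.0], [3.0, 3.0]]
action_mask = [1, 1, 0]

scores = score_actions(obs_embedding, action_embeddings, action_mask)
best = greedy_action(scores)
```

The point of this design is that the network's output size depends on the embedding dimension, not on the number of actions, so the same model can score 3 or 10000 candidates.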
