How to get DQN action distribution

Samuel_Showalter · July 23, 2021, 7:18pm

Hi,

I noticed that if I run policy.model.from_batch(obs) for DQN, the output is not the size of the action space but rather the size of the internal feature representation of the network, can you help me understand how to get the action distribution? Is it derived from the Q function? Apologies if this question is naive.

Thanks,
Sam

mannyv · July 23, 2021, 8:39pm

Use the compute_actions / compute_single_action methods.

Dejan_Grubisic · November 3, 2022, 4:09pm

How to get the distribution of actions from compute_action or compute_actions? The documentation says they return only the best action.

This would be really important, since if your network returns an action that is impossible, how would you know the next best action to try instead?

Topic		Replies	Views
How do you get action probabilities from a policy? RLlib	8	1746	September 22, 2022
Fetch action probability distribution from trained policy RLlib	7	663	March 18, 2023
How are action computed from action_dist_inputs? RLlib	2	327	December 12, 2023
Help understand the output from compute_actions() RLlib	3	452	February 15, 2023
Requesting Clarification about Network Architectures Designs RLlib	3	260	July 31, 2021

How to get DQN action distribution

Related topics