I use the PPO algorithm without a shared actor-critic network.
I want to get the standard deviation of the policy (actor network) of the trained agent.
How to get the policy distribution?
Hey @Xim_Lee, I'm not sure I 100% understand what you are trying to get: the mean/stddev of all of the actor network's weights, or the outputs (parameterizing an action distribution) of the policy network for a given observation?
Thank you for the reply, @sven1977!
Sorry, I guess I didn’t say it clearly.
I mean the stddev of outputs of the policy network for a given observation.
The policy is a distribution over actions for a specific observation, and PPO selects actions by sampling from this stochastic policy. So I want the stddev of that policy distribution.
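For a continuous action space, a common setup (including RLlib's default diagonal Gaussian distribution) is that the policy network outputs the mean and log-std of the action distribution concatenated along the last axis; the stddev is then just the exponential of the log-std half. Below is a minimal sketch of that split, using a made-up `action_dist_inputs` vector for a hypothetical 2-dim action space (in RLlib you could obtain such a vector, e.g., via `compute_single_action(obs, full_fetch=True)` and the `"action_dist_inputs"` extra fetch, but the exact call depends on your version):

```python
import numpy as np

# Hypothetical model output for one observation: mean and log-std
# concatenated (diagonal Gaussian parameterization), 2-dim actions.
action_dist_inputs = np.array([0.3, -0.1, -0.5, -1.2])

# Split into the mean half and the log-std half.
mean, log_std = np.split(action_dist_inputs, 2)

# The stddev of the policy distribution is exp(log_std).
stddev = np.exp(log_std)

print("mean:", mean)      # [ 0.3 -0.1]
print("stddev:", stddev)  # exp([-0.5, -1.2])
```

Note that in PPO the log-std is often a state-independent learned parameter, in which case the stddev is the same for every observation; if the model outputs it per-observation, it will vary with the input.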