[rllib] Customized action distribution of probability matrices

Ofir · November 6, 2022, 2:47pm

How severe does this issue affect your experience of using Ray?

Medium: It contributes to significant difficulty in completing my task, but I can work around it.

Hi everyone,

Is there a way to create a customized action distribution where I’m sampling a probability matrix from it?
My current solution is that my NN outputs mean and log_std for a gaussian distribution, and I’m passing the sampled element through softmax.

Each row in my probability matrix represents the probability of choosing path i from N paths.

Thank you for the help!

Ofir · November 9, 2022, 7:44am

Hi all,

Is anyone have an idea to overcome my issue? is my solution is good enough?

Topic		Replies	Views
How to use Custom Action Distributions for this? RLlib	5	261	May 6, 2024
Where does ActionDistribution.sample() actually get called? RLlib	0	53	May 7, 2024
Rllib is auto adjusting my action distribution RLlib	4	316	May 26, 2022
How do you get action probabilities from a policy? RLlib	8	1747	September 22, 2022
Fetch action probability distribution from trained policy RLlib	7	664	March 18, 2023

[rllib] Customized action distribution of probability matrices

Related topics