Where does ActionDistribution.sample() actually get called?

How severe does this issue affect your experience of using Ray?

  • Medium: It contributes to significant difficulty to complete my task, but I can work around it.

I’m trying to figure out where in RLlib the ActionDistribution's .sample() and .logp() methods actually get called, so that I can potentially pass them additional inputs.
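To make the question concrete, here is a minimal sketch of how I could locate the call sites myself: wrap the two methods so that each call records which function invoked it. Everything below is hypothetical and uses plain Python only. `ToyDistribution`, `compute_action`, `trace_calls`, and `CALL_LOG` are stand-ins I made up for illustration, not RLlib classes or functions. The idea would be to apply the same decorator to the methods of a custom distribution subclass.

```python
import functools
import math
import random
import traceback

# Collected (method_name, calling_function_name) pairs.
CALL_LOG = []


def trace_calls(method):
    """Wrap a method so each call records the name of the function that called it."""
    @functools.wraps(method)
    def wrapper(self, *args, **kwargs):
        # With limit=2 the summary holds [caller, wrapper], oldest first.
        caller = traceback.extract_stack(limit=2)[0]
        CALL_LOG.append((method.__name__, caller.name))
        return method(self, *args, **kwargs)
    return wrapper


class ToyDistribution:
    """Hypothetical categorical distribution; a stand-in, not RLlib's class."""

    def __init__(self, probs):
        self.probs = probs

    @trace_calls
    def sample(self):
        # Draw one action index according to the given probabilities.
        return random.choices(range(len(self.probs)), weights=self.probs)[0]

    @trace_calls
    def logp(self, action):
        # Log-probability of a previously sampled action.
        return math.log(self.probs[action])


def compute_action(dist):
    """Hypothetical caller, standing in for whatever library code uses the distribution."""
    action = dist.sample()
    return action, dist.logp(action)


action, logp = compute_action(ToyDistribution([0.0, 1.0]))
print(CALL_LOG)
```

This only tells me who calls the methods, not how to thread extra inputs through, which is the part I'm still unsure about.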