How severely does this issue affect your experience of using Ray?
- None: Just asking a question out of curiosity
Hi all,
Can I access the most recent action probabilities after each episode or env step?
I’ve thought of using a custom callback, on_episode_step, where I have access to the worker, policies, and episode objects, but I don’t know which of these objects contains the most recent action probabilities.