How to revise the PPO action compute?

Xim_Lee · February 16, 2022, 6:31am

Hi !
I want to change the method with respect to the action_comput of PPO.
The original form of action computing is the sampling by using the mean and variance of output of the Actor-network. But I need to use the parameterization trick. So, Which file should I change?

Topic		Replies	Views
How do i compute an action from a trained RLlib PPO policy with the new API? RLlib	2	45	November 10, 2025
Controlling compute_actions during training RLlib	0	398	November 26, 2021
Compute actions Programmatically RLlib	1	302	February 5, 2022
[rllib] Retrieve and modify the computed discrete action logits to PPO agent RLlib	6	768	May 5, 2021
How to compute actions with RLlib and Tune after training RLlib	6	680	July 15, 2025

How to revise the PPO action compute?

Related topics