Customise policy to only do forward/backward pass for certain observations

bmanczak · December 9, 2021, 9:11am

Hi!

Thanks for the reply.
That’s what I ended up doing and it works.
However, as far as I understand, this still requires forward/backward pass, causing an overhead. I tried to solve the issue by customising the compute_single_action in the PPO trainer (post) but that did not work.

Topic		Replies	Views
Trouble Migrating Multi-Agent PPO with Custom Model(Action Masking + CNN + MLP) to New RLlib API RLlib	7	115	July 30, 2025
Controlling compute_actions during training RLlib	0	402	November 26, 2021
Issue creating custom action mask enviorment RLlib	14	2307	October 11, 2023
Model doesn't recognize ObservationWrapper and keeps using orig_observation RLlib	4	370	October 7, 2022
Action Masking without Including "action_mask" in the Observation Space? RLlib	0	37	October 31, 2024

Customise policy to only do forward/backward pass for certain observations

Related topics