Hi,
I used PPO to train an agent and saved the policy as a checkpoint. But when I restore the policy and call `policy.compute_single_action()` to get an action, it returns actions that are outside the action space. I'd like to know whether I did something wrong or whether this is a bug in RLlib.
By the way, `trainer.compute_single_action()` does not output out-of-bounds actions.
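To make the symptom concrete, here is a toy illustration (plain Python, no RLlib; the function names are just illustrative) of what I think is happening: a Gaussian policy head samples unbounded values, so raw samples can land outside a box action space like `[-1, 1]` unless something clips them afterwards, which seems to happen at the trainer level but not at the policy level:

```python
import random

def gaussian_policy_sample(mean, std):
    # Raw sample from a Gaussian policy head -- unbounded,
    # like what I seem to get from policy.compute_single_action().
    return random.gauss(mean, std)

def clip_action(action, low, high):
    # What the trainer-level call appears to do before returning the action.
    return max(low, min(high, action))

random.seed(0)
low, high = -1.0, 1.0

# A mean near the boundary makes out-of-bounds raw samples likely.
raw = [gaussian_policy_sample(0.9, 0.5) for _ in range(100)]
out_of_bounds = [a for a in raw if not low <= a <= high]
clipped = [clip_action(a, low, high) for a in raw]

print(len(out_of_bounds) > 0)                   # some raw samples exceed the bounds
print(all(low <= a <= high for a in clipped))   # clipping keeps them in the space
```

If that's the intended design, is the expectation that users clip (or unsquash) actions themselves when calling the policy directly?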