Hello,
I’m running PPO with a custom env. Training runs fine — I verified that the predicted actions fall within the action space.
I save a checkpoint every iteration, but when I load it with `Policy.from_checkpoint`, the actions predicted by the loaded policy range from -1 to 1, while the action space should be 0–30.
Is there any postprocessing I’m missing?
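For context, manually mapping the squashed [-1, 1] output back onto a Box(0, 30) space would look something like the sketch below (assuming a simple linear unsquash — I’m not sure this is exactly what happens internally during training, so the bounds and formula here are my assumption):

```python
import numpy as np

def unsquash(action, low=0.0, high=30.0):
    """Linearly map an action from [-1, 1] back to [low, high]."""
    action = np.clip(action, -1.0, 1.0)
    return low + (action + 1.0) * 0.5 * (high - low)

print(unsquash(-1.0))  # 0.0
print(unsquash(0.0))   # 15.0
print(unsquash(1.0))   # 30.0
```

Applying this to the loaded policy’s output does give values inside my action space, but I’d like to know whether this is the intended fix or whether I’m loading the checkpoint wrong.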
Help please!