Continuous actions go beyond defined action_space and then nan for multi-agent PPO

hridayns · July 3, 2021, 11:49am

The problem is exactly as the title says! I’m using a custom RLlib Multi-agent environment with PPO and I have defined my action space as

def get_action_space(self, agent):
        """ Returns the action space. """
        return gym.spaces.Box(
            low=-7.5, high=2.9, shape=(1,), dtype=np.float32
        )

For example: The action space is supposed to be between -7.5 and +2.9 but an action generated may have a value of 50 or even -1594974.6. I’m still not sure why this is happening or if these two issues are even related. Can anyone give an idea what could be wrong?

Topic		Replies	Views
PPO Policy not respecting action-space bounds RLlib	0	44	June 27, 2024
Multiagent PPO with custom model gives actions that are outside of the action space RLlib	2	353	October 5, 2021
Return obs_space in gym.Box format RLlib	1	555	March 6, 2022
Changing the action space bounds after every RLlib	3	299	July 18, 2023
Continuous action space and custom model RLlib	4	1533	July 17, 2021

Continuous actions go beyond defined action_space and then nan for multi-agent PPO

Related topics