Action masking for dependent multi discrete space

jacobDL · August 3, 2023, 5:01pm

Hi

I have been working on an rllib project and specifically seeing how we can use action masking to improve results as it is very hard to train due to the frequency of invalid actions (they often outnumber the number of valid actions)

Currently the action space is a multiDiscrete([2, 5, 5]). The first is a boolean 0/1 and the second make up a pair of cartesian x, y coordinates. These are dependent on eachother. For example [0, 1, 1] may be a valid set of actions but [1, 1, 1] isn’t. Unfortunately it does not look like you can mask out combinations but only individual actions (such as getting 0 for the first value)

Has anyone else encountered this issue and been able to overcome it or is this something that cannot be solved?

Topic		Replies	Views
Action masking error RLlib	9	1687	February 6, 2023
How to deal with irregular action space? Configure Algorithm, Training, Evaluation, Scaling	3	129	April 2, 2024
Invalid action masking for variable sized permutation action RLlib	0	210	May 27, 2021
Example for action masking (without action embeddings) for tuple action space RLlib	2	680	October 27, 2021
Observation dependent continuous action space ("Masking" continuous action space) RLlib	4	1100	February 9, 2022

Action masking for dependent multi discrete space

Related topics