What is the difference between action mask and action available?

Peter_Pirog · April 22, 2022, 9:04pm

I am trying to understand the difference between "action masked" and "action available".
As I understand it, the action mask is returned by the environment.
I am trying to solve a combinatorial environment (https://arxiv.org/pdf/2003.03600.pdf) with a very large action space (preparing a school timetable).

As I understand it, the relationship between the action space (all actions), masked actions, and available actions looks like this:

Do I understand this correctly?

sven1977 · April 26, 2022, 9:03am

Hey @Peter_Pirog , thanks for posting this question!
I think it’s even simpler. Take a look at this environment here:
ray.rllib.examples.env.action_mask_env.py::ActionMaskEnv

It produces an additional “action_mask” observation component, which is basically a binary tensor of length N (N = total number of actions) with values of either 0.0 (not available) or 1.0 (available).
This tensor is the “mask”.
The “available actions” are simply the list of action values that the mask currently allows.

For example:
action_space = Discrete(10)
obs = env.reset()
# obs then looks like this:
obs = {
    "action_mask": [0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0],  # ← only actions 1, 5, and 9 are "available"
    "actual_obs": …
}
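
To make the relationship between the two terms concrete, here is a minimal NumPy sketch (plain Python, not RLlib code) that derives the list of available actions from the mask in the example above and then rebuilds the mask from that list. The variable names are only illustrative.

```python
import numpy as np

# The mask from the example above: one entry per action in Discrete(10).
action_mask = np.array(
    [0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0], dtype=np.float32
)

# "Available actions" = indices where the mask is non-zero.
available_actions = np.flatnonzero(action_mask).tolist()

# Going the other way: rebuild the mask from the list of available actions.
rebuilt_mask = np.zeros(action_mask.shape[0], dtype=np.float32)
rebuilt_mask[available_actions] = 1.0

print(available_actions)
```

So both carry the same information: the mask is a fixed-length encoding, while the available actions are the variable-length list of indices it marks with 1.0.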

An action-masking-capable model (example here: ray/rllib/examples/models/action_mask_model.py) then needs to interpret the “action_mask” field in the observation accordingly: it sets the logits of all unavailable actions to -inf, so those actions get zero probability and are never sampled.
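
As a rough illustration of that masking step, here is a small NumPy sketch. It is not the code from action_mask_model.py; the helper names `mask_logits` and `softmax` and the random logits are assumptions made up for this example.

```python
import numpy as np


def mask_logits(logits, action_mask):
    """Replace the logits of unavailable actions (mask == 0) with -inf."""
    return np.where(action_mask > 0, logits, -np.inf)


def softmax(x):
    """Numerically stable softmax; assumes at least one finite entry."""
    z = x - np.max(x)
    e = np.exp(z)  # exp(-inf) evaluates to 0.0
    return e / e.sum()


rng = np.random.default_rng(0)
logits = rng.normal(size=10)  # stand-in for the policy network output
mask = np.array([0.0, 1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 1.0])

masked = mask_logits(logits, mask)
probs = softmax(masked)

# Sampling can now only ever return an available action (1, 5, or 9).
action = int(rng.choice(len(probs), p=probs))
```

Note that this only works if at least one action is available in every step. Some implementations also use a very large negative finite number instead of -inf to avoid NaNs in later computations.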

Peter_Pirog · May 3, 2022, 7:35am

@sven1977 Thank you for explaining the meaning of “available actions”. Now it’s clear to me.
