Action masks and loss functions

sven1977 · January 25, 2021, 10:14am

S. Fang asked this question on our Slack channel.
Please do not use the Slack channel anymore for questions on RLlib! All discussions should be moved here for better searchability and documentation of issues and questions. Thank you.

Hi, I have a question related to action-masks and loss functions. Currently I have an offline dataset of pre-generated episodes that I am using for RL training by applying action-masks which imitate the action sequences + states in the offline experiences. I’m using action-masks because it seemed easier to implement in our complex web-application-based RL setup than using RLLibs SampleBatch API.
However, the imitation learning isn’t having the effect that I’m expecting. Is it perhaps because applying action-masks which basically force the action probability distribution to assign all probability to a single action (the imitation action) also affects the loss function calculation and therefore backprop and gradients?

sven1977 · January 25, 2021, 10:48am

Not sure I understand your exact setup.
My first questions would be:

Which offline algo are you using? Pure Behavior cloning (BCTrainer)?
What’s your action space?
Where exactly are you applying the masking? After the network output and before the loss calculation?
If yes, then that could create issues as you may be obfuscating the parameterization of the action distribution output by your network (and making lots of useful gradients zero).
Also, am I understanding correctly that each mask only has one valid (discrete) action, which is given by the offline dataset?

Topic		Replies	Views
Problem with action masking RLlib	7	2242	May 19, 2022
[RLlib] Impossible actions RLlib	12	4084	May 11, 2022
AttentionNet with action masking RLlib	4	483	November 2, 2021
Invalid action masking for variable sized permutation action RLlib	0	210	May 27, 2021
Does KL loss make sense when using action masking in PPO? RLlib	2	383	August 1, 2023

Action masks and loss functions

Related topics