Passing additional tensors from custom_actions to loss function that are not part of the model.view_requirements

Bam4d · February 14, 2021, 1:30pm

I have a model where masks are being generated on-the-fly in an overridden custom_action function. The mask cannot be generated in the model, as the mask generation algorithm requires access to data in the info_batch.

The mask then needs to be passed to the loss function and the only way to do that is to place the mask tensor in the extra_fetches variable in the custom_actions function.

A problem arises here, however: during template generation the mask is not included as part of the view_requirements (and shouldn't be, because it's not a view requirement of the model).

Due to this the template generation fails as the action mask tensor is missing from the train_batch in the loss function.

I’m currently working around this by just adding a default tensor in the loss function:

valid_action_mask = train_batch.get('valid_action_mask', torch.zeros(....))
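The same fallback can be sketched as a small helper, assuming train_batch behaves like a dict. The helper name get_mask_or_default and its make_default argument are hypothetical and not part of RLlib, and plain Python lists stand in for tensors so the snippet runs without torch. Unlike the one-liner above, it builds the placeholder only when the key is actually missing.

```python
from typing import Any, Callable, Mapping


def get_mask_or_default(
    train_batch: Mapping[str, Any],
    make_default: Callable[[], Any],
    key: str = "valid_action_mask",
) -> Any:
    """Return train_batch[key] if present, otherwise a freshly built default.

    Hypothetical helper: make_default is only called when the key is
    missing, so no placeholder is allocated for batches that already
    carry the mask.
    """
    if key in train_batch:
        return train_batch[key]
    return make_default()


# Stand-in batches (assumption: plain dicts with lists instead of tensors).
# The first mimics a batch built without the mask, the second a batch
# that carries the mask placed in extra_fetches.
dummy_batch = {"obs": [[0.0, 0.0]]}
real_batch = {"obs": [[0.1, 0.2]], "valid_action_mask": [[1, 0, 1]]}

mask_for_dummy = get_mask_or_default(dummy_batch, lambda: [[0, 0, 0]])
mask_for_real = get_mask_or_default(real_batch, lambda: [[0, 0, 0]])
```

In the loss function, make_default would be a lambda wrapping the same torch.zeros(....) call as above.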

Is there a better “proper” way of doing this or is this an expected solution to this problem?

Thanks

Bam4d · February 15, 2021, 11:08am

This method works for the first few samples. However, RLlib then seems to magically inject the ‘valid_action_mask’ itself, which totally breaks the logic above.

I have no idea why this is happening.
