Custom Autoregressive Action Models/Distributions

hdelecki · December 26, 2020, 11:00pm

Hello,

I’m looking into using an autoregressive action model for one of my projects. I just looked through the example autoregressive model and action distribution.

The custom environment used with the autoregressive example uses an action space of (Discrete(2), Discrete(2)). The model contains a warning that it is only suitable for binary action spaces. I’m wondering if anyone can offer any insight into how the model/distribution would need to be changed to use higher dimensional discrete spaces, besides the input/output sizes in the model.

I greatly appreciate any advice!

sven1977 · December 29, 2020, 3:44pm

Hi, thanks for this question!
Yeah, we should generalize this example. Looking at this briefly, I think all you have to do to make this work for higher-dimensional Discrete action spaces is:

1. Remove the assertion that limits the model to binary action spaces.
2. Make the a1_logits and a2_logits Dense layers the same size as the two actions, e.g. if your action space is Tuple(Discrete(4), Discrete(6)), make a1_logits size 4 and a2_logits size 6.
3. Make sure that your action distribution handles the “context” (the model’s output, i.e. the logits for action 1) correctly to produce action 2.

Hope this helps, lmk.
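
For illustration, here is a minimal, framework-free sketch of the sampling logic the steps above describe, for Tuple(Discrete(4), Discrete(6)). It is not the RLlib example code: the random weight matrices stand in for the a1_logits and a2_logits Dense layers, CTX_DIM and all helper names are hypothetical, and one-hot encoding a1 before feeding it to the second head is an assumption of this sketch rather than something stated in the thread.

```python
import math
import random

N_A1, N_A2 = 4, 6  # sizes for Tuple(Discrete(4), Discrete(6))
CTX_DIM = 3        # hypothetical size of the context vector


def softmax(logits):
    # Numerically stable softmax over a list of floats.
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def linear(weights, inputs):
    # Stand-in for a bias-free Dense layer: one weight row per output unit.
    return [sum(w * x for w, x in zip(row, inputs)) for row in weights]


def one_hot(index, size):
    return [1.0 if i == index else 0.0 for i in range(size)]


def make_weights(rng, n_out, n_in):
    # Random placeholder weights; a real model would learn these.
    return [[rng.uniform(-1.0, 1.0) for _ in range(n_in)] for _ in range(n_out)]


def sample_autoregressive(context, w_a1, w_a2, rng):
    # Head 1: a1 logits (size N_A1) depend on the context only.
    a1_probs = softmax(linear(w_a1, context))
    a1 = rng.choices(range(N_A1), weights=a1_probs)[0]
    # Head 2: a2 logits (size N_A2) depend on the context AND the sampled a1.
    # Assumption: a1 is one-hot encoded rather than passed as a single scalar.
    a2_probs = softmax(linear(w_a2, context + one_hot(a1, N_A1)))
    a2 = rng.choices(range(N_A2), weights=a2_probs)[0]
    # Joint log-probability: log p(a1) + log p(a2 | a1).
    logp = math.log(a1_probs[a1]) + math.log(a2_probs[a2])
    return (a1, a2), logp


rng = random.Random(0)
w_a1 = make_weights(rng, N_A1, CTX_DIM)
w_a2 = make_weights(rng, N_A2, CTX_DIM + N_A1)
context = [0.5, -0.2, 0.1]
action, logp = sample_autoregressive(context, w_a1, w_a2, rng)
```

The point to carry over to a real action distribution is that the second head's input width grows with the size of the first action, and that the log-probability, entropy, and KL terms all need to be computed as the sum over both heads.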
