How to deal with an irregular action space?

RAY_fresh · March 7, 2024, 11:57am

Hello! I’m quite new to Ray, and I have run into a difficulty in my task:

I defined an action space as a tuple of several Discrete(7) spaces, and I don’t want any two of the components to take the same value during training. Any help, or even just some keywords to search for, would be appreciated…

For example, the action can be [1, 2, 3, 6], but not [1, 1, 3, 6], since the first two entries are the same.

I suspect action masking cannot do this, or am I missing something?

Thank you.

mannyv · March 7, 2024, 3:28pm

Hi @RAY_fresh,

The way I would handle this is to use action masking and keep track of the masking criteria in the environment. That will be the simplest way to handle it.
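One possible reading of this advice, as a rough sketch rather than a confirmed RLlib recipe: let the agent fill one slot of the tuple per step, and have the environment remember which values are already taken and expose them as a mask. The class name `DistinctChoiceEnv`, the four-slot setup, the placeholder reward, and the `"action_mask"` / `"observations"` keys are all assumptions for illustration (the key names are modeled on common action-masking examples). The class only mimics a Gymnasium-style `reset`/`step` interface in plain Python, so it would still need to be adapted to a real environment and a mask-aware model.

```python
import random

N_CHOICES = 7  # each slot is a Discrete(7)
N_SLOTS = 4    # assumed: the action [1, 2, 3, 6] has four slots


class DistinctChoiceEnv:
    """Hypothetical env: the agent fills one slot per step, never repeating a value."""

    def reset(self):
        self.chosen = []
        return self._obs()

    def _obs(self):
        # 1 = value still allowed, 0 = value already used in an earlier slot.
        mask = [0 if v in self.chosen else 1 for v in range(N_CHOICES)]
        # Pad with -1 so the observation keeps a fixed length.
        padded = self.chosen + [-1] * (N_SLOTS - len(self.chosen))
        return {"action_mask": mask, "observations": padded}

    def step(self, action):
        if action in self.chosen:
            raise ValueError(f"masked action {action} was selected")
        self.chosen.append(action)
        done = len(self.chosen) == N_SLOTS
        reward = 0.0  # placeholder: the real task would score the full tuple here
        return self._obs(), reward, done


def rollout(env, rng):
    """Stand-in for a masked policy: sample uniformly among allowed values."""
    obs, done = env.reset(), False
    while not done:
        allowed = [v for v, ok in enumerate(obs["action_mask"]) if ok]
        obs, _, done = env.step(rng.choice(allowed))
    return env.chosen


print(rollout(DistinctChoiceEnv(), random.Random(0)))
```

The trade-off in this design is that one original action becomes several environment steps, but the mask stays a simple vector of length 7 and the "no duplicates" rule lives entirely in the environment.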

RAY_fresh · March 9, 2024, 2:30am

Hi @mannyv,

That sounds reasonable and workable, but it is a bit abstract to me…

Could you please explain a bit more?

Thank you for your kindness!

rusu24edward · April 2, 2024, 7:30pm

This looks like an auto-regressive action space.
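To illustrate the idea (a minimal sketch, not RLlib's own implementation): in an auto-regressive setup the components are sampled one after another, and each later component is conditioned on the earlier ones. For the "no duplicates" constraint, that conditioning can be as simple as masking out the values already taken before applying the softmax. The function names `masked_softmax` and `sample_distinct` are hypothetical, and the fixed per-slot logits stand in for what a real model would compute from the observation and the previously sampled components.

```python
import math
import random


def masked_softmax(logits, mask):
    """Softmax over logits where mask[i] == 0 forces probability 0."""
    masked = [l if m else -math.inf for l, m in zip(logits, mask)]
    top = max(masked)  # subtract the max for numerical stability
    exps = [math.exp(l - top) for l in masked]
    total = sum(exps)
    return [e / total for e in exps]


def sample_distinct(logits_per_slot, rng):
    """Sample one value per slot in order, excluding values already taken."""
    n = len(logits_per_slot[0])
    mask = [1] * n
    action = []
    for logits in logits_per_slot:
        probs = masked_softmax(logits, mask)
        value = rng.choices(range(n), weights=probs)[0]
        action.append(value)
        mask[value] = 0  # later slots can no longer pick this value
    return action


# Four slots over Discrete(7), with uniform logits as a stand-in for model output.
uniform_logits = [[0.0] * 7 for _ in range(4)]
print(sample_distinct(uniform_logits, random.Random(0)))
```

Compared with splitting the action over several environment steps, this keeps one environment step per full tuple, at the cost of needing a custom model or action distribution that samples the components sequentially.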
