How to deal with an action space where the order of elements does not matter?

RAY_fresh · March 21, 2024, 2:07am

Hi all, I am trying to train my agent, but it keeps taking many unnecessary actions.

For example, the action space is [Discrete(3) * 3], but [1, 2, 2] and [2, 1, 2] are actually the same action, so the agent gains nothing by trying both. The redundancy even causes the agent to get stuck in local optima.
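To make the redundancy concrete, here is a minimal sketch using only the Python standard library. It assumes each component takes the values 0, 1, 2 (as Discrete(3) does) and that two actions are equivalent exactly when they contain the same values in any order. The helper name `canonical` is hypothetical, not part of any library.

```python
from itertools import product

N_VALUES = 3  # each component is Discrete(3): values 0, 1, 2
N_SLOTS = 3   # three components in the action

def canonical(action):
    """Return an order-independent representative: the sorted tuple."""
    return tuple(sorted(action))

# All raw actions the agent can currently emit.
raw_actions = list(product(range(N_VALUES), repeat=N_SLOTS))

# Distinct actions once ordering is ignored.
classes = {canonical(a) for a in raw_actions}

print(len(raw_actions))  # 27
print(len(classes))      # 10
print(canonical((1, 2, 2)) == canonical((2, 1, 2)))  # True
```

Under these assumptions, 17 of the 27 raw actions are duplicates of another one.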

Is there any way to add constraints that mask out these redundant actions? Only the values in the action matter, not the order of its elements.
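One possible way to express such a constraint, sketched here with only the Python standard library, is to keep just the non-decreasing tuples. The names `ACTION_TABLE`, `decode`, and `next_slot_mask` are hypothetical, and wiring this into an RLlib environment or model is not shown. The first option replaces the action space with a single Discrete(10) index that is decoded through a lookup table. The second is a mask that only makes sense if the components are chosen one after another, so that each choice can see the previous one.

```python
from itertools import combinations_with_replacement

N_VALUES = 3  # each component is Discrete(3): values 0, 1, 2
N_SLOTS = 3   # three components in the action

# Option 1 (assumed re-parameterization): one entry per unordered action.
# combinations_with_replacement yields only non-decreasing tuples.
ACTION_TABLE = list(combinations_with_replacement(range(N_VALUES), N_SLOTS))

def decode(index):
    """Map a Discrete(len(ACTION_TABLE)) index to the underlying action."""
    return ACTION_TABLE[index]

# Option 2 (assumed sequential selection): allow only values that are
# greater than or equal to the previously chosen component.
def next_slot_mask(previous_value):
    """Return a 0/1 mask over the values allowed for the next component."""
    return [1 if v >= previous_value else 0 for v in range(N_VALUES)]

print(len(ACTION_TABLE))   # 10
print(decode(8))           # (1, 2, 2)
print(next_slot_mask(1))   # [0, 1, 1]
```

With option 1, the environment would receive an index in Discrete(10) and call `decode` before applying the action, so the order ambiguity never reaches the agent.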

Thank you!
