Hello! I’m quite new to Ray and have run into a difficulty in my task:
I defined an action space as a tuple of several Discrete(7) spaces, and I don’t want any two of them to take the same value during training. Please help me with this, or point me to some keywords…
For example, the action can be [1, 2, 3, 6], but cannot be [1, 1, 3, 6], since the first two are the same.
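To make the constraint concrete, here is a minimal sketch in plain Python of the rule I want to enforce. The helper name `is_valid_action` and the constant `N_CHOICES` are made up for illustration and are not part of Ray or RLlib; it only checks an action after the fact and does not show how to enforce the rule during training.

```python
from typing import Sequence

N_CHOICES = 7  # assumption: each sub-action is Discrete(7), i.e. an integer in 0..6


def is_valid_action(action: Sequence[int]) -> bool:
    """Return True if every sub-action is in range and no value repeats."""
    in_range = all(0 <= a < N_CHOICES for a in action)
    all_distinct = len(set(action)) == len(action)
    return in_range and all_distinct


print(is_valid_action([1, 2, 3, 6]))  # allowed: all values differ
print(is_valid_action([1, 1, 3, 6]))  # not allowed: the first two are the same
```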
I suspect an action mask cannot do this, or am I missing something?
Thank you.