My action space is a `Box(-1, 1, (2,), np.int)`

. I specifically want integers, and I don’t want to use a `Discrete`

space because I don’t want to manage the interpretation between `[0, n)`

and the potentially multi-dimensional `Box`

space that my environment uses. When I try to train this in RLlib with tensorflow, I get the following error: `ValueError: minval must be a scalar; got a tensor of shape [2]`

. After digging around a bit, I traced this to tensorflow random uniform; apparently, the `minval`

can be broadcasted when the type is float, but not when it is integer. Too bad.

So my first thought is to change to a float type and round the actions in my step function to the nearest integer. However, I just wanted to post this on here to see if anyone else has had a similar issue and any corresponding advice.

Thanks!