My action space is a Box(-1, 1, (2,), np.int). I specifically want integers, and I don’t want to use a Discrete space because I don’t want to manage the translation between [0, n) and the potentially multi-dimensional Box space that my environment uses. When I try to train this in RLlib with TensorFlow, I get the following error:
ValueError: minval must be a scalar; got a tensor of shape .
After digging around a bit, I traced this to TensorFlow's random uniform op (tf.random.uniform); apparently, minval can be broadcast when the dtype is float, but not when it is an integer. Too bad.
So my first thought is to change the space to a float type and round the actions to the nearest integer in my step function. However, I wanted to post this here to see if anyone else has run into a similar issue and has any advice.
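
For concreteness, here is a rough sketch of the rounding workaround, assuming the space is declared with a float dtype instead (for example Box(-1.0, 1.0, (2,), np.float32)). The names to_int_action and ToyEnv are made up for illustration and are not part of RLlib or Gym; the helper is plain Python and accepts any sequence of floats, including a NumPy array.

```python
def to_int_action(action, low=-1, high=1):
    """Round each float component to the nearest integer, then clip to [low, high]."""
    # Clipping guards against samples that land slightly outside the bounds.
    return [int(min(max(round(float(a)), low), high)) for a in action]


class ToyEnv:
    """Hypothetical stand-in for the real environment (not an actual gym.Env)."""

    def step(self, action):
        # Convert the float action from the policy into integers first.
        int_action = to_int_action(action)
        # ... the real environment logic would use int_action from here on ...
        return int_action


print(ToyEnv().step([0.7, -0.2]))
```

One thing to watch with this approach: if the policy samples uniformly over [-1, 1], rounding maps the whole interval (-0.5, 0.5) to 0 but only half as much range to each of -1 and 1, so the three integer actions are not equally likely at the start of training.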