Hi @mannyv,
I have now tested two further action space versions, and your intuition was pretty much right: the problem is rooted in the action space type.
I first used a Dict action space:
action_space = {
    'trade': Box(low=-1, high=1, shape=(1,), dtype=np.int8),
    'stop': Box(low=-np.inf, high=np.inf, shape=(1,), dtype=np.float64)
}
action_space = Dict(action_space)
This gave the same error as in my initial question. Then I tried a simple Box action space:
action_space = Box(low=-np.inf, high=np.inf, shape=(2,), dtype=np.float64)
and this worked. Training now runs through. Perhaps Dict action spaces are not implemented yet? They would be a nice feature, as they allow referring to individual action elements by name, which makes the code more readable.
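Until then, the names can be recovered by hand from the flat Box action. This is only a rough sketch: the helper name unpack_action, the index order (0 for trade, 1 for stop) and the clip-and-round rule for the trade value are my own assumptions, not anything provided by the library.

```python
# Hypothetical helper: gives names to the entries of a flat Box action.
# Assumes index 0 holds 'trade' and index 1 holds 'stop'.
def unpack_action(action):
    trade_raw = float(action[0])
    stop = float(action[1])
    # Assumption: clip the continuous value to [-1, 1], then round it
    # to the nearest of -1, 0 or 1 to mimic the discrete trade signal.
    trade = int(round(max(-1.0, min(1.0, trade_raw))))
    return {"trade": trade, "stop": stop}
```

Inside the environment's step function, one would call unpack_action(action) first and then work with the 'trade' and 'stop' keys, so the rest of the code reads the same as it would with a Dict space.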
I will now try whether a Tuple action space works.