I’m doing some training using PPO, and I would like the value function to have additional states that the policy doesn’t have.
By default, the FullyConnectedNetwork looks like this (14 should be 7 here):
I slightly modify it by splitting the input layer, this way I can give additional states to the value function input. I have a state space of size 14 and split it in 2, passing the first 7 to policy_obs and the 7 last to value_obs.
This seems to work fine, however I’d like to add an lstm on top of it, and the
use_lstm wrapper doesn’t work by default (cf. error below). I was wondering if there is an easier way to do this (giving additional states to the VF) that I didn’t find? That would save me from having to add an lstm manually to my custom net. Or did I miss something with the
ValueError: Input 0 of layer fc_value_1 is incompatible with the layer: expected axis -1 of input shape to have value 14 but received input with shape [None, 7]