Hi @zzchuman,
You can have an LSTM added to your model automatically by setting config["model"]["use_lstm"] = True.
If by critic you mean the value network, then yes: the action network and the value network will both use the output of a single, shared LSTM layer.
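For reference, here is a minimal sketch of what that setting could look like in a dict-style config. Only use_lstm comes from the answer above; lstm_cell_size and max_seq_len are related model options as I remember them, and the values shown are illustrative assumptions, so check them against the model defaults of your RLlib version.

```python
# Sketch of a dict-style config; building it needs no Ray import.
config = {
    "model": {
        # From the answer above: wrap the default network with an LSTM.
        "use_lstm": True,
        # Assumed option and value: size of the LSTM hidden state.
        "lstm_cell_size": 256,
        # Assumed option and value: max sequence length used when training.
        "max_seq_len": 20,
    },
}

# Setting the flag on an existing config works the same way.
config["model"]["use_lstm"] = True
```

You would then pass this config to your trainer as usual.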
You can have a look at the network architectures with and without an lstm here: