Ppo add the lstm NN

Hi @zzchuman,

You can add an automatic lstm with config["model"]["use_lstm"]=True

If by critic you mean the value network then yes,the action network and value network will both use the output from a single lstm layer.

You can have a look at the network architectures with and without an lstm here:

1 Like