[RLlib] compute_single_action() with an LSTM-PPO trainer fails

Hi @Mirakolix_Gallier,

Try this example:
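
A minimal sketch of the usual pattern, assuming the failure comes from calling `compute_single_action()` without the recurrent state. With an LSTM model you pass the state in and get an `(action, state_out, extra)` tuple back, then feed `state_out` into the next call. The helper `act_with_state` and the `StubTrainer` class are hypothetical names used only for illustration, so that the snippet runs without Ray installed. With a real trainer the initial state would typically come from `trainer.get_policy().get_initial_state()`.

```python
def act_with_state(trainer, observations, initial_state):
    """Feed observations one at a time, carrying the recurrent state forward."""
    state = initial_state
    actions = []
    for obs in observations:
        # When `state` is passed, the call returns a tuple of
        # (action, state_out, extra_info) instead of a bare action.
        action, state, _ = trainer.compute_single_action(obs, state=state)
        actions.append(action)
    return actions, state


class StubTrainer:
    """Hypothetical stand-in that mimics only the call shape of an
    RLlib trainer with a recurrent model. It is NOT RLlib code."""

    def compute_single_action(self, observation, state=None):
        h, c = state
        # Toy "LSTM" update so the state visibly changes between steps.
        h = [x + 1.0 for x in h]
        c = [x + float(sum(observation)) for x in c]
        action = int(h[0]) % 2
        return action, [h, c], {}


if __name__ == "__main__":
    # Assumed LSTM state layout: [h, c], each a vector of cell_size zeros.
    init_state = [[0.0] * 4, [0.0] * 4]
    observations = [[1.0, 1.0]] * 3
    actions, final_state = act_with_state(StubTrainer(), observations, init_state)
    print(actions)
```

With a real LSTM-PPO trainer, replace `StubTrainer()` with your trainer object and reset the state to the initial state at the start of every episode.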
