If a policy model contains an RNN, or more precisely an LSTM cell, then
trainer.save() stores the weights of all trainable variables. However,
trainer.save() doesn’t store the most recent cell state and hidden state of the LSTM cell.
Is there a way to store these tensors (at least the cell state) as well? Or does that not make sense? I’m not sure. As I understand it, the cell state acts as a long-term memory, so it might be helpful to keep it.
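
To make the question concrete, here is a minimal sketch of the kind of thing I have in mind. It is only an illustration: `save_lstm_state` and `load_lstm_state` are hypothetical helpers that are not part of the trainer API, and the sketch assumes the hidden and cell state can be pulled out of the model as plain lists or arrays (placeholder values are used here).

```python
import os
import pickle
import tempfile


def save_lstm_state(path, hidden_state, cell_state):
    """Hypothetical helper: write the recurrent state to its own file,
    next to (not inside) the checkpoint written by trainer.save()."""
    with open(path, "wb") as f:
        pickle.dump({"h": hidden_state, "c": cell_state}, f)


def load_lstm_state(path):
    """Hypothetical helper: read the recurrent state back from disk."""
    with open(path, "rb") as f:
        state = pickle.load(f)
    return state["h"], state["c"]


# Placeholder values standing in for the real LSTM tensors (assumption).
hidden_state = [0.1, -0.3, 0.7]
cell_state = [1.2, 0.0, -0.5]

# Store the state in a temporary directory and read it back.
state_path = os.path.join(tempfile.mkdtemp(), "lstm_state.pkl")
save_lstm_state(state_path, hidden_state, cell_state)
restored_h, restored_c = load_lstm_state(state_path)
```

Whether restoring such a state is actually useful is part of what I’m asking, since the state belongs to one particular episode rather than to the trained weights.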