How severe does this issue affect your experience of using Ray?
- Low: It annoys or frustrates me for a moment.
I’m trying to replicate the Impala Deep Residual Model from the paper but without the embedding part - right-hand side. See Image
So far I’ve managed to forward the residual CNN part into the RNN and get that working in RandomEnv.
Code can be found here:
However, like in the paper I would also like to include the previous action and reward into the LSTM block. I’ve had a close look into this part of the documentation (ViewRequirement) but appears to be unable to access the input_dict in the forward_rnn(…)
Can anyone help?