[RLlib] Workaround for incorrect initial state shape with custom RNN models?

Gregory · December 30, 2020, 1:06am

Greetings everyone,

Back on June 21 issue 9071 was opened regarding incorrect initial state shapes when using a custom model in both Tensorflow and Torch. Can I ask the more experienced users here how to correctly set the initial shape (so that it represents the correct batch size)?

Thanks for any tips

Gregory · December 30, 2020, 1:35am

This is discussed in depth on ray-project/ray/issues/12509 but using 1.1.0 and the nightly 1.2 the challenge is still present. I’ve not been able to communicate with others about this, so if I find a solution I’ll share it.

Gregory · January 2, 2021, 3:03pm

For anyone else struggling, I believe I have it running on a custom model by using LSTMWrapper(RecurrentNetwork) as a template for a custom model. It will be interesting when we find why it’s happening in the use case mentioned in the GitHub issue.

Topic		Replies	Views
Problem with handling states in RNN RLlib	2	740	February 27, 2023
State shapes incorrect using custom model (TorchModelV2) (PPO) RLlib	2	431	July 15, 2021
[RLlib] Shape Error for custom PyTorch model RLlib	2	690	March 12, 2021
Custom RNN Model with Examples - why do they fail? RLlib	11	2358	May 5, 2022
Yet another question on RNN sequencing RLlib	7	813	January 4, 2022

[RLlib] Workaround for incorrect initial state shape with custom RNN models?

Related topics