Sharing an LSTM cell between policies

klausk55 · June 30, 2021, 12:08pm

Hello everybody,

what do you think, is it reasonable to share an LSTM cell of a neural network model between policies?
More precisely, I have different input and output layer, but all the layers in between (including LSTM) I want to share between the various policies.

I guess it might be reasonable since the policies share the weights of an LSTM cell but all policies have theirs own cell and hidden states.

Is my point of view okay?

PS: The way I do this is equivalent to the example showed in shared_weights_model.py

michaelzhiluo · July 1, 2021, 7:40am

This is what people usually do for multi-modal/multi task learning, e.g. learning between different tasks, not just perturbation of the same env, but actually different tasks. It sounds good.

arturn · July 1, 2021, 7:45am

Hi klausk55,

You should just try it and let us know how it went in this thread!
Schulman came up with some helpful slides for avoiding basic mistakes in evaluating an algorithm. So if you do your research and feel like you came up with something rather original and would like to test it, maybe consider his slides
Good luck!

Topic		Replies	Views
Save RNN model's cell and hidden state RLlib	16	813	April 24, 2023
Use LSTM model for policy gradient multi-agent with different recurrent hidden states per agent Configure Algorithm, Training, Evaluation, Scaling	0	28	July 30, 2024
What is the intended architecture of PPO vf_share_layers=False when using an LSTM RLlib	5	3380	June 24, 2023
RLLIB LSTM model summary view RLlib	1	825	March 31, 2023
Built in 2D Convolutions with LSTM RLlib	7	609	August 7, 2022

Sharing an LSTM cell between policies

Related topics