Max_seq_len of LSTM and Attention Net

Hello everyone!
I want to train a CNN+LSTM and a CNN+AttentionNet model separately. My questions are the following:

1. How can I set the same number of past observations passed to an LSTM and to an Attention Net?

My best guess:

---------------------------------------------------------------------
Set LSTM config    -> max_seq_len=64
---------------------------------------------------------------------
Set Att.Net config -> attention_memory_training=64
                      attention_memory_inference=64
---------------------------------------------------------------------
Should I set max_seq_len=64 for the AttentionNet too? What effect does max_seq_len have on the Attention Net? (See the config sketch below for what I mean.)
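
In RLlib model-config terms, here is roughly what I mean (just a sketch with my assumed value of 64; the keys are the standard model config options):

```python
# Sketch only -- assumed values, standard RLlib "model" config keys.
lstm_model_config = {
    "use_lstm": True,
    "max_seq_len": 64,  # length of the sampled segments fed to the LSTM
}

attention_model_config = {
    "use_attention": True,
    "attention_memory_training": 64,   # memory length used during training
    "attention_memory_inference": 64,  # memory length used during inference
    # "max_seq_len": 64,  # <-- unclear to me whether this is also needed here
}
```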

2. How many previous actions/rewards are passed to the LSTM if I set ‘lstm_use_prev_action=True’ and ‘lstm_use_prev_reward=True’ in the LSTM config? How can I match these on the Attention Net?

My best guess:

---------------------------------------------------------------------
On the LSTM this will pass the 64 past rewards/actions.
---------------------------------------------------------------------
On the Att.Net I have to set -> attention_use_n_prev_actions=64
                                attention_use_n_prev_rewards=64
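
As a sketch (same assumptions as above), this is the matching I have in mind:

```python
# Sketch only -- assumed values.
lstm_model_config = {
    "use_lstm": True,
    "max_seq_len": 64,
    "lstm_use_prev_action": True,   # does this feed the 64 past actions?
    "lstm_use_prev_reward": True,   # does this feed the 64 past rewards?
}

attention_model_config = {
    "use_attention": True,
    "attention_use_n_prev_actions": 64,  # my attempt to match the LSTM
    "attention_use_n_prev_rewards": 64,
}
```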

Thank you in advance!

Hi @TothAron ,

  1. max_seq_len=64 is probably fine. max_seq_len sets the length of the segments we gather from the sampling procedure, so it is the maximum coherent context you get during training, which naturally affects how the attention net trains.
  2. For the LSTM, you only use information that was produced in the previous step (the state) and combine it with new information to produce a new state and output. So the corresponding value would always be something like “lstm_use_n_prev_actions=1”; we don’t provide such an option, though. (See the config sketch after this list for how the two setups compare.)
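
Putting both points together, a minimal sketch of how I would set the two configs (assuming the standard RLlib model config keys and your value of 64; the env is just a placeholder):

```python
# LSTM variant: only the single previous action/reward is concatenated to the
# observation at each step; the recurrent state carries the older history.
lstm_config = {
    "env": "CartPole-v1",  # placeholder env
    "model": {
        "use_lstm": True,
        "max_seq_len": 64,             # length of the training segments
        "lstm_use_prev_action": True,  # one previous action, not 64
        "lstm_use_prev_reward": True,  # one previous reward, not 64
    },
}

# Attention (GTrXL) variant: memory length and the number of previous
# actions/rewards are explicit knobs, so matching 64 looks like this.
attention_config = {
    "env": "CartPole-v1",  # placeholder env
    "model": {
        "use_attention": True,
        "max_seq_len": 64,                  # training segment length, as above
        "attention_memory_training": 64,
        "attention_memory_inference": 64,
        "attention_use_n_prev_actions": 64,
        "attention_use_n_prev_rewards": 64,
    },
}
```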