Help needed with understanding and using attention model

Hi!

I am relatively new to the reinforcement learning space. So far I have tried simple neural networks and LSTMs for training an agent (PPO), and now I want to try an attention-based agent. So I have a couple of “simple” questions (partly because I also only recently learned about transformers and don’t grasp everything yet).

When defining the model config there are the parameters
“attention_memory_inference” and “attention_memory_training”.
What do these parameters do and how do they affect learning? What does “attention_memory_inference” mean in this context?

Also, if I have trained an agent and want to use it, what is the input for this model? Is it just a time series of observations (to capture how the state changes over time), as with an LSTM model?

attention_memory_inference is the number of timesteps to concatenate (along the time axis) and feed into the next transformer unit as inference input. The first transformer unit of your policy will receive this number of past observations (plus the current one) instead.

attention_memory_training is the number of timesteps to concatenate (along the time axis) and feed into the next transformer unit as training input (in addition to the actual input sequence of len=max_seq_len). The first transformer unit will receive this number of past observations (plus the input sequence) instead.
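
For reference, here is a minimal sketch of where these parameters sit in the model config. The numbers are placeholders rather than tuned values, the env is chosen purely for illustration, and the exact way you launch training may differ between RLlib versions:

```python
from ray import tune

config = {
    "env": "CartPole-v1",   # any episodic env, used here only for illustration
    "framework": "torch",
    "model": {
        # Wrap the default model with the attention (GTrXL) net.
        "use_attention": True,
        # Number of transformer units to stack in the policy.
        "attention_num_transformer_units": 1,
        # Output dimension of each transformer unit.
        "attention_dim": 64,
        # Past timesteps kept as memory per unit when computing actions.
        "attention_memory_inference": 50,
        # Past timesteps kept as memory per unit during training
        # (on top of the actual training sequences of length max_seq_len).
        "attention_memory_training": 50,
        # Length of the input sequences fed to the model during training.
        "max_seq_len": 20,
    },
}

tune.run("PPO", config=config, stop={"training_iteration": 10})
```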

So essentially, tweaking these parameters changes the number of past timesteps that get concatenated and fed through the attention units of your policy. They are exposed as separate settings probably because attention over long memories is slow, and you don’t need to rely on as much memory during inference as you do during training.
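
To your second question: at inference time you do not feed in a whole time series yourself. You pass only the current observation plus the recurrent “state” (the attention memory, one tensor per transformer unit), and RLlib returns the updated memory for the next step, much like an LSTM’s hidden state. A rough sketch continuing from the config above (classic gym API, placeholder checkpoint path; import paths vary between RLlib versions):

```python
import gym
import numpy as np

from ray.rllib.algorithms.ppo import PPO  # older RLlib: from ray.rllib.agents.ppo import PPOTrainer

# Build the algorithm from the config sketched above; normally you would
# train it first or restore a checkpoint here (path is hypothetical).
algo = PPO(config=config)
# algo.restore("/path/to/checkpoint")

num_units = config["model"]["attention_num_transformer_units"]
memory_inference = config["model"]["attention_memory_inference"]
attention_dim = config["model"]["attention_dim"]

env = gym.make("CartPole-v1")
obs = env.reset()  # classic gym API; gymnasium returns (obs, info) instead

# Initial (empty) attention memory: one [memory_inference, attention_dim]
# array per transformer unit.
state = [
    np.zeros([memory_inference, attention_dim], np.float32)
    for _ in range(num_units)
]

done = False
while not done:
    # Only the current obs plus the memory go in; the model attends over
    # the stored past timesteps internally.
    action, state_out, _ = algo.compute_single_action(obs, state=state)
    obs, reward, done, info = env.step(action)
    # Slide the memory window: drop the oldest timestep, append the newest.
    state = [
        np.concatenate([state[i], [state_out[i]]], axis=0)[1:]
        for i in range(num_units)
    ]
```

The sliding-window update follows RLlib’s attention-net example: each transformer unit always sees its last attention_memory_inference timesteps of memory when an action is computed.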