What does the PPO attention layer do?

I am experimenting with the PPO attention layer, and I am trying to understand what exactly it is supposed to do. I assumed it simply adds a transformer-style attention block so the network can better handle inputs where the position of the inputs may be relevant. However, when I enabled it I saw no change in performance, except that the models with attention were taking longer to train.
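To make my assumption concrete, here is roughly what I pictured the layer doing: self-attention over the pieces of the observation, pooled into a single feature vector that the usual PPO policy/value MLP consumes. This is just a minimal PyTorch sketch of my mental model, not the library's actual code, and all the class and parameter names are made up:

```python
import torch
import torch.nn as nn


class AttentionObsEncoder(nn.Module):
    """Toy encoder: self-attention over observation tokens, then mean-pooling
    into one feature vector for the PPO policy/value heads (my assumption,
    not the real implementation)."""

    def __init__(self, obs_dim: int, embed_dim: int = 64, num_heads: int = 4):
        super().__init__()
        self.proj = nn.Linear(obs_dim, embed_dim)
        self.attn = nn.MultiheadAttention(embed_dim, num_heads, batch_first=True)
        self.norm = nn.LayerNorm(embed_dim)

    def forward(self, obs_tokens: torch.Tensor) -> torch.Tensor:
        # obs_tokens: (batch, num_tokens, obs_dim)
        x = self.proj(obs_tokens)
        attn_out, _ = self.attn(x, x, x)   # self-attention across the tokens
        x = self.norm(x + attn_out)        # residual connection + layer norm
        return x.mean(dim=1)               # pooled feature fed to the PPO MLP


# Example: batch of 8 observations, each split into 5 tokens of size 12
encoder = AttentionObsEncoder(obs_dim=12)
features = encoder(torch.randn(8, 5, 12))
print(features.shape)  # torch.Size([8, 64])
```

If that is more or less what the flag enables, I would have expected at least some difference on tasks where the ordering of observation components matters, which is why the identical results surprised me.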

The documentation does not really explain what happens when attention is enabled. Is my understanding of what the layer is supposed to do wrong?