Ray
Use LSTM model for policy gradient multi-agent with different recurrent hidden states per agent
RLlib
Configure Algorithm, Training, Evaluation, Scaling
kimhyun1018
October 21, 2025, 1:53am
2
Is there any update on this? I’m facing a similar issue
too
.
show post in topic
Related topics
Topic
Replies
Views
Activity
Multi-Agent PPO rollout stalls + “duplicate agent at same env-t” assertion under Connector-v2 (Ray 2.49.2)
RLlib
3
37
October 17, 2025
Multi agent partial parameter sharing
RLlib
2
442
November 30, 2023
Sharing an LSTM cell between policies
RLlib
2
412
July 1, 2021
Decentralized multi agent reinforcement learning
RLlib
4
208
November 2, 2024
Multi agent checkpoints - KeyError: 'default_policy'
RLlib
1
608
October 30, 2021