I have a custom multi-agent environment with multiple groups of agents, where I would like each group to utilize the same policy. I configured this by setting AlgorithmConfig.multi_agent(policies={...}, policy_mapping_fn=...). Having the policy remember a history past states and actions would be ver…

Could you clarify which specific issue or topic you are referring to? There are multiple issues discussed in the data sources, such as Ray job deadlocks, OOM errors, or dashboard persistence. Please specify the context or provide more details so I can give you the most relevant update and sources. I…

@RunLLM Just was wondering if there was any recent update from Ray on configuring a multi-agent shared LSTM policy. I’m having 1. an assertion error or 2. the env not able to progress due to the Connector V2 assigning partial actions in multi-agent settings. Since not all agents act in a step due t…

It seems likely that this issue—multi-agent shared LSTM policies with Connector V2, where not all agents act every step, leading to assertion errors or stuck environments—remains a challenge. The most recent discussions and documentation indicate that RLlib’s multi-agent API expects only those agent…

Use LSTM model for policy gradient multi-agent with different recurrent hidden states per agent

RLlib Configure Algorithm, Training, Evaluation, Scaling

kimhyun1018 October 21, 2025, 1:53am 2

Is there any update on this? I’m facing a similar issue too.

Topic		Replies	Views
Multi-Agent PPO rollout stalls + “duplicate agent at same env-t” assertion under Connector-v2 (Ray 2.49.2) RLlib	3	47	October 17, 2025
Multi agent partial parameter sharing RLlib	2	451	November 30, 2023
Sharing an LSTM cell between policies RLlib	2	415	July 1, 2021
Decentralized multi agent reinforcement learning RLlib	4	235	November 2, 2024
Multi agent checkpoints - KeyError: 'default_policy' RLlib	1	610	October 30, 2021

Use LSTM model for policy gradient multi-agent with different recurrent hidden states per agent

Related topics