MASAC in RLlib does not natively implement a single centralized critic (plus its twin) shared across all agents for centralized training with decentralized execution; instead, each policy maintains its own critic pair. To get one critic truly shared by all agents, you would need to customize the model and training logic yourself, as shown in RLlib's centralized-critic examples and documentation. Those examples provide the general pattern, but MASAC does not offer a single shared critic network out of the box.
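As a rough illustration of the kind of model such a customization would involve, here is a minimal standalone PyTorch sketch of a twin Q-network that conditions on the joint observations and actions of all agents, so a single instance can be shared across every agent's policy. All class and parameter names here are illustrative assumptions, not RLlib API; wiring this into RLlib still requires a custom model and loss as in the linked examples.

```python
import torch
import torch.nn as nn

class SharedTwinCritic(nn.Module):
    """Illustrative centralized twin Q-network for MASAC-style training.

    Both Q-heads see the joint observation and joint action of ALL agents,
    so one instance can be shared across agents. This is a sketch only,
    not RLlib's API: integrating it requires a custom model and training
    logic, per RLlib's centralized-critic examples.
    """

    def __init__(self, n_agents: int, obs_dim: int, act_dim: int, hidden: int = 256):
        super().__init__()
        joint_dim = n_agents * (obs_dim + act_dim)

        def q_net() -> nn.Sequential:
            return nn.Sequential(
                nn.Linear(joint_dim, hidden), nn.ReLU(),
                nn.Linear(hidden, hidden), nn.ReLU(),
                nn.Linear(hidden, 1),
            )

        # Twin critics, as in standard SAC (min over the two reduces bias).
        self.q1, self.q2 = q_net(), q_net()

    def forward(self, joint_obs: torch.Tensor, joint_act: torch.Tensor):
        x = torch.cat([joint_obs, joint_act], dim=-1)
        return self.q1(x), self.q2(x)

# Usage: 3 agents, each with an 8-dim observation and 2-dim continuous action;
# batch of 4 joint transitions.
critic = SharedTwinCritic(n_agents=3, obs_dim=8, act_dim=2)
q1, q2 = critic(torch.randn(4, 3 * 8), torch.randn(4, 3 * 2))
```

During training, every agent's actor update would query this one shared critic, while each agent still acts from its own local observation at execution time.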
Would you like more detail on how to implement a single centralized critic in RLlib?
Sources:
- https://discuss.ray.io/t/centralized-critic-but-decentralized-evaluation/4100
- https://github.com/ray-project/ray/blob/master/rllib/examples/_old_api_stack/models/centralized_critic_models.py
Hint: Mention @RunLLM in the post for follow-ups.