Does RLlib QMIX work with a Tuple of 2 actions?

I tried to use an action space that is a Tuple of 2 Discrete actions:

ACTION_SPACE = gym.spaces.Tuple([
    gym.spaces.Discrete(3), 
    gym.spaces.Discrete(3), 
])

then got this error:

ValueError: The two structures don't have the same nested structure.

First structure: type=tuple str=(0,)

Second structure: type=tuple str=(Discrete(3), Discrete(3))

More specifically: The two structures don't have the same number of elements. First structure: type=tuple str=(0,). Second structure: type=tuple str=(Discrete(3), Discrete(3))
Entire first structure:
(.,)
Entire second structure:
(., .)

If I change the action space to:

ACTION_SPACE = gym.spaces.Tuple([
    gym.spaces.Discrete(3), 
])

then it runs without errors.

Sorry this isn't a reproducible script. I would like to confirm whether the QMIX algorithm works with more than 1 discrete action. If so, is there an example of it? The two_step_game example uses 1 discrete action.

Hi @Jiffer_PengPeng,

Try using MultiDiscrete instead of a Tuple of Discrete.

Hi @mannyv
Thanks so much for the quick reply!
I tried MultiDiscrete:

ACTION_SPACE = gym.spaces.Tuple([
    gym.spaces.MultiDiscrete([3, 5])
])

then got this error:

ValueError: QMix requires a discrete action space, got MultiDiscrete([3 5])

I also tried removing the Tuple:

ACTION_SPACE = gym.spaces.MultiDiscrete([3, 5])

then got

ValueError: Action space must be a Tuple, got MultiDiscrete([3 5]). Use MultiAgentEnv.with_agent_groups() to group related agents for QMix.

Note: I don't think the “Use MultiAgentEnv.with_agent_groups() to group related agents for QMix.” part is related to the problem, since I did do the grouping. That part of the error message isn't accurate.

How many agents are in your environment? QMIX expects a Tuple space with one entry per agent. I am not 100% positive, but looking at this I think it will only accept a Discrete action space for each agent.
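Roughly, this is the shape I think it wants (just a sketch; the group name, agent IDs, and space sizes below are placeholders, not anything from your env): one group containing both agents, with the grouped act_space being a Tuple that has one Discrete entry per agent in that group.

import gym

# Both agents in ONE group; QMIX mixes the Q-values of the agents in a group.
grouping = {"group_1": ["agent_1", "agent_2"]}

# One Tuple entry per grouped agent, each entry a plain Discrete space.
ACTION_SPACE = gym.spaces.Tuple([
    gym.spaces.Discrete(3),  # agent_1
    gym.spaces.Discrete(3),  # agent_2
])

The length of that Tuple has to match the number of agents in the group, which looks like what the nested-structure check in your traceback is comparing.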

@mannyv
I have 2 agents, 1 predator and 1 prey.
The following is the env creation code. RLlibHiWayEnv is a custom env that inherits from Ray's MultiAgentEnv:

    PREDATOR_IDS = ["PRED1"]
    PREY_IDS = ["PREY1"]
    # Note: each agent ends up in its own single-agent group here.
    grouping = {
        'PREY1': PREY_IDS,
        'PRED1': PREDATOR_IDS,
    }
    env = RLlibHiWayEnv(config)
    return env.with_agent_groups(
        grouping, obs_space=OBSERVATION_SPACE, act_space=ACTION_SPACE
    )

If QMIX accepted two discrete action spaces for each agent, it would work (both actions could be homogeneous as well). This is also what I think QMIX's docs say it can do. But currently it looks like QMIX only accepts 1 discrete action per agent.

This will almost certainly hurt performance, but if your action space is small you could define your action space as a single Discrete with d1*d2 actions and then map them to (0,0), (0,1), …, (0,m), (1,0), …, (n,m).
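Something like this, as a sketch of that mapping (D1, D2 and the helper names are placeholders, not anything from RLlib):

import gym

D1, D2 = 3, 5  # sizes of the two original Discrete sub-actions

# What QMIX sees: a single flat Discrete action per agent.
FLAT_ACTION_SPACE = gym.spaces.Discrete(D1 * D2)

def flat_to_pair(flat_action):
    # 0 -> (0, 0), 1 -> (0, 1), ..., D2 -> (1, 0), ..., D1*D2 - 1 -> (D1 - 1, D2 - 1)
    return divmod(int(flat_action), D2)

def pair_to_flat(a1, a2):
    # Inverse mapping, in case you need to go the other way.
    return a1 * D2 + a2

You would do the flat_to_pair translation inside your env's step() before applying the two sub-actions.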

This is a pretty good workaround and probably the only one. I will start training this way and see if it works. Thanks!
