How to vary observation space in multi-agent training using tune.run()

I have a custom multi-agent environment class. It supports different observation_space types that I want to compare in a study. For example:


class MyEnvCls(MultiAgentEnv):
  # etc

config = {
  "env": MyEnvCls,
  "env_config": {
    "obs_type" : tune.grid_search(["type1", "type2"]),
  },
}
config["multiagent"] = {
  "policies" : { # (policy_cls, obs_space, act_space, config)
    "agent_{}".format(x): (None, some_observation_space, MyEnvCls.action_space, {}) for x in range(3)
  },
  "policy_mapping_fn": lambda x: "{}".format(x),
}

tune.run(
  "A3C", 
  name="study",
  config=config, 
  stop=stop, 
)

How do I implement some_observation_space so that when tune runs “type1”, it uses a different gym.Space than when it runs “type2”?

Hi @RickLan,

You can find more info on sample_from here: Tune Custom/Conditional Search Spaces. By the time the sample_from lambda is evaluated, the grid_search value for obs_type has already been resolved, so spec.config.env_config.obs_type gives the concrete value for that trial.

Here is one way to do it.

class MyEnvCls(MultiAgentEnv):
    @staticmethod
    def get_observation_space(obs_type):
        ...

config["multiagent"] = {
  "policies" : tune.sample_from( lambda spec: 
     {"agent_{}".format(x): (None, 
         MyEnvCls.get_observation_space(spec.config.env_config.obs_type),
         MyEnvCls.action_space, {}) for x in range(3)}),
      "policy_mapping_fn": lambda x: "{}".format(x),
}
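
For completeness, here is one possible body for get_observation_space. This is only a sketch: the space types and shapes below are placeholders, so swap in whatever “type1” and “type2” actually mean for your env.

import numpy as np
from gym import spaces
from ray.rllib.env.multi_agent_env import MultiAgentEnv

class MyEnvCls(MultiAgentEnv):
    @staticmethod
    def get_observation_space(obs_type):
        # Hypothetical example spaces -- replace with the real definitions for your env.
        if obs_type == "type1":
            return spaces.Box(low=-1.0, high=1.0, shape=(4,), dtype=np.float32)
        if obs_type == "type2":
            return spaces.Discrete(10)
        raise ValueError("Unknown obs_type: {}".format(obs_type))

With something like that in place, each grid_search trial builds its policies with the observation space that matches its obs_type.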

Thank you @mannyv! Let me try it.