Variable number of agents

rsv · September 12, 2021, 7:13am

Is there way to handle variable number of agents? In my custom environment agents can die or appear in one episode, and therefore their number changes. Because of this a get a different batch size for agent’s observations and other agent observation when trying handle it for the centralized critic. Now I reset the observations of dead agents and store them throughout the entire episode, but I think this approach is not effective

mannyv · September 12, 2021, 11:29am

Hi @rsv,

What I do in this case is something similar to what you said. I always return all the agents that have existed in the environment, in my case it is at most 13, and the agents that are dead have an observation of all 0s and reward of 0. That works well for me.

rsv · September 12, 2021, 2:32pm

Thank you for answer! I did this, but new agents, that spawn during an episode, have batch size less than older.

I want to try set observation function with Repeated space for opponent’s actions, but not shure that it will be work

Topic		Replies	Views
[RLlib] varying the number of agents in multi-agent environments RLlib	3	425	June 11, 2021
What is the proper way to deal with varying observation space? RLlib	7	1512	April 20, 2021
Vectorized multi-agent setup RLlib	3	421	February 12, 2021
Different episode segmentations for different agents in multiagent? RLlib	2	278	June 30, 2022
Question about Environment/Observation construction RLlib	1	385	June 17, 2021

Variable number of agents

Related topics