Hi there! I am working on an environment that some agents are dying.
For example, at the t=0, there might be two agents: agent0, agent1.
Later, some agents might die, so only parts of the agent is alive: agent1.
There could also be some new agents being added: agent1, agent2, agent3.
So the observation space as well as the action space is always changing.
check_shape function in preprocessor always breaks the training.
Does anyone have some idea on this issue? Thanks!!