Multi-agent APPO with variable agent numbers and horizon

Eugene_Vinitsky · April 4, 2022, 6:12pm

How severely does this issue affect your experience of using Ray?

Medium

Hi, I’m trying to use the multi-agent implementation of APPO in an environment that has:

- multiple agents
- a variable number of agents
- variable trajectory lengths

but I am receiving what looks like a masking error:

File "/private/home/eugenevinitsky/.conda/envs/nocturned/lib/python3.8/site-packages/ray/rllib/agents/ppo/appo_torch_policy.py", line 98, in reduce_mean_valid
(APPOTrainer pid=2792319) return torch.sum(t[mask]) / num_valid
(APPOTrainer pid=2792319) IndexError: The shape of the mask [4, 264] at index 0 does not match the shape of the indexed tensor [7, 264] at index 0

Would anyone be able to explain the likely meaning of this error? It’s slightly hard to provide a reproducible example at the moment because the environment is private and custom.
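For what it's worth, the message itself can be reproduced outside RLlib. The following is a minimal sketch that assumes only the shapes from the traceback. The `reduce_mean_valid` here is a stand-in that mirrors the line shown in the traceback, not RLlib's actual implementation, and it says nothing about why the two leading dimensions differ in my setup.

```python
import torch

# Shapes taken from the traceback; the tensor contents are placeholders.
B = 264
values = torch.zeros(7, B)                 # indexed tensor, first dim 7
mask = torch.ones(4, B, dtype=torch.bool)  # boolean mask, first dim 4


def reduce_mean_valid(t, mask):
    # Stand-in mirroring the traceback line: mean over unmasked entries.
    num_valid = torch.sum(mask)
    return torch.sum(t[mask]) / num_valid


try:
    reduce_mean_valid(values, mask)
    error_message = None
except IndexError as exc:
    # Boolean-mask indexing requires the mask shape to match the tensor.
    error_message = str(exc)

print(error_message)

# With a mask whose first dimension matches, the same expression runs.
ok = reduce_mean_valid(values, torch.ones(7, B, dtype=torch.bool))
```

So the mask and the indexed tensor appear to disagree on their first dimension (4 vs. 7), while the second dimension (264) matches.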
