For context: I am doing multi-agent learning with a PPO config, and the number of agents varies during training. I am also just trying to understand RLlib a little better. When I start training (manually, without Tune at the moment), I can see that two environments are created and that both of them are adding agents during training. Why are there two environments? Also, given that my simulation takes up a lot of resources, how would I control how many environment copies get created and keep resource use down?
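For reference, here is a minimal sketch of the kind of setup I mean (a config fragment, not my actual code; `"my_env"` and the policy names are placeholders, and I am assuming the newer `PPOConfig` builder API from recent Ray versions):

```python
from ray.rllib.algorithms.ppo import PPOConfig

config = (
    PPOConfig()
    # Placeholder for my registered multi-agent environment.
    .environment("my_env")
    .multi_agent(
        # Placeholder policy spec; in my real setup agents are added over time.
        policies={"shared_policy"},
        policy_mapping_fn=lambda agent_id, *args, **kwargs: "shared_policy",
    )
    # Each env runner holds its own copy of the environment, so I assume
    # this setting influences how many environment instances exist.
    .env_runners(num_env_runners=1)
)

algo = config.build()  # training manually, without Tune
result = algo.train()
```

Is the number of environment copies I observe tied to settings like `num_env_runners`, and is lowering that the right lever for an expensive simulation?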