Hey all,
I’m working on a PettingZoo environment that can be configured for either:
- A single central agent, or
- Multiple decentralized agents
I’d like to maintain this flexibility but am unsure about the best approach for RLlib (specifically PPO). Here are the options I’m considering:
- Use
.multi_agent
in the PPO config with a single policy for the single agent case – Is this supported in RLlib? - Wrap PettingZoo into a Gymnasium env for the single agent case and use the standard PPO single agent config – More work, but keeps configurability.
- Write a separate Gymnasium env for single-agent – Less elegant but straightforward.
I’d prefer Option 1 if possible—has anyone tried this or know if it’s viable? Are there better alternatives?
Thanks for any insights!