Different learning rates for different agents

cool-RR · October 1, 2023, 10:35am

How severely does this issue affect your experience of using Ray?

High: It blocks me from completing my task.

Hi,

I’m running a MARL experiment with RLlib. My environment is a subclass of MultiAgentEnv. I’ve got six agents/policies that I’m training with PPO.

I want some of the agents to have a learning rate x, while other agents will have a learning rate y. Is that possible?

Alternatively, is it possible to have some agents not learn at all, just continue playing with their existing PPO policy, while other agents play and learn?
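
To make the question concrete, here is a rough plain-Python sketch of the setup I have in mind. It combines both cases: some policies train at rate x, some at rate y, and some are frozen. It does not call RLlib at all. The policy IDs, the `LR_X`/`LR_Y` values, and the helper functions are all hypothetical placeholders. My guess is that the resulting list and dict would map onto something like a `policies_to_train` setting and per-policy config overrides in RLlib's multi-agent config, but that is an assumption on my part, not something I have confirmed.

```python
from typing import Dict, List, Optional

# Placeholder values standing in for "x" and "y"; not tuned.
LR_X = 1e-4
LR_Y = 5e-5

# Hypothetical policy IDs, one per agent. A value of None means
# "keep acting with the current weights, but do not update them".
desired_lr: Dict[str, Optional[float]] = {
    "policy_0": LR_X,
    "policy_1": LR_X,
    "policy_2": LR_Y,
    "policy_3": LR_Y,
    "policy_4": None,
    "policy_5": None,
}


def trainable_policies(lrs: Dict[str, Optional[float]]) -> List[str]:
    """Return the IDs of the policies that should still be updated."""
    return [pid for pid, lr in lrs.items() if lr is not None]


def lr_overrides(lrs: Dict[str, Optional[float]]) -> Dict[str, Dict[str, float]]:
    """Return a per-policy learning-rate setting for each trainable policy."""
    return {pid: {"lr": lr} for pid, lr in lrs.items() if lr is not None}


policies_to_train = trainable_policies(desired_lr)
overrides = lr_overrides(desired_lr)
```

Here `policies_to_train` would hold the four policies that keep learning, and `overrides` would carry the learning rate each of them should use. The two frozen policies appear in neither.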

Thanks for your help,
Ram Rachum.
