Loading pre-trained single-agent policy weights for multi-agent training

jgonik · June 9, 2021, 8:17pm

Hi! I’m currently trying to use RLlib to train on a custom multi-agent car racing environment (essentially a multi-agent version of Gym CarRacing-v0). In my previous workflow without RLlib, I was pre-training a model on the single-agent CarRacing-v0 environment before fine-tuning in the multi-agent environment. Is this something that’s possible with RLlib? For instance, if I have four policies in my multi-agent environment (one for each of four agents), would I be able to save model weights from the single-agent environment and load these weights into the four multi-agent policies? I’d like each policy in the multi-agent environment to start out with the same pre-trained weights.

Thanks so much!

sven1977 · June 11, 2021, 7:36am

Hey @jgonik, great question. We should add an example script to RLlib that shows how to do that.

You can basically do a pre-run using the BCTrainer (ray.rllib.agents.marwil.bc.py). The test case in ray.rllib.agents.marwil.tests.test_bc.py shows how to train from an offline file.
After training your BCTrainer, you can grab the policy's weights and load them into each policy of the multi-agent trainer like this:

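from ray.rllib.agents.marwil import BCTrainer
from ray.rllib.agents.ppo import PPOTrainer

trainer = BCTrainer(...)
...  # <- training
weights = trainer.get_policy().get_weights()  # <- single-agent weights

# Create the actual trainer and load the BC-trained weights into it.
# Each target policy's model must have the same architecture as the BC model.
new_trainer = PPOTrainer(...)
for n in range(4):
    policy = new_trainer.get_policy([the nth policy ID])
    policy.set_weights(weights)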
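
The loading loop above can also be sketched without Ray installed, to show one way of checking that the pre-trained weights actually fit each policy before copying them in. This is only a sketch under assumptions: `DummyPolicy`, `shape`, `load_pretrained`, the `car_0` to `car_3` policy IDs, and the parameter names are all hypothetical stand-ins, and the weights are modelled as a plain dict of nested lists rather than real arrays. Only the `get_weights()` / `set_weights()` pattern comes from the snippet above.

```python
import copy


class DummyPolicy:
    """Hypothetical stand-in for an RLlib policy; only mimics get/set_weights."""

    def __init__(self, weights):
        self._weights = copy.deepcopy(weights)

    def get_weights(self):
        return copy.deepcopy(self._weights)

    def set_weights(self, weights):
        # Deep-copy so the policies never share the same underlying lists.
        self._weights = copy.deepcopy(weights)


def shape(value):
    """Return the shape of a nested list, e.g. [[0, 0], [0, 0]] -> (2, 2)."""
    dims = []
    while isinstance(value, list):
        dims.append(len(value))
        value = value[0] if value else None
    return tuple(dims)


def load_pretrained(weights, policies):
    """Copy the same pre-trained weights into every policy in the dict.

    Raises ValueError if a policy's parameter names or shapes differ.
    """
    expected = {name: shape(w) for name, w in weights.items()}
    for policy_id, policy in policies.items():
        actual = {name: shape(w) for name, w in policy.get_weights().items()}
        if actual != expected:
            raise ValueError(f"weights do not fit policy {policy_id!r}")
        policy.set_weights(weights)


# Assumed example data: one tiny layer, four freshly initialised policies.
pretrained = {"fc/kernel": [[0.5, -0.2], [0.1, 0.3]], "fc/bias": [0.0, 0.1]}
blank = {"fc/kernel": [[0.0, 0.0], [0.0, 0.0]], "fc/bias": [0.0, 0.0]}
policies = {f"car_{n}": DummyPolicy(blank) for n in range(4)}
load_pretrained(pretrained, policies)
```

With real trainers, the same idea would apply to whatever `get_weights()` returns for your framework: compare the parameter names and shapes of the single-agent policy against each multi-agent policy first, since a mismatch in observation space, action space, or model config would make the weights incompatible.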

jgonik · June 11, 2021, 3:48pm

Awesome, thanks so much! I’ll give that a try
