Hello all! As the title mentions, I’m essentially trying to do some transfer learning of model weights from one trained model to another. However, the catch is that the models have differing input/output layers, so I’m only trying to transfer weights of the layers in between. Therefore I haven’t been able to use the regular “restore” option to restore from a checkpoint. What I would also like to do is start everything else in this new training as if it were a brand-new training, except with the weights of specific layers of the model initialized to the pre-trained weights that I’ve transferred over. What would be the best way to accomplish this?
I’ve been playing around with two ideas so far, but I’m not sure I’m on the right track. My main attempt has been to create a new Trainer class that inherits from PPOTrainer and override its
_restore function to splice out the weights I don’t want transferred. But when I enable the “restore” option with a checkpoint from my trained model, the override never seems to run (a breakpoint set inside it is never hit). I’ve also tried overriding
load_checkpoint instead and modifying the policy weights there, and that does seem to work. Is that the right approach here?
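For reference, here’s a rough sketch of the splicing logic I mean, written against plain dicts rather than a real RLlib checkpoint (the layer names, the `splice_weights` helper, and the toy weight values are all made up for illustration — the real dict would be whatever `policy.get_weights()` returns):

```python
def splice_weights(fresh_weights, pretrained_weights,
                   skip_prefixes=("input", "output")):
    """Copy pretrained values into a freshly initialized weight dict,
    skipping any layer whose name starts with one of skip_prefixes
    (i.e. the input/output layers whose shapes differ between models)."""
    spliced = dict(fresh_weights)
    for name, value in pretrained_weights.items():
        if name in spliced and not name.startswith(skip_prefixes):
            spliced[name] = value
    return spliced

# Toy dicts standing in for get_weights() output; shapes of the
# input/output layers intentionally differ between the two models.
fresh = {"input/w": [0, 0], "hidden1/w": [0, 0], "output/w": [0, 0]}
pretrained = {"input/w": [1, 1, 1], "hidden1/w": [1, 1], "output/w": [1]}

merged = splice_weights(fresh, pretrained)
# merged keeps the fresh input/output layers but takes hidden1 from pretrained
```

In the `load_checkpoint` override I’m doing essentially this, then pushing the result back with the policy’s `set_weights`, so everything else (optimizer state, timesteps, etc.) starts fresh.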