Finetuning MBMPO policy

Nehal_Soni · September 13, 2022, 7:31pm

Thank you @arturn for your quick response, it’s helpful.

I understand that MAML policy needs to be fine-tuned and it is possible directly using PPO algorithm of RLlib (This thread mentions it and it has been tested also: MAML finetune adaptation step for inference).

Is there any better approach in RLlib to fine-tune MBMPO policy?

Regards

Topic		Replies	Views
Transfer Learning for Multi-Agent env. with RLlib RLlib	4	843	September 21, 2022
Loading pre-trained BC policy weight for tunning with hyper-parameter optimization Checkpointing, Restoring	1	59	August 28, 2024
ValueError when restoring checkpoint with PPO RLlib	1	549	October 20, 2022
MBMPO Questions & Implementing Model-Based Policy Optimization RLlib	0	419	March 2, 2022
RLLib Multiagent: Load only one policy from checkpoint & Compatibility of RLLib/Tune Checkpoints RLlib	9	3397	November 24, 2021

Finetuning MBMPO policy

Related topics