[RLlib] MAML with another agent than PPO?

lucas_spangher · May 3, 2021, 1:47am

Hello,

We’d like to use SAC with MAML so that we can incorporate some offline training. Is this fairly straightforward in RLlib? If not, can anyone suggest a minimal course of action to implement this?

mannyv · May 4, 2021, 2:15pm

Hi @lucas_spangher,

How are you imagining this training would go?

Like this?

1. Pretrain with CQL with offline data
2. Train with MAML

Or like this, in a loop?

1. Train with CQL for x iterations
2. Train with MAML for y iterations
3. Repeat
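
To make the two options concrete, here is a minimal sketch of both schedules in plain Python. It is not RLlib code: `cql_step` and `maml_step` are hypothetical callables that each run one training iteration (for example, a wrapper around a trainer's `train()` call), and the function names are made up for illustration. It also leaves out how policy weights would be moved between the two trainers, which you would have to handle yourself.

```python
from typing import Callable, List


def run_pretrain_then_meta(
    cql_step: Callable[[], None],
    maml_step: Callable[[], None],
    x: int,
    y: int,
) -> List[str]:
    """Option 1: x offline (CQL) iterations, then y MAML iterations."""
    phases = []
    for _ in range(x):
        cql_step()  # hypothetical: one offline training iteration
        phases.append("cql")
    for _ in range(y):
        maml_step()  # hypothetical: one meta-training iteration
        phases.append("maml")
    return phases


def run_alternating(
    cql_step: Callable[[], None],
    maml_step: Callable[[], None],
    x: int,
    y: int,
    rounds: int,
) -> List[str]:
    """Option 2: repeat (x CQL iterations, y MAML iterations) for `rounds` rounds."""
    phases = []
    for _ in range(rounds):
        phases.extend(run_pretrain_then_meta(cql_step, maml_step, x, y))
    return phases


if __name__ == "__main__":
    # Stub steps that do nothing, standing in for real training calls.
    order = run_alternating(lambda: None, lambda: None, x=2, y=1, rounds=2)
    print(order)
```

The returned list only records which phase ran at each iteration, which makes it easy to check the schedule before plugging in real trainers.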

You could also save offline data from MAML to use with CQL, but there would be some bookkeeping to make sure the environments match.

Also, the RLlib CQL implementation is very new, so there may be some lingering issues that have not been found yet.
