How can I train multiple 'trainer' in same environment?(or embed trained trainer in environment?)

coco · December 19, 2022, 7:21am

How severe does this issue affect your experience of using Ray?

High: It blocks me to complete my task.

Hi,
Is there a way to train multiple trainers(like PPOTrainer …) in same environment?
more exactly, I want to embed trained trainer(lr = 0) in environment while other new trainer being trained. so, new trainer trained in environment that containing trained trainer.

I tried this way: I manipulated the environment to have some agents perform actions calculated from the trained policy, but they were not perfect.
So, I would manipulate training stage.

Can you please help me?
I’m using ray.tune.
Thank you.

arturn · December 20, 2022, 10:58pm

Hi @coco ,

Can you provide a little more information on your setting, please?
Here is what I understand: You use a multi-agent env, you want one agent in the env to be frozen to a perfect policy and another one to train as usual?
If so, you should implement the perfect policy by hand and use Rllib’s multi-agent capabilities.

Have a look at this example if you want to learn more.

Cheers

coco · January 9, 2023, 9:09am

I am modifying my code by referring to this example.
Since my source code is made to have multiple agents follow one policy,
config[‘multiagent’] ['policy_mapping_fn '] was like
" policy_mapping_fn = lambda x : original policy "
By the way, I wanted to make one of these agents follow a policy other than the original policy.
so,

The code was written as shown in the example above.
And then, it’s execution result seemed that ‘one policy’ was assigned to ‘every agent’ except ‘agent0’. (one policy per one agent)
Can you tell me how to modify the code to have ‘multiple agents(exept ‘agent0’)’ refer to ‘only one policy’ ?
thank you. Have a nice day.

mannyv · January 9, 2023, 1:31pm

Hi @coco,

That mapping function should do it. It is hard to say what might be wrong without a full reproduction script. Can you share it?

Topic		Replies	Views
How to train multiple policies in one environment? RLlib	3	450	January 12, 2023
Two different method mapping policy to agents RLlib	1	307	February 2, 2023
How to run multiple trainers? RLlib	2	367	August 26, 2022
Passing trained agents into Trainable RLlib	3	550	September 11, 2022
Decentralised pre-trained policies loaded into multi-agent environment for further training and evaluation RLlib	0	54	June 6, 2024

How can I train multiple 'trainer' in same environment?(or embed trained trainer in environment?)

Related topics