Decentralised pre-trained policies loaded into multi-agent environment for further training and evaluation

giovannivarr · June 6, 2024, 9:43am

How severe does this issue affect your experience of using Ray?

High: It blocks me to complete my task.

Hi, I’m trying to setup a workflow in a modified version of multi-agent WaterWorld (I’m using PettingZoo). First, I want to pre-train two agents individually in a single-agent version of the environment (as of now, I’m training the agents in two copies of this same single-agent environment, one per each of them). After this pre-training phase, I would like to save their checkpoints, and then load their policies to (possibly) further train and evaluate them in the multi-agent version of the environment. Is this possible? I’m also using Ray Tune and would like to keep using that to also do some hyperparameter tuning. Thanks a lot!

[By the way, I saw that there was a very similar question back in 2021, but I suppose that by now the answer to that is obsolete.]

Topic		Replies	Views
How can I train multiple 'trainer' in same environment?(or embed trained trainer in environment?) RLlib	3	493	January 9, 2023
Handling Configurable Multi-Agent vs. Single-Agent Environments Configure Algorithm, Training, Evaluation, Scaling	1	29	May 19, 2025
Best practice for multi-stage training workflow RLlib	3	493	September 6, 2022
How to do MARL with different policies using Ray Tune? Ray Tune	0	382	April 4, 2022
Pre-train one type of policies in MARL Checkpointing, Restoring	0	58	June 18, 2024

Decentralised pre-trained policies loaded into multi-agent environment for further training and evaluation

Related topics