Multi Agent Policies and Checkpoint

qorbanpour · July 1, 2021, 11:16pm

Hi,

I have a multi-agent setup with two agents (two policies). I trained only one of the agents, discarding the second agent’s action and replacing it with my controller’s value, and saved a checkpoint once that agent reached my desired average reward.

My plan was then to load the checkpoint, take the trained agent’s policy and keep it fixed (explore=False), and let the other agent train and learn a new policy. However, I get observations that are outside the observation space and contain “nan” values, right from the first iteration. I don’t have a clue why this is happening. Can anyone suggest what’s going on or give me a clue? That would be appreciated.
Sahand.
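
For readers following along, the second phase described above (one policy held fixed, the other trained) might be configured roughly as in the sketch below. It assumes the older dict-style RLlib config, where the "multiagent" section accepts "policies", "policy_mapping_fn" and "policies_to_train", and where each policy entry can carry a per-policy config override. The policy IDs, agent IDs and the build_multiagent_config helper are hypothetical; none of them come from the original post. The sketch only builds the config dict; restoring the checkpoint and tracking down the source of the "nan" observations are not covered.

```python
# Hypothetical policy IDs; the original post does not name them.
TRAINED_POLICY = "trained_policy"
LEARNING_POLICY = "learning_policy"


def policy_mapping_fn(agent_id, *args, **kwargs):
    """Map an agent ID to a policy ID.

    The agent ID "agent_0" is an assumption. Extra positional/keyword
    arguments are accepted because RLlib versions differ in what they pass.
    """
    return TRAINED_POLICY if agent_id == "agent_0" else LEARNING_POLICY


def build_multiagent_config(obs_space, act_space):
    """Return a partial RLlib config dict for the second training phase.

    obs_space and act_space are the environment's observation and action
    spaces; a policy class of None means "use the trainer's default policy".
    """
    return {
        "multiagent": {
            "policies": {
                # Already-trained policy: exploration switched off.
                TRAINED_POLICY: (None, obs_space, act_space, {"explore": False}),
                # Policy that should learn in this phase.
                LEARNING_POLICY: (None, obs_space, act_space, {}),
            },
            "policy_mapping_fn": policy_mapping_fn,
            # Only policies listed here receive gradient updates.
            "policies_to_train": [LEARNING_POLICY],
        }
    }
```

Listing only the learning policy under "policies_to_train" is what keeps the other policy's weights unchanged; explore=False only makes its action selection deterministic.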
