TF error when restoring from checkpoint, multi-agent

It turns out that agent.compute_action() expects a single agent's (non-dict) observation, but I was passing in the whole observation dict because that is what the environment returns. Here is the working code:

import ray.rllib.agents.a3c as a3c

# rebuild the trainer with the same config used for training,
# then restore the weights from the checkpoint
agent = a3c.A3CTrainer(config=config, env=AgentEnv)
agent.restore(args.checkpoint_path)

# instantiate the env class directly for the evaluation rollout
env = AgentEnv(env_config)

# run until the episode ends or the step limit is reached
done = False
episode_length = 50
length_count = 0
obs = env.reset()
while not done and (length_count <= episode_length):

  # compute_action() takes a single agent's observation, not the whole
  # multi-agent obs dict, so query each policy with its own entry
  action_dict = {}
  for policy_id in config["multiagent"]["policies"].keys():
    action_dict[policy_id] = agent.compute_action(obs[policy_id], policy_id=policy_id)

  # the env returns per-agent dicts; "__all__" signals the episode is over
  obs, reward, done_dict, info = env.step(action_dict)
  length_count += 1
  done = done_dict["__all__"]
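
For reference, here is a minimal sketch of the kind of multiagent config the loop above assumes (the agent names, spaces, and values are illustrative, not my actual setup): the policy IDs double as the agent IDs that AgentEnv uses as keys in its observation dict, which is what makes obs[policy_id] hand the right per-agent observation to compute_action().

from gym.spaces import Box, Discrete

# placeholder per-agent spaces -- replace with whatever AgentEnv actually uses
obs_space = Box(low=-1.0, high=1.0, shape=(4,))
act_space = Discrete(2)

config = {
  "env_config": env_config,
  "multiagent": {
    # policy_id -> (policy_cls, obs_space, act_space, per-policy config);
    # None tells RLlib to use the trainer's default policy class (A3C here)
    "policies": {
      "agent_0": (None, obs_space, act_space, {}),
      "agent_1": (None, obs_space, act_space, {}),
    },
    # each agent maps to the policy with the same name, so the rollout
    # loop can iterate over the policy IDs and index into obs directly
    "policy_mapping_fn": lambda agent_id: agent_id,
  },
}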