Potential bug in client server setup with policy mapping functions

Blubberblub · August 18, 2022, 1:51pm

How severe does this issue affect your experience of using Ray?

Medium: It contributes to significant difficulty to complete my task, but I can work around it.

It set up a client server training setup that follows the cartpole client/server example. The environment is a multi agent setup and therefore uses a policy mapping function. When i run the client and the server setup on the same machine everthing works perfectly for remote and local action inference.

Problem:
When i run the client on another machine it works in remote inference mode but not in local mode. I get:
KeyError: “policy_mapping_fn returned invalid policy id ‘None’!”

My policy function looks like this:

def policy_mapping_fn(agent_id, episode, worker, **kwargs):
    if agent_id.startswith("p_d_"):
        return "pot_decider_policy"
    elif agent_id.startswith("p_m_"):
        return "pot_move_policy"

Theoretically it is possible that it returns None (if the agent_id isn’t matching the patterns in my if statements) but when i add an additional else statement to my mapping function to catch the agent_id that caused the problem i get an agent_id that should actually match my if-statements…

Debugging attempt:

def policy_mapping_fn(agent_id, episode, worker, **kwargs):
    if agent_id.startswith("p_d_"):
        return "pot_decider_policy"
    elif agent_id.startswith("p_m_"):
        return "pot_move_policy"
    else:
        print(agent_id)

Result:
The else statement prints “p_m_0_0” before the error is thrown.

I have no idea what could cause the issue here so if anyone could provide help or a hint where to look it would help me out a lot! Thanks in advance!

mannyv · August 26, 2022, 2:01pm

HI @Blubberblub,

Is this still an issue or did you figure it out?

Do you have a reproduction script or error stack trace you could share?

Blubberblub · August 29, 2022, 6:18am

@mannyv I didn’t finde the issue so far. So i settled with server-side action generation for now. I didn’t yet find the time to build a reproduction script using the server client example and replacing the env with a multi agent env that actually has a policy_mapping_fn. If the problem occurs as well in this case i will create an issue on github.

Blubberblub · September 1, 2022, 1:22pm

Problem was solved by upgrading to ray 2.0.

Topic		Replies	Views
ValueError: Could not find policy for agent: agent policy id `<SumoMultiEnv.SUMOAgent object at 0x7fe32e53d1f0>` not in policy map keys dict_keys RLlib	1	265	July 23, 2021
Agent_key and policy_id mismatch on multiagent ensemble training RLlib	9	913	March 30, 2021
[HIGH] TypeError: policy_mapping_fn() takes 1 positional argument but 2 were given RLlib	0	211	December 3, 2023
Ray 1.6.0 Impala multiagent, PolicyID 'default_policy' not found in this PolicyMap RLlib	1	411	July 25, 2022
Ray 2.4.0 (RLLib) Completely lost with documentation RLlib	1	353	December 19, 2023

Potential bug in client server setup with policy mapping functions

Related topics