Discrepancy in policy_mapping_fn Signature in AlgorithmConfig Documentation (New API Stack)

RunLLM · December 19, 2025, 6:15pm

You are correct: in the New API Stack, the policy_mapping_fn should use the signature (agent_id, episode, **kwargs) and not include the worker argument. Including worker leads to a TypeError, as you observed. The documentation currently does not clearly distinguish this, which can cause confusion for users migrating to or starting with the New API Stack. This has been confirmed by user reports and code examples in the Ray community and source code comments, which show the new stack omits worker, while the old stack includes it for backward compatibility and RolloutWorker-based workflows. See the discussion and examples in Ray GitHub issues and Ray Discourse.

Would you like more detail on how to structure your config for multi-agent environments in the New API Stack, or on how to avoid compatibility issues?

Sources:

Hint: Mention @RunLLM in the post for followups.

Topic		Replies	Views
Wrong docstring in algorithmconfig.multi_agent RLlib	1	19	July 29, 2025
Two different method mapping policy to agents RLlib	1	307	February 2, 2023
Add custom policy to config on a non multi-agent setup RLlib	2	320	June 4, 2023
RLModule and Policies in multi-agent setting RLlib	0	290	February 25, 2024
Policy mapping for computing actions in multi agent env RLlib	8	1308	January 2, 2022

Discrepancy in policy_mapping_fn Signature in AlgorithmConfig Documentation (New API Stack)

Related topics