Accessing other agents' rewards and actions in ppo loss for multi agent environment

miladink · January 12, 2024, 8:05am

How severe does this issue affect your experience of using Ray?

High: It blocks me to complete my task.

Hi All! I have a multi agent environment. Currently, each agent is being trained via PPO. They only receive the reward, observation, and action they have token in the loss function. However, I am trying to change the PPO to another algorithm I am researching on so it also take into account the rewards and actions of others. But, I don’t know how I can access it or even if this is possible at RLlib.

Topic		Replies	Views
How to deploy a trained Ray RLlib PPO policy/model in multi-agent-case? RLlib	5	828	November 10, 2021
Multi-Agent Training with Different Algorithms RLlib	24	3494	October 11, 2022
Different learning rates for different agents RLlib	0	140	October 1, 2023
Help with ppo config in multiagent env with complex observations Configure Algorithm, Training, Evaluation, Scaling	0	44	April 11, 2025
Workflow for Multi-Agent training RLlib	2	375	January 12, 2022

Accessing other agents' rewards and actions in ppo loss for multi agent environment

Related topics