Multiagent checks on info on step function is not aligned with Gymnasium

zoe_tsekas · October 8, 2023, 10:25pm

How severe does this issue affect your experience of using Ray?

Medium: It contributes to significant difficulty to complete my task, but I can work around it.

For a multiagent environment, The info returned requires to have same keys as the future observations as implemented in Ray 2.7 Multiagent checks for step function while in official Gymnasium documentation It states:

info (dict ) – Contains auxiliary diagnostic information (helpful for debugging, learning, and logging). This might, for instance, contain: metrics that describe the agent’s performance state, variables that are hidden from observations, or individual reward terms that are combined to produce the total reward.

Please align checks to official spec

Topic		Replies	Views
Callback methods available? RLlib	0	197	January 19, 2024
Episode.last_info_for() always returns empty dictionary in custom callback RLlib	2	592	June 30, 2021
Setting global info state in Multi-Agent step function RLlib	0	221	December 9, 2020
MultiAgent training Issues RLlib	1	457	April 9, 2024
New observation and action spaces in Ray 2.0 RLlib	3	317	October 27, 2022

Multiagent checks on info on step function is not aligned with Gymnasium

Related topics