How to extract gradient, state, and reward information from trainer.evaluate?

aadharna · April 5, 2022, 5:42am

So, I have a rollout function that returns the following structure:

             Rollout_results(info=infos,
                             states=states,
                             values=values,
                             actions=actions,
                             rewards=rewards,
                             win=win,
                             logps=logps,
                             entropies=entropies,
                             dones=dones,
                             net_info=network_infos)

The majority of this info is useful in downstream calculations (e.g., computing GAE)

However, so that I don’t duplicate work that’s already done here in rllib, I want to switch to using the ‘trainer.evaluate()’ functions instead since that will gracefully handle cases like single-agent and multi-agent under the hood.

Is there a way to get all this info out of the trainer.evaluate function?

Topic		Replies	Views
Extracting and storing per step agent state from RLlib rollouts RLlib	3	333	July 23, 2021
`rllib rollout` command seems to be training the network, not evaluating RLlib	3	761	January 22, 2021
Train / Evaluate hist stats not even close to matching manual evaluation stats RLlib	1	201	March 21, 2023
How can I get evaluation metrics from ExperimentAnalysis Ray Tune	2	701	February 8, 2022
Can ray allow access to individual episodes? RLlib	5	467	September 22, 2021

How to extract gradient, state, and reward information from trainer.evaluate?

Related topics