It’s actually quite hacky to do this on top of 1.0.1 (several files need to be changed because of bugs).
The recommended way is to use the current master (or upcoming 1.2.x), where this has been fixed.
You will get the env infos automatically in your loss or postprocessing function, provided these functions actually need that field, i.e. access it during the initial test pass.
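A minimal sketch of the idea (not actual RLlib code; a plain dict stands in for RLlib's `SampleBatch` here): the user-defined postprocessing function simply reads the `"infos"` column, and because that access also happens during the test pass, the infos column is kept in the batches handed to the function. The `"score"` key and `custom_metric` column are made-up examples.

```python
# Sketch only: `sample_batch` stands in for RLlib's dict-like SampleBatch.
# "infos" holds the per-timestep info dicts returned by env.step().
def postprocess_fn(sample_batch):
    infos = sample_batch["infos"]  # this access is what signals "I need infos"
    # e.g. derive a custom column from the env infos ("score" is hypothetical)
    sample_batch["custom_metric"] = [info.get("score", 0.0) for info in infos]
    return sample_batch

batch = {
    "obs": [[0.1], [0.2]],
    "infos": [{"score": 1.0}, {"score": 2.0}],
}
print(postprocess_fn(batch)["custom_metric"])  # -> [1.0, 2.0]
```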
Documentation is in flight (a doc PR is in review).
Yes, the speedup on Atari for PPO was ~20%. For more “learn-heavy” algorithms (many updates relative to action inference), such as DQN or SAC, it’s not really faster, but definitely not slower either.