[RLLIB] How can I modify the collected data before it is thrown into the replay buffer? Any help will be appreciated. Thanks a lot.
The easiest way to do this is by wrapping the environment that you give to RLlib. You can modify data right there.
A classic place would be postprocess_trajectory_fn
, which is essentially a callback where you can modify sampled trajectories.