Hello,
I’d like to initialize the replay buffer in SAC (or any other off-policy algo) with experience from a non-RL policy. I want the RL agent to start out learning from this policy. What is the best way to do this?
I did something similar for the offline RL agent CQL: ray/cql.py at master · ray-project/ray · GitHub
The key part is to add the dataset to the replay buffer in the `after_init` function of the Trainer template.
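In case it helps, here is a rough sketch of that pattern for SAC. It assumes the Ray 1.x Trainer-template API (`SACTrainer.with_updates`, a `local_replay_buffer` attribute on the built trainer, and `JsonReader` for RLlib's JSON offline format); the path, batch count, and exact buffer method are illustrative and may differ across Ray versions, so treat this as a starting point rather than the exact `cql.py` code:

```python
# Sketch: pre-fill SAC's replay buffer with experiences recorded from a
# non-RL policy, using an after_init hook on the Trainer template.
# Assumptions (hedged): Ray 1.x API, trainer.local_replay_buffer exists,
# LocalReplayBuffer exposes add_batch(); check your Ray version.
from ray.rllib.agents.sac import SACTrainer, DEFAULT_CONFIG
from ray.rllib.offline.json_reader import JsonReader

# Illustrative path to JSON experiences previously recorded from the non-RL policy.
OFFLINE_DATA_PATH = "/tmp/demo-out"


def after_init(trainer):
    """Pre-fill the trainer's local replay buffer from an offline dataset."""
    reader = JsonReader(OFFLINE_DATA_PATH)
    num_batches_to_preload = 1000  # illustrative; size to your dataset
    for _ in range(num_batches_to_preload):
        batch = reader.next()
        # Older LocalReplayBuffer versions expose add_batch(); newer ones
        # may use a different method name.
        trainer.local_replay_buffer.add_batch(batch)


# Build a SAC variant that runs the pre-fill step right after setup.
# Training then continues with the normal env sampler, so the agent
# starts from the demonstration data and keeps learning online.
PrefilledSAC = SACTrainer.with_updates(
    name="SACWithPrefilledBuffer",
    after_init=after_init,
)

config = DEFAULT_CONFIG.copy()
trainer = PrefilledSAC(config=config, env="Pendulum-v0")
```

The offline file itself can be produced by rolling out your non-RL policy and writing the transitions with RLlib's `SampleBatchBuilder` / `JsonWriter` (see the "saving experiences" example in the RLlib offline-data docs).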