How to train a SAC agent with the offline API?

Hi!

If I want to store training data for a SAC trainer with a `SampleBatchBuilder`, what data do I have to pass to each call to `add_values()`?

Hey @trustee , thanks for posting this question. I think this example here would answer your question:

https://docs.ray.io/en/latest/rllib/rllib-offline.html#example-converting-external-experiences-to-batch-format
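In case that link moves, here's the gist sketched with plain dicts, so you can see which columns each per-timestep `add_values()` call takes (the names follow RLlib's `SampleBatch` columns from that docs example; the toy env, float observations, and episode length here are just placeholders for your real setup):

```python
import random

# Minimal sketch of the columns RLlib's SampleBatchBuilder.add_values()
# expects once per timestep. A fake random-walk "env" stands in for
# your real one; observations here are plain floats.
def rollout_episode(eps_id, horizon=5):
    rows = []
    obs, prev_action, prev_reward = 0.0, 0.0, 0.0
    for t in range(horizon):
        action = random.uniform(-1.0, 1.0)   # your policy/controller here
        new_obs = obs + action               # fake env transition
        reward = -abs(new_obs)               # fake reward
        done = t == horizon - 1
        # One add_values() call per timestep takes exactly these keys:
        rows.append(dict(
            t=t, eps_id=eps_id, agent_index=0,
            obs=obs, actions=action, action_prob=1.0,
            rewards=reward, prev_actions=prev_action,
            prev_rewards=prev_reward, dones=done,
            infos={}, new_obs=new_obs,
        ))
        obs, prev_action, prev_reward = new_obs, action, reward
    return rows

episode = rollout_episode(eps_id=0)
```

In the actual docs example you'd pass these as `batch_builder.add_values(**row)` each step, then call `writer.write(batch_builder.build_and_reset())` at episode end, with the `JsonWriter` pointed at the output directory you later reference in the `input` config.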

Once that’s done, you can train your SAC agent also with mixed input, like so:

config:
  input:
    [location (str) of your json output files from the SampleBatchBuilder example above]: 0.5
    sampler: 0.5

This would draw experiences from the actual env (sampler) 50% of the time, with the other 50% of SAC's training data read from the JSON files. That's just an example; you can set the ratios to whatever you like, as long as they sum to 1.0.
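If you're building the config in Python rather than YAML, the same mixed input is just a dict mapping each input source to its sampling probability (the path below is a placeholder for wherever your JsonWriter wrote its files; `"sampler"` is RLlib's name for the live env):

```python
# Mixed offline/online input for SAC: map each input source to the
# probability it is sampled from on a given rollout.
config = {
    "input": {
        "/tmp/demo-out": 0.5,  # placeholder: dir with your JSON batch files
        "sampler": 0.5,        # collect fresh experience from the env
    },
}

# The source probabilities should sum to 1.0.
assert abs(sum(config["input"].values()) - 1.0) < 1e-9
```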