I am looking for a way to save and load the replay buffer for off-policy methods. I am looking into this as a transfer-learning-like approach. Please let me know.
There is currently no support for this. I am sure they would welcome a PR if you implement it; then we could add it to the save_state methods. Currently there are often performance drops when resuming training with off-policy algorithms because, even though the policy weights are restored, the replay buffer restarts empty.
Hey @axr8716, yeah, what @mannyv said :). It's not supported right now, but it's on our TODO list.
The problem is that the replay buffer objects currently only “sit” inside the Trainer’s execution plan function, so we have no reference to them from within the Trainer or the Policies. We need to make these objects registrable (via some convention) so that they become accessible when saving/restoring a Trainer. Feel free to do a PR that fixes this; a simple POC that only works for e.g. SimpleQ would suffice to get this rolling.
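To make the idea concrete, here is a minimal sketch (not RLlib code; the `ReplayBuffer`, `Trainer`, `get_state`/`set_state`, and `save_checkpoint`/`restore_checkpoint` names are all hypothetical) of what “registering” the buffer on the Trainer could look like, so that its contents get checkpointed and restored alongside the policy weights instead of being lost inside the execution plan:

```python
import pickle


class ReplayBuffer:
    """Minimal FIFO buffer, used only to illustrate the idea."""

    def __init__(self, capacity: int):
        self.capacity = capacity
        self.storage = []

    def add(self, transition):
        # Drop the oldest transition once capacity is reached.
        if len(self.storage) >= self.capacity:
            self.storage.pop(0)
        self.storage.append(transition)

    def get_state(self):
        # Everything needed to reconstruct the buffer exactly.
        return {"capacity": self.capacity, "storage": self.storage}

    def set_state(self, state):
        self.capacity = state["capacity"]
        self.storage = state["storage"]


class Trainer:
    """Stand-in for a Trainer that keeps a reference to its buffer."""

    def __init__(self):
        # In RLlib today this object lives only inside the execution plan;
        # the point of the sketch is that the Trainer holds a reference to it.
        self.replay_buffer = ReplayBuffer(capacity=50_000)
        self.policy_weights = {}

    def save_checkpoint(self, path: str):
        state = {
            "weights": self.policy_weights,
            "replay_buffer": self.replay_buffer.get_state(),
        }
        with open(path, "wb") as f:
            pickle.dump(state, f)

    def restore_checkpoint(self, path: str):
        with open(path, "rb") as f:
            state = pickle.load(f)
        self.policy_weights = state["weights"]
        # Restoring the buffer avoids the "restarts empty" performance drop.
        self.replay_buffer.set_state(state["replay_buffer"])
```

The only real design decision here is that the buffer exposes `get_state`/`set_state` and is reachable from the Trainer, so whatever convention is used for registration, the existing save/restore path can simply include the buffer's state in the checkpoint dict.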