Replay buffer - simple how-to question

Mike · October 5, 2021, 3:51pm

Hello,
I’m a bit lost reading the documentation so I thought I could ask a simple question here if you may.

I have a simple task which is to store experience tuples (S, A, R, S′) as weel as models into a replay buffer and eventually sample them (using any existing approaches if any at all)
I can’t find the right way to do it
Also, I’d like this replay buffer to be shared accross multiple nodes and stored somehow so that I can reload it later.

Does it make any sense?

nb: I’m using pytorch

mannyv · October 7, 2021, 12:56pm

Hi @Mike,

Welcome to the forums. I am not sure how to go about answering your question because I am not sure what you are trying to do.

Are you planning on using on of the built in algorithms or implement your own algorithm from scratch?

Can you provide more information on your bigger picture plan?

Mike · October 7, 2021, 1:01pm

Hi @mannyv ,
From scratch. All I’m interested in right now is a distributed and persistent replay buffer.

Thanx.

Topic		Replies	Views
Offline training using previous obs+action=reward tuples RLlib	1	298	May 24, 2021
Load/save replay buffer RLlib	5	783	September 18, 2022
My prioritised replay buffer slows down my code massively RLlib	2	479	January 5, 2023
Can i check the Replay buffer? RLlib	4	378	July 27, 2021
Sample form the reply buffer sequentially in DQN RLlib	1	212	June 27, 2022

Replay buffer - simple how-to question

Related topics