I’m a bit lost reading the documentation, so I thought I could ask a simple question here, if I may.
I have a simple task: store experience tuples (S, A, R, S′) as well as models in a replay buffer, and eventually sample from it (ideally using an existing implementation, if there is one).
I can’t find the right way to do it.
Also, I’d like this replay buffer to be shared across multiple nodes and persisted somehow so that I can reload it later.
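To make the question concrete, here is roughly the behaviour I’m after, as a plain-Python sketch. The `ReplayBuffer` class and its method names are made up for illustration and are not taken from any library. It only covers storing tuples, uniform sampling, and saving/reloading on a single machine with `pickle`; it does not address sharing across nodes, which is the part I’m least sure about.

```python
import pickle
import random
from collections import deque


class ReplayBuffer:
    """Hypothetical fixed-capacity buffer of (state, action, reward, next_state) tuples."""

    def __init__(self, capacity, seed=None):
        # A deque with maxlen silently drops the oldest item once it is full.
        self.storage = deque(maxlen=capacity)
        self.rng = random.Random(seed)

    def add(self, state, action, reward, next_state):
        self.storage.append((state, action, reward, next_state))

    def sample(self, batch_size):
        # Uniform sampling without replacement; raises ValueError
        # if batch_size is larger than the number of stored items.
        return self.rng.sample(list(self.storage), batch_size)

    def __len__(self):
        return len(self.storage)

    def save(self, path):
        # Persist the stored tuples so the buffer can be rebuilt later.
        with open(path, "wb") as f:
            pickle.dump(list(self.storage), f)

    @classmethod
    def load(cls, path, capacity, seed=None):
        buffer = cls(capacity, seed)
        with open(path, "rb") as f:
            buffer.storage.extend(pickle.load(f))
        return buffer
```

Usage would look something like `buf.add(s, a, r, s_next)`, then `batch = buf.sample(32)`, then `buf.save("buffer.pkl")` and later `ReplayBuffer.load("buffer.pkl", capacity=...)`.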
Does that make sense?
NB: I’m using PyTorch.