Creating buffer for PPO

hossein836 · July 6, 2023, 5:49am

How severe does this issue affect your experience of using Ray?

High: It blocks me to complete my task.

Hi
To stabilize my agent further, I’m going to try using a buffer for PPO. It’s worth mentioning that OpenAI Five also used a buffer, although not in the same way I intend to use it.

My goal is to collect experiences from multiple episodes and sample X sequences with a sequence length of Y to train my model (my model is RNN). Once selected, these experiences should be removed from the buffer.

I have absolutely no idea where to start, and I’m also unsure whether the buffer can utilize RAM or GPU RAM, depending on the implementation. Additionally, I would like to know which type of RAM is being used for buffering in RLlib.

hossein836 · July 21, 2023, 8:12am

guys I really need help on this, can anyone give me a clue?

Topic		Replies	Views
Expected RAM usage for PPOTrainer (debugging memory leaks) RLlib	10	955	September 15, 2022
Add the experiences to the buffer "by hand" RLlib	7	953	December 14, 2021
RNN support + RAM usage for RL algorithms RLlib	2	218	January 17, 2023
RLLib PPO Trainer allocating additional memory on second training iteration RLlib	0	300	July 21, 2022
PPO with PyTorch GPU has a RAM memory leak for Ray 1.6.0 RLlib	5	673	October 5, 2021

Creating buffer for PPO

Related topics