Memory Leak when training PPO on a single agent environment

MrDracoG · December 20, 2022, 5:03pm

Here is the ram usage outside of docker ( 2 workers )…

ram_util_percent_1

The line seems to be fairly flat and, to me, this seems to signal that training inside of a docker container may be causing the memory leak .

Here is the other thread that referenced docker containers and linux cgroups related to a memory leak: Help debugging a memory leak in rllib

I would also like to note that I don’t think there was a memory leak when using a single worker ( and no gpu ) inside of a docker container. I will go back and check that out.

Topic		Replies	Views
Help debugging a memory leak in rllib RLlib	21	3894	September 25, 2022
Expected RAM usage for PPOTrainer (debugging memory leaks) RLlib	10	953	September 15, 2022
PPO trainer eating up memory RLlib	9	2346	April 2, 2021
[RLlib] GPU Memory Leak? Tune + PPO, Policy Server + Client RLlib	18	1218	May 29, 2023
[RLlib][Tune] Major memory leak 80GB (!) in 3 days (!) RLlib	1	340	June 3, 2021

Memory Leak when training PPO on a single agent environment

Related topics