Hi Ray team, I’m running into a potential memory leak on the Ray head node in the latest Ray 1.13. It’s very similar to this discussion: Memory leak in ray head
- I’m using Ray 1.13, deployed on a k8s cluster.
- The head node’s memory keeps increasing slowly, and after about 4 days it runs out of memory (the limit is 3GB in my setup). Interestingly, for most of this period the cluster is idle and no tasks are sent to it. Here’s the dashboard screenshot from when it’s about to OOM:
- Here’s the output of the top command on the head node:
- I also tried ray memory, but I can’t see any red flags in its output.
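
To narrow down which process on the head node is actually growing (for example gcs_server, raylet, or the dashboard), per-process resident memory could be sampled over time. The following is only a minimal sketch, assuming a Linux container where /proc is readable; the function names and the sampling interval are hypothetical and not part of Ray:

```python
import os
import time


def parse_status(status_text):
    """Return (name, rss_kb) parsed from the text of /proc/<pid>/status.

    rss_kb is None when the process has no VmRSS line (e.g. kernel threads).
    """
    name, rss_kb = None, None
    for line in status_text.splitlines():
        if line.startswith("Name:"):
            name = line.split(":", 1)[1].strip()
        elif line.startswith("VmRSS:"):
            rss_kb = int(line.split()[1])  # value is reported in kB
    return name, rss_kb


def snapshot_rss(proc_root="/proc"):
    """Map pid -> (name, rss_kb) for every readable process."""
    result = {}
    for entry in os.listdir(proc_root):
        if not entry.isdigit():
            continue
        try:
            with open(os.path.join(proc_root, entry, "status")) as f:
                name, rss_kb = parse_status(f.read())
        except OSError:
            continue  # process exited or is not readable
        if rss_kb is not None:
            result[int(entry)] = (name, rss_kb)
    return result


def top_growth(before, after, n=5):
    """Return up to n (pid, name, growth_kb) tuples, largest growth first."""
    growth = [
        (pid, name, rss_kb - before[pid][1])
        for pid, (name, rss_kb) in after.items()
        if pid in before
    ]
    growth.sort(key=lambda item: item[2], reverse=True)
    return growth[:n]


def log_growth(samples, interval_s):
    """Print the fastest-growing processes after each sampling interval."""
    previous = snapshot_rss()
    for _ in range(samples):
        time.sleep(interval_s)
        current = snapshot_rss()
        for pid, name, delta in top_growth(previous, current):
            print(f"{time.ctime()} pid={pid} name={name} rss_growth_kb={delta}")
        previous = current
```

Calling something like log_growth(samples=96, interval_s=3600) inside the head pod would cover roughly the 4 days it takes to reach OOM. Note that the name in /proc/<pid>/status is truncated to 15 characters, so the pid may be needed to tell similar processes apart.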
Any help would be appreciated, and I’m happy to provide more information.
Thanks,
-BS