I use `ray[serve]` as the server and send requests over HTTP. After sending requests for several hours, I found that the memory used by Ray's actors keeps increasing (over 90 GB for 20 actors), and the growth is roughly linear. How can I control the total memory used by Ray?
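To make the question concrete: the only memory-related option I can see on the deployment side is the `memory` entry in `ray_actor_options`. Below is a minimal sketch of what I mean (the 2 GB figure is arbitrary); as far as I can tell this is only a scheduling resource rather than an enforced cap, which is why I am asking how to actually bound the total memory.

```python
from ray import serve

# Sketch only, not my working code. The "memory" value (2 GB here) is an
# arbitrary example; my understanding is that it is a scheduling hint,
# not a hard per-replica limit.
@serve.deployment(
    name="test_qps",
    route_prefix="/test_qps",
    ray_actor_options={"num_cpus": 1, "memory": 2 * 1024 ** 3},
    num_replicas=1,
)
class QpsTest:
    async def __call__(self, request):
        ...
```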
- OS: macOS 10.15.7 / Ubuntu 18.04 LTS
- Ray version: 1.7.0/1.4.1
- Python version: 3.7.11
- Installation:
```
$ cat requirements
ray[serve]==1.7.0
psutil
requests
$ pip install -r requirements
```
The code for reproduction is below. I have tried `object_store_memory`, but it doesn't work for Ray Serve.
```python
import json
import os

import numpy as np
import psutil
import ray
import requests
from ray import serve


@serve.deployment(
    name="test_qps",
    route_prefix="/test_qps",
    ray_actor_options={"num_cpus": 1},
    num_replicas=1,
)
class QpsTest:
    def __init__(self):
        pass

    async def __call__(self, request):
        step_cnt = await request.json()
        state = np.random.randint(0, 255, (1000, 1000), np.uint64)
        process = psutil.Process(os.getpid())
        proc_mem = process.memory_info().rss / (1024 ** 2)
        print(f'actor_pid={process.pid} \t mem={proc_mem:6.1f} MB.')
        return state


if __name__ == '__main__':
    ray.init(num_cpus=1, dashboard_host="0.0.0.0", object_store_memory=150_000_000)
    client = serve.start(http_options={"host": "0.0.0.0"})
    QpsTest.deploy()

    step_no = 5000000
    step_cnt = 0
    url = "http://127.0.0.1:8000/test_qps"
    while step_cnt < step_no:
        req = requests.post(url, data=json.dumps(step_cnt))
        step_cnt += 1
        process = psutil.Process(os.getpid())
        proc_mem = process.memory_info().rss / (1024 ** 2)
        print(f'main_pid={process.pid} \t mem={proc_mem:6.1f} MB.')
        print('-' * 30)
```
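For completeness, this is roughly how I watch the overall footprint outside of the per-actor prints above. It is a hypothetical helper, not part of the repro: it just sums the RSS of local processes whose name or command line looks like a Ray worker or the raylet, so the matching is approximate.

```python
import psutil


def total_ray_rss_mb():
    """Approximate total RSS (MB) of local Ray processes.

    Matches workers by their "ray::" proctitle and the raylet by name,
    so the filter is heuristic rather than exact.
    """
    total = 0
    for proc in psutil.process_iter(attrs=["name", "cmdline", "memory_info"]):
        info = proc.info
        if info["memory_info"] is None:
            continue
        name = info["name"] or ""
        cmd = " ".join(info["cmdline"] or [])
        if "ray::" in name or "ray::" in cmd or "raylet" in name:
            total += info["memory_info"].rss
    return total / (1024 ** 2)


if __name__ == "__main__":
    print(f"total Ray RSS = {total_ray_rss_mb():.1f} MB")
```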