Resources used by HTTPProxyActor

hbfernandes · February 9, 2021, 5:51pm

I’ve been testing Serve to create a few endpoints, and I noticed that as soon as I run serve.start(), each worker starts up an instance of HTTPProxyActor.

What is this Actor’s purpose?
It has 1 CPU resource assigned. This seems like a lot; can it be configured to a different value?

Thank you.

simon-mo · February 9, 2021, 8:31pm

Hi, the actor is a proxy that listens on the HTTP port. In the new version of Ray (1.2.0), we only start one of them, on the head node. (The new version should be coming out this week!)

hbfernandes · February 10, 2021, 5:20pm

Will it be possible to change the resources of the HTTPProxyActor in the new release?

simon-mo · February 11, 2021, 5:38pm

No, not yet. Can I ask why you need to change it? If you want to put more models into the system, you can start Ray with more workers by running ray.init(num_cpus=16*2) or ray start --num-cpus=32 (assuming your system has 16 CPUs).
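As a rough sketch of the ray.init(num_cpus=16*2) suggestion, the snippet below derives the advertised CPU count from the machine's actual core count. The helper names oversubscribed_cpus and start_ray_oversubscribed and the factor parameter are made up for illustration; only ray.init(num_cpus=...) is an actual Ray call. Ray treats num_cpus as a logical scheduling resource and does not pin or limit real cores, which is why advertising more than the machine has works at all.

```python
import os


def oversubscribed_cpus(physical_cpus: int, factor: int = 2) -> int:
    """Logical CPU count to advertise to Ray (hypothetical helper)."""
    if physical_cpus < 1 or factor < 1:
        raise ValueError("physical_cpus and factor must be positive")
    return physical_cpus * factor


def start_ray_oversubscribed(factor: int = 2) -> int:
    """Start Ray locally, advertising more CPUs than the machine has."""
    # Imported lazily so the helper above can be used without Ray installed.
    import ray

    num_cpus = oversubscribed_cpus(os.cpu_count() or 1, factor)
    # num_cpus is only a scheduling hint; processes still share the real cores.
    ray.init(num_cpus=num_cpus)
    return num_cpus
```

On a 16-core machine, start_ray_oversubscribed() would call ray.init(num_cpus=32), matching the example above.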

hbfernandes · February 12, 2021, 11:15am

Well, actually it has nothing to do with serving models for now. I have a small GraphQL endpoint made available through Serve. Ray is running on Kubernetes and the cluster node resources are very limited (2 CPUs / 8 GB RAM), but I am able to spawn as many nodes as needed. This means that the Ray head lives on one of these nodes and its CPU count is 2. Just deploying the endpoint takes up those 2 CPUs: 1 for the proxy and another for the backend.

Since the endpoint usage is light and not very regular, making its resource requirements smaller seemed like a good option. I managed to reduce the backend CPU requirement but not the proxy's, hence my question here.

simon-mo · February 16, 2021, 9:44pm

Right… I don’t see a clear path to customizing the resource usage of the HTTPProxy actor, though, because it can vary so much. The proposed solution is to:

keep your k8s pod asking for 2 CPUs
but tell Ray via the init argument that it has more than 2 CPUs, so it will spawn more than 2 workers.
In this case, the 2+ worker processes (including the HTTP proxy) will multiplex on the 2-CPU pod.
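To make the bookkeeping concrete, here is a minimal sketch of the logical CPU accounting for the 2-CPU head node described in this thread. It is plain Python arithmetic, not Ray code; remaining_cpus is a hypothetical helper, and the figure of 4 advertised CPUs is just an example value.

```python
def remaining_cpus(advertised_cpus: float, reservations: dict) -> float:
    """Logical CPUs left after each actor reserves its share (illustrative)."""
    return advertised_cpus - sum(reservations.values())


# Reservations as described above: 1 CPU for the proxy, 1 for the backend.
reservations = {"HTTPProxyActor": 1.0, "backend": 1.0}

# Advertising the pod's real 2 CPUs leaves nothing for other work.
print(remaining_cpus(2, reservations))  # 0.0

# Advertising 4 logical CPUs on the same pod leaves 2 free for scheduling.
print(remaining_cpus(4, reservations))  # 2.0
```

The pod still has only 2 physical CPUs in both cases; the larger number only changes what the scheduler believes is available, so all processes share the same cores.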
