Ray clusters are deployed on several virtual machines (Debian 11 Linux, Python 3.9.2, identical Ray versions). The trained model is deployed on these clusters. I can send a request to localhost:8000 from inside a virtual machine, but I can't send a request from my own machine to the head node. The question is: how do I open the Ray Serve server to external requests?
I also tried to send a remote request like this:
And again got nothing back, only this warning:
WARNING long_poll.py:149 -- LongPollClient connection failed, shutting down.
@GolikovAndrey You will probably need to put an NGINX or FastAPI proxy in front and have it route to the Serve deployment.
cc: @Sihan_Wang do we have a best practice example in the doc how to expose Ray Serve deployment as external services on OSS Ray?
Hi @GolikovAndrey, to rule out a network problem, have you ever successfully sent requests from your machine to the other virtual machine? (E.g. AWS has default network port rules that block traffic; the user has to open a specific port to receive it.)
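One way to rule out a network problem from the client machine is a plain TCP connectivity check, independent of Ray. A minimal sketch (the host and port are placeholders for your head node's address and Serve's HTTP port):

```python
import socket

def port_is_reachable(host: str, port: int, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to (host, port) can be opened."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False

# Example (placeholder address): port_is_reachable("xxx.xx.xxx.xx", 8000)
```

If this returns False from your machine but True from inside the VM, the problem is the bind address or a firewall rule, not Ray Serve itself.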
Thanks for the answers, @Jules_Damji and @Sihan_Wang!
We don’t work with cloud services like AWS or GCP, our system admins deploy virtual machines on our servers.
I found the solution to my problem but forgot to post it here. It doesn't matter how you start Ray Serve (via the CLI or the Python API); you can specify the --http-host and --http-port parameters. This opens the server to external requests.
serve start --http-host "xxx.xx.xxx.xx" --http-port xxxx
The host is a string, the port is an integer.
from ray import serve
You can also specify the location parameter, which deploys the HTTP proxy either on the head node only or on every node in your cluster.
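For reference, a sketch of the Python-API equivalent of the CLI command above. The option names follow the Ray Serve 1.x-era http_options (host, port, location); check the docs for your exact Ray version, and note the host/port values here are placeholders:

```python
# http_options as passed to serve.start() (Ray Serve 1.x-era names, placeholder values).
# "0.0.0.0" binds the HTTP proxy to all network interfaces instead of localhost only,
# which is what makes the server reachable from outside the VM.
http_options = {
    "host": "0.0.0.0",
    "port": 8000,
    "location": "EveryNode",  # or "HeadOnly" to run the proxy on the head node only
}

# In a live cluster you would then run:
#   from ray import serve
#   serve.start(detached=True, http_options=http_options)
```

Binding to "0.0.0.0" (rather than the default localhost) is the key change; a firewall rule still has to allow inbound traffic on the chosen port.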
Thank you for your attention!
@GolikovAndrey Excellent. You could also use a proxy for auth if you wanted and then have it route to the appropriate Ray Serve deployment.
Should we consider this resolved then?
Happy to help (HTH)
Yes, we can consider this problem solved.
Excellent, and I hope it all works out for you.