I need to run applications that require multiprocessing: some deep learning training, some inference, and some text processing, all of which currently run as web services in the same Kubernetes cluster. Each service needs to expose HTTP endpoints, but those endpoints must not be blocked by long-running processing (up to 30 minutes). My idea is therefore to run the long-running work in a different process from the one the HTTP server runs in.

We considered several options, but we are currently experimenting with a FastAPI app served by gunicorn; the app also initializes Ray in local mode and runs the long job in a remote task. In the future we will also need to optimize the processing and could use multiple workers, but for now one worker and one FastAPI app are enough.

This seems to work very well, but my concern is scale: we have several services, each running in a pod in the same K8s cluster. Running 10 services with 3 instances each means 30 pods, each with its own local-mode Ray. Is this going to cause any trouble? The alternative would be to run two processes with the standard Python API, but we would lose a lot of the benefits for parallel processing. Do you think we are going to have problems with this setup?
cc @Dmitri Can you follow up with him?
There’s no problem with using single-node Ray for multiprocessing in a K8s pod.