Continuous Delivery with Ray Serve

Abaschkim · April 21, 2021, 3:41pm

Hi everyone. I have setup a ray cluster on a remote microk8s cluster, which also hosts a MLFlow model registry to store and track my models. To deploy my model to the cluster, I use ray serve to fetch my model from MLFlow (as here) and then and submit it to the cluster via ray submit
Now I would like to apply some basic continuous delivery. Ideally, a new model would be fetched from MLFlow and submitted to the cluster upon a commit to a master branch on GitHub.
Has anybody done something similar or could suggest some tools to facilitate this workflow?
Thanks!

architkulkarni · April 23, 2021, 4:58pm

Hi @Abaschkim, great question! The “best practice” here would probably be to have CI directly connect to the cluster via Ray Client Ray Client — Ray v2.0.0.dev0 and deploy the new endpoint. This is possible in Ray 1.3 which just got released! Other good options include running ray submit from CI, or making a Ray Serve endpoint that would trigger the operation of fetching/submitting, and then setting up a git hook to fire it off.

Abaschkim · April 24, 2021, 9:38am

Thanks for the reply @architkulkarni. Great news! I will look into that. I am currently using a Kubernetes job as mentioned in the docs with Jenkins to deploy a ray serve instance.

frascuchon · May 10, 2021, 10:24pm

Hi @Abaschkim mlflow deployment plugin for ray serve is a good starting point.

While the official plugin version doesn’t support remote deployment, there is an open PR that includes this behaviour.

Topic		Replies	Views
Deploying ML Model using Ray serve on K8s Ray Serve	1	1191	August 8, 2022
Ray serve on K8s Ray Serve	1	627	April 5, 2021
Ray Serve for production usecase Ray Serve	3	730	May 25, 2022
Ray serve on Kubernetes Ray Serve	14	944	March 27, 2024
Dynamically serve new model via Ray Serve Ray Serve	5	90	June 11, 2025

Continuous Delivery with Ray Serve

Related topics