Does Ray Serve support local model hot update/reload?

Japson · July 5, 2022, 2:53am

That is, I can smoothly update the deployed models to the latest version without having to stop the online service. And supports rollback operations

Just like Tensorflow Serving

Sihan_Wang · July 5, 2022, 3:35am

Hi Japson,

Currently ray serve doesn’t have the functionality to support the model load/rollback/version control.

(To unblock you) For new version model update, can you directly trigger another deployment (new model) and shit traffic and remove the previous deployment (old model)?

shrekris · July 5, 2022, 6:48pm

Hi @Japson, as @Sihan_Wang mentioned, you can deploy your updated model to new Serve deployments on a fresh Ray cluster, and then you can shift traffic to it once the deployments are running.

Additionally, Serve does a rolling update when you update your live deployments on existing Ray cluster. In this case

Serve tears down some replicas from your old deployment.
Serve replaces them with replicas from your new deployment.
Serve repeats this with some more replicas from your old deployment until all replicas have updated.

During this time, your service is still live, but some requests may be handled by outdated replicas. This might also be a viable option.

Topic		Replies	Views
Best practice for loading deep learning models in production on Ray serve Ray Serve	4	833	October 27, 2022
How to preserve state of ray serve on ray cluster restart? Ray Serve	0	442	May 4, 2021
Automating the serving of many different models Ray Serve	8	1683	May 3, 2023
Dynamically serve new model via Ray Serve Ray Serve	5	77	June 11, 2025
Continuous Delivery with Ray Serve Ray Serve	3	1047	May 10, 2021

Does Ray Serve support local model hot update/reload?

Related topics