How to run multiple deployments in ray serve 2.0

puntime_error · October 4, 2022, 4:15pm

When I was converting from ray 1.11 to 2.0, I was still able to deploy multiple models like this:

Model1.options(init_args=model1_args, num_replicas=1, name="model1").deploy()

Model2.options( init_args=model2_args, num_replicas=num_reps, name="model2", ray_actor_options={"num_gpus": 1} ).deploy()

That being said, I was making direct calls on the model and not via the url served endpoints.

I did eventually convert over to use the DAG just to see how to works

Topic		Replies	Views
Multiple Serve instances on a ray cluster with serve REST API Ray Serve	3	639	November 30, 2022
Deploy, delete and use deployments in Ray Serve 2.0.0 Ray Serve	8	2018	December 13, 2022
Automating the serving of many different models Ray Serve	8	1700	May 3, 2023
Production best practices for Ray Serve Ray Serve	6	1182	August 15, 2023
Ray Serve FastAPI Recommended Approach Ray Serve	1	1287	August 10, 2021