When I was converting from ray 1.11 to 2.0, I was still able to deploy multiple models like this:
Model1.options(init_args=model1_args, num_replicas=1, name="model1").deploy()
Model2.options( init_args=model2_args, num_replicas=num_reps, name="model2", ray_actor_options={"num_gpus": 1} ).deploy()
That being said, I was making direct calls on the model and not via the url served endpoints.
I did eventually convert over to use the DAG just to see how to works