[Ray Serve] using GRPC and DAG to host multiple models(or actors) in the same deployment

nihal · January 28, 2023, 9:15am

Hello Ray team, I was looking at the GRPC ingress (here) and I understand that it is still in alpha and I was just wondering if it is possible to use that with deployment graph (like this)?
If so, I am assuming we would be able to autoscale the different models independently?
If not, is this something that is in the roadmap for the future?

Sihan_Wang · January 30, 2023, 8:18pm

Hi @nihal , there shouldn’t have blockers to be used as deployment graph with current API, you can still follow the same idea as the doc suggested. Do you give a try? Please let me know if you have issues about it.

nihal · February 2, 2023, 2:11pm

Hi @Sihan_Wang
Yes, I had a misunderstanding regarding the deployment class in case of GRPC ingress. So now I was able to get it working, but I am not able to figure out what to give in import_path in the serve config file to deploy in a ray cluster using KubeRay. Could you please help?

Sihan_Wang · February 2, 2023, 5:57pm

Hi @nihal , glad to hear it worked from your side!

For import path, that is the package path for your deployment. Basically you should be able to get the import path by using serve build cli. Serve Config Files (serve build) — Ray 2.2.0

Topic		Replies	Views
Making model accesible across the nodes on Ray Serve - how Ray Serve	3	398	November 10, 2023
Production best practices for Ray Serve Ray Serve	6	1164	August 15, 2023
How to run multiple deployments in ray serve 2.0 Ray Serve	10	2406	December 13, 2022
Model composition with serve deployments, why?	0	303	April 1, 2023
[Serve] New API not as good as old one for programmatic deployment	0	311	October 5, 2022

[Ray Serve] using GRPC and DAG to host multiple models(or actors) in the same deployment

Related topics