Integrating GradioIngress and non-gradio endpoints

tjedwards · August 9, 2023, 2:44pm

How severe does this issue affect your experience of using Ray?

Medium: It contributes to significant difficulty to complete my task, but I can work around it.

I’m new to Ray, but so far so good! Using the Ray Server doc examples I’ve built a “Driver” ingress deployment that dispatches (by URL) between two other deployments, one for “/embeddings” and one for “/inference”. It works well!

Driver.__init__() captures the deployments, and Driver.__call__() accepts a Request and dispatches between them based on request.url.path, e.g.:

        if request.url.path.startswith('/embeddings'):
            ref = await self.embedding.classify.remote(req)
        ...
        return await ref  #produces a Response

I’d now like to serve a Gradio page under the “/chat” url. The standalone deployment works like a charm:

def gradio_builder():
    def respond(message, user_bot_history):
        return random.choice(["Yes", "No"])
    iface = gr.ChatInterface(respond)
    #without this there is a constant polling loop of some kind
    iface.config["dev_mode"] = False
    return iface

@serve.deployment(
    #route_prefix="/chat",
    ray_actor_options={"num_cpus": 0},
    autoscaling_config={"min_replicas": 1, "max_replicas": 1},
)
class MyGradioServer(GradioIngress):
    def __init__(self):
        super().__init__(gradio_builder)

deploy = MyGradioServer.bind()

Now, finally, my question; How do I combine this deployment with my Driver ingress deployment such that it supports “/embeddings”, “/inference”, and “/chat” from the same server?

yic · August 17, 2023, 7:54pm

@shrekris could you take a look at this?

sangcho · August 21, 2023, 1:29pm

I moved the questions to the serve channel.

Topic		Replies	Views
Move request (wrong category)	0	286	August 9, 2023
[Ray Serve] using GRPC and DAG to host multiple models(or actors) in the same deployment Ray Libraries (Data, Train, Tune, Serve)	3	407	February 2, 2023
Ray serve with dynamic deployments Ray Libraries (Data, Train, Tune, Serve)	0	545	September 23, 2022
How to run multiple deployments in ray serve 2.0 Ray Serve	10	2291	December 13, 2022
Help designing fire and forget server for large batch inference Ray Serve	7	696	November 30, 2023

Integrating GradioIngress and non-gradio endpoints

Related topics