Ray Serve LLM application

Hi,

I want to serve my LLM model with Ray Serve.

It is probably a basic problem, but I am stuck on it.

my app2.py:

from ray import serve
import starlette
from starlette.responses import JSONResponse


@serve.deployment(route_prefix="/forecast")
class Ray_llm:
    async def __call__(self, request: starlette.requests.Request):
        # Parse the multipart form from the incoming Starlette request
        # (file uploads need the python-multipart package installed).
        form = await request.form()

        if "file" not in form:
            return JSONResponse({"error": "No file part in the request"}, status_code=400)

        file = form["file"]
        if file.filename == "":
            return JSONResponse({"error": "No selected file"}, status_code=400)

        query_text = form.get("query", None)
        if not query_text:
            return JSONResponse({"error": "No query text provided"}, status_code=400)

        response = send_to_llm(file, query_text)
        return response


def send_to_llm(file, query_text):
    response = llm_caller(file, query_text)  # Send to my model
    return response


app = Ray_llm.bind()
serve.run(app, port=8081)

Dockerfile:

FROM python:3.8-slim

WORKDIR /app

COPY requirements.txt .

RUN pip install --no-cache-dir -r requirements.txt
RUN pip install "ray[serve]"

EXPOSE 8081

COPY . .

CMD ["python", "app2.py"]

BuildAndRun.sh:

docker build -t llm_api .

docker run -p 8081:8081 llm_api

When I run BuildAndRun.sh, the service does not respond to POST requests. How can I solve this problem?
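
For reference, this is roughly how I am trying to call the service from the host machine (a minimal test-client sketch; the file name and query text are just placeholders):

import requests

# Hypothetical test client: POST a file plus a "query" form field
# to the deployment's /forecast route on the mapped host port.
url = "http://localhost:8081/forecast"

with open("sample_input.txt", "rb") as f:  # placeholder file
    resp = requests.post(
        url,
        files={"file": f},
        data={"query": "Summarize this document"},  # placeholder query
    )

print(resp.status_code, resp.text)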
