Serving model via Ray Serve vs FastAPI on ECS

Wilfredc · August 12, 2024, 5:31pm

We have been evaluating Ray Serve and FastAPI for serving models in AWS ECS. The performance for running a single app in a single ECS node, Ray Serve lags behind FastAPI a lots (in terms of requests per second and response time).

In which use cases, Ray Serve outperforms FastAPI when serving a single or 2 models in a single application?

Thank. you

Topic		Replies	Views
Ray Serve with vs without FastAPI Ray Serve	3	1672	March 4, 2021
Ray Serve with FastAPI slowing down performance Ray Serve	1	487	July 19, 2023
FastAPI vs Ray FastAPI performance	3	146	July 25, 2024
How to Scale Up Your FastAPI Application Using Ray Serve	0	749	December 8, 2020
Ray with FastAPI Ray Core	1	794	December 24, 2023

Serving model via Ray Serve vs FastAPI on ECS

Related topics