I’m really enjoying reading “Optimizing Prediction Serving on Low-Latency Serverless Dataflow” by Vikram Sreekanti et al. Paper: https://arxiv.org/pdf/2007.05832.pdf
The Dataflow API described in the paper (map, filter, and aggregate operators over a Table data structure) looks very interesting.
The more I read about its optimizations (operator fusion, co-location), the more it reminds me of the Ray Serve Deployment Graph API.
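To make the question a bit more concrete, here is a toy sketch of the style of API I have in mind. It is plain in-memory Python, not the paper's implementation and not anything that exists in Ray Serve; `Table`, `fuse`, `preprocess`, and `predict` are all names I made up for illustration, and "fusion" here is just function composition.

```python
from dataclasses import dataclass, field
from typing import Any, Callable, Dict, List

Row = Dict[str, Any]


@dataclass
class Table:
    """Toy in-memory table of dict rows (hypothetical, not the paper's class)."""
    rows: List[Row] = field(default_factory=list)

    def map(self, fn: Callable[[Row], Row]) -> "Table":
        # Apply fn to every row and return a new table.
        return Table([fn(r) for r in self.rows])

    def filter(self, pred: Callable[[Row], bool]) -> "Table":
        # Keep only the rows for which pred is true.
        return Table([r for r in self.rows if pred(r)])

    def aggregate(self, fn: Callable[[List[Row]], Row]) -> "Table":
        # Collapse all rows into a single summary row.
        return Table([fn(self.rows)])


def fuse(*fns: Callable[[Row], Row]) -> Callable[[Row], Row]:
    """Compose row functions into one, imitating operator fusion."""
    def fused(row: Row) -> Row:
        for f in fns:
            row = f(row)
        return row
    return fused


def preprocess(row: Row) -> Row:
    # Stand-in for a feature-scaling stage.
    return {**row, "x": row["x"] / 10.0}


def predict(row: Row) -> Row:
    # Stand-in for a model call.
    return {**row, "score": row["x"] * 2}


requests = Table([{"id": 1, "x": 5}, {"id": 2, "x": 30}, {"id": 3, "x": 45}])

# Two separate map stages versus a single fused stage: same rows out.
unfused = requests.map(preprocess).map(predict)
fused = requests.map(fuse(preprocess, predict))

result = fused.filter(lambda r: r["score"] > 2).aggregate(
    lambda rows: {"count": len(rows), "max_score": max(r["score"] for r in rows)}
)
print(result.rows)
```

In a real serving system each stage would presumably run as its own function or deployment, and fusing would be about avoiding the network hop between them; the sketch only shows the shape of the pipeline.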
- What is the Ray Serve team’s opinion about such a dataflow API over the Deployment Graph API?
- Is the Ray Serve team looking into something like this, and is it on the roadmap somewhere?
- Given the academic background of a lot of the Ray Serve team, are there other papers or ideas that the team is looking at instead? I’m just curious about what you are reading.