How severely does this issue affect your experience of using Ray?
- High: It blocks me from completing my task.
Hello,
I am evaluating Ray Serve for the company I work for. In my test scenario, we run a Ray Serve application with 3 replicas (deployed as a KubeRay RayService) and then kill the head node.
I expected the worker node to keep serving the Ray Serve application while the head node recovered.
Instead, the Ray Serve application stopped working. The head node came back, but it no longer had the Ray Serve application on it. Is that expected?