Restart of raylet


When a raylet crashes, does the current Ray system restart it so that workers can continue executing the computation? This could lead to better cluster utilization and shorter job finish times.


Hi @asm582, the raylet will be restarted if you are using the Ray cluster launcher (see "Ray Cluster Overview" in the Ray v1.4.1 docs), which monitors the health of the Ray cluster.

Hi @simon-mo, thanks. Does this mean that the computation continues on the same node, with only the raylet process being restarted, rather than the entire computation being re-run on a different node?

In Ray 1.4, I do not see the raylet restarted after I kill it manually. Can you please confirm?
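
For anyone who wants to reproduce this check, here is a minimal sketch of how one might watch for a replacement raylet process after killing one. It does not use any Ray API. It only parses `ps` output, and it assumes a Linux node where the process command name is exactly `raylet`. The helper names (`raylet_pids`, `snapshot`, `watch`) are made up for this illustration.

```python
import subprocess
import time
from typing import Set


def raylet_pids(ps_output: str) -> Set[int]:
    """Return PIDs of processes whose command name is exactly 'raylet'.

    Expects text in the format produced by `ps -eo pid,comm`.
    """
    pids = set()
    for line in ps_output.splitlines():
        parts = line.split(None, 1)
        # Skip the header row and any malformed lines.
        if len(parts) == 2 and parts[0].isdigit() and parts[1].strip() == "raylet":
            pids.add(int(parts[0]))
    return pids


def snapshot() -> Set[int]:
    """Return the raylet PIDs currently visible on this machine."""
    out = subprocess.run(
        ["ps", "-eo", "pid,comm"],
        capture_output=True,
        text=True,
        check=True,
    ).stdout
    return raylet_pids(out)


def watch(before: Set[int], timeout_s: float = 60.0, interval_s: float = 2.0) -> Set[int]:
    """Poll until a raylet PID that is not in `before` appears.

    Returns the set of new PIDs, or an empty set if the timeout expires.
    """
    deadline = time.monotonic() + timeout_s
    while time.monotonic() < deadline:
        new = snapshot() - before
        if new:
            return new
        time.sleep(interval_s)
    return set()
```

A possible way to use it: call `before = snapshot()`, kill one of the listed PIDs by hand, then call `watch(before)`. An empty result after the timeout suggests that nothing restarted the raylet on that node within the polling window.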