How severe does this issue affect your experience of using Ray? High: It blocks me to complete my task. I slurm ran ray start --head, and the slurm gave me a node and showed the ray has started. As the picture blow shows: [image] but then I ran ray status, it showed me blow: [image] I wrot…

Thank you cade for helping me. [image] cade: Does the node on which you are running the slurm script (ray start --head and ray status) have multiple network interfaces? Yes, the node does has multiple network interfaces. [image] cade: To verify that this is the case, can you reproduce t…

“ray start --head” succeed but "ray status" could not find any running ray instance

Ray Clusters

cade June 15, 2022, 9:34pm 6

or maybe Raylet errors some worker have not registered within the timeout ?

Topic		Replies	Views
Ray on Slurm: shutdown throws errors Ray Clusters	15	921	June 16, 2022
No GPUs available when using slurm-template.sh to launch SLURM Ray cluster Ray Core	0	138	June 8, 2024
Ray on SLURM/HPC: starting worker nodes simultaneously Ray Clusters	10	2002	June 15, 2022
Serve CLI not working into ray==2.3.0 Ray Serve	3	651	May 2, 2023
Running ray on supercomputer with slurm Ray Core	4	704	August 4, 2021

“ray start --head” succeed but "ray status" could not find any running ray instance

Related topics