Ray cluster uses only Head node

kpavel · June 23, 2021, 6:21pm

Hello, my cluster spawned successfully, but for some reason it uses only the head node when running tasks.

I didn't notice any relevant errors in the logs.

The head node was spawned with:
ulimit -n 65536; ray start --head --port=6379 --object-manager-port=8076 --autoscaling-config=~/ray_bootstrap_config.yaml --dashboard-host=$RAY_HEAD_IP
And the workers with:
ulimit -n 65536; ray start --address=$RAY_HEAD_IP:6379 --object-manager-port=8076

Please point out what I am missing. Thanks.

architkulkarni · June 25, 2021, 6:22pm

Hi @kpavel, can you provide some more details about what tasks you're running? If you're running more than 2 tasks in parallel, I would expect the tasks beyond the 2nd task to be scheduled on the other two nodes. For example:

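import ray
import time

# Connect to the already running cluster.
ray.init(address="auto")

@ray.remote
def f(i):
    # Sleep so that the tasks overlap in time.
    time.sleep(30)
    return i

# Launch 6 tasks in parallel and wait for all of the results.
futures = [f.remote(i) for i in range(6)]
print(ray.get(futures))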

If you run this on the head node, do you see the tasks show up in the dashboard?
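
Besides the dashboard, one way to check placement is to have each task report the host it ran on. The following is only a sketch and not part of the original exchange: `where_am_i`, `count_by_host`, and `run_placement_check` are hypothetical names, and it assumes the script is run on a node of a cluster that is already started.

```python
import socket
import time
from collections import Counter


def count_by_host(hostnames):
    """Count how many tasks reported each hostname."""
    return dict(Counter(hostnames))


def run_placement_check(num_tasks=6):
    """Launch sleeping tasks and count which hosts ran them.

    Assumption: a Ray cluster is already running and this is
    called on one of its nodes.
    """
    import ray  # imported lazily so count_by_host works without Ray

    ray.init(address="auto")

    @ray.remote
    def where_am_i(i):
        time.sleep(5)  # keep the task busy so the tasks overlap in time
        return socket.gethostname()

    hosts = ray.get([where_am_i.remote(i) for i in range(num_tasks)])
    return count_by_host(hosts)
```

If `run_placement_check()` returns a dictionary with a single key, every task ran on one machine.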

kpavel · June 27, 2021, 1:26pm

Hello @architkulkarni. Thanks, we can close the issue.
It appears that when you use ray.init() without address="auto", it schedules tasks on the head node only.
I didn't see it documented anywhere.

architkulkarni · June 28, 2021, 5:44pm

Ah yeah, that’s correct: Starting Ray — Ray v2.0.0.dev0. We definitely want to make our documentation crystal-clear, so any edits or suggestions would be very welcome!
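
A quick way to confirm that a script attached to the existing cluster is to list the nodes Ray can see after `ray.init(address="auto")`. This is a minimal sketch, not an official recipe: `alive_node_addresses` and `list_cluster_nodes` are hypothetical helper names, and the code assumes it runs on a node of a cluster that is already started.

```python
def alive_node_addresses(nodes):
    """Return sorted IP addresses of the nodes reported as alive.

    `nodes` is a list of dicts in the shape returned by ray.nodes().
    """
    return sorted(
        node["NodeManagerAddress"] for node in nodes if node.get("Alive")
    )


def list_cluster_nodes():
    """Connect to the running cluster and list its alive nodes.

    Assumption: called on a machine that is part of the cluster.
    """
    import ray  # imported lazily so the helper above works without Ray

    ray.init(address="auto")
    return alive_node_addresses(ray.nodes())
```

If `list_cluster_nodes()` returns only the head node's address, the workers have not joined the cluster or the script is not connected to it.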
