Trying to run a cluster at home

Juffin_Hally · June 30, 2021, 10:40am

I have the following setup: behind a NAT, I have two machines, let’s call them the big machine and the small machine, both running ubuntu, and I can access both with the same username and each can ssh into the other. Both also have ray installed.
On the large machine, I run the coordinator_server.py script as recommended in the docs and using the ip of the small machine as the --ips arg.

I then create a config based on the minimal_automatic one, with the coordinator host and port from the previous step.

Next, i run ray up -y my_automatic_config.yaml, which eventually says that ray runtime is started.

Once all that is done, I run the testing script (from /cluster/quickstart.html in the docs) to check that all the nodes are available.
But it only shows the cluster consisting of one node, the large machine.

What am I doing wrong? Is there something I’ve missed in the docs, the tutorials, setting up the cluster, or anything else?

Topic		Replies	Views
How do you connect Ray Client to a cluster managed by a coordinator server? Ray Clusters	0	330	July 30, 2021
Worker nodes not available with manual configuration Ray Core	5	462	May 5, 2021
Ray cluster doesn't work, even connected well Ray Core	1	389	May 31, 2022
Ray cluster uses only Head node Ray Clusters	3	446	June 28, 2021
How to start a cluster with ray up and local provider?	1	405	December 13, 2023

Trying to run a cluster at home

Related topics