The easiest way to copy files (Tune results) from the head node?

HuangLED · November 24, 2021, 10:39pm

Hey, folks,

After AutoTune is done, the result log files are kept on the head node. We are planning to copy all the files to where the Ray client is, then kick off TensorBoard to inspect the results.

Is there such a utility function already, or will we need to implement an actor to do the copying?

Thanks.

amogkam · November 29, 2021, 9:46pm

Hey @HuangLED there is an undocumented ray rsync-down CLI command you can use (just type ray rsync-down --help for more info on how to use it).

You can also achieve the same thing via manual scp. I don’t think using an actor would work since the actor would be scheduled on the head node and not on the client side.

HuangLED · November 29, 2021, 10:06pm

Thanks Amog!

Is there a recommended API that does what rsync-down does? The reason is that I am integrating this step into a piece of code.

If not, I guess one can always use os.system().
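A minimal sketch of the shell-out approach, assuming the cluster was started with the cluster launcher and that `ray rsync-down` takes the cluster YAML, the remote source, and the local target as positional arguments (check `ray rsync-down --help` for your version). The file name `cluster.yaml` and both paths are hypothetical placeholders; `subprocess.run` is used instead of os.system() so that a failed copy raises an error.

```python
import subprocess


def build_rsync_down_cmd(cluster_yaml, remote_path, local_path):
    """Return the argv list for `ray rsync-down`.

    Assumed CLI shape: ray rsync-down CLUSTER_CONFIG_FILE SOURCE TARGET
    """
    return ["ray", "rsync-down", cluster_yaml, remote_path, local_path]


def rsync_down(cluster_yaml, remote_path, local_path):
    """Run the command on the client machine; raises CalledProcessError on failure."""
    subprocess.run(
        build_rsync_down_cmd(cluster_yaml, remote_path, local_path),
        check=True,
    )


if __name__ == "__main__":
    # Hypothetical values: replace with your own YAML file and directories.
    rsync_down("cluster.yaml", "~/ray_results/", "./ray_results/")
```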

amogkam · November 29, 2021, 10:18pm

You can do this programmatically using the Ray autoscaler SDK on the client side (ray.autoscaler.sdk.rsync(...)).

HuangLED · November 30, 2021, 3:51am

I have a follow-up question.

This sdk.rsync() API requires a cluster_config. Do I need to construct it myself? I searched the discussion group a bit but couldn’t find an answer. Or is there a handy mechanism to retrieve such a config (since at this point we are already connected to the cluster)?

amogkam · November 30, 2021, 4:03am

This method should be called on the laptop, not on the cluster itself.

The cluster_config should just be the path to the YAML file that you used for ray up.
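A rough sketch of what that call might look like from the laptop, assuming `ray.autoscaler.sdk.rsync` accepts the YAML path as its first argument plus keyword-only `source`, `target`, and `down` arguments (verify against the Ray version you have installed). The file name `cluster.yaml` and the directories are hypothetical placeholders.

```python
def rsync_kwargs(remote_dir, local_dir):
    """Keyword arguments for a download: with down=True the source is the
    path on the head node and the target is the path on this machine."""
    return {"source": remote_dir, "target": local_dir, "down": True}


def download_results(cluster_yaml, remote_dir, local_dir):
    """Copy remote_dir from the head node to local_dir.

    Must run on the client machine, not on the cluster.
    """
    # Imported lazily so the helper above can be used without Ray installed.
    from ray.autoscaler.sdk import rsync

    rsync(cluster_yaml, **rsync_kwargs(remote_dir, local_dir))


if __name__ == "__main__":
    # Hypothetical values: the YAML passed to `ray up` and example directories.
    download_results("cluster.yaml", "~/ray_results/", "./ray_results/")
```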

HuangLED · November 30, 2021, 6:42pm

OK. Thanks, Amog.

What if I used the command line to start the cluster? Is the config still a YAML file in this case?

amogkam · November 30, 2021, 7:06pm

By command line do you mean ray up? In that case, yes it’s just the path to the yaml file you used for ray up.

HuangLED · November 30, 2021, 7:11pm

By command line, I mean that I ran “ray start --head” on the head machine, then the corresponding command on the non-head machines.

During the process, I am not explicitly pointing to any YAML file (or at least the whole process is opaque to me; I am not sure which one is used).

HuangLED · December 1, 2021, 12:03am

@amogkam

I’ve read the section about yaml file here: Config YAML and CLI Reference — Ray v1.8.0

Our particular use case, though, is not using any cloud-native solution at all. We just manually started a cluster on raw machines and keep using it. In this case, does the YAML file still apply?

HuangLED · December 1, 2021, 12:09am

I dug into the docs a bit more. Is this template meant for local mode, and does it fit my use case? ray/example-full.yaml at master · ray-project/ray · GitHub

amogkam · December 1, 2021, 12:19am

@HuangLED ohhh got it. Ok if you are manually starting the ray cluster and not using the ray cluster launcher, then the autoscaler sdk won’t be useful here.

I would just use the subprocess module, for example, to execute an rsync command, and add this to your Python script. This is what ray.autoscaler.sdk.rsync is doing under the hood anyway.
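A minimal sketch of that subprocess approach for a manually started cluster, assuming rsync is installed on both machines and the client can SSH to the head node without a password prompt. The user name, IP address, and directories below are hypothetical placeholders.

```python
import subprocess


def build_rsync_cmd(user, head_ip, remote_dir, local_dir):
    """Return the argv list that pulls remote_dir from the head node.

    -a preserves the directory tree and timestamps, -v lists transferred
    files, -z compresses data in transit.
    """
    return ["rsync", "-avz", f"{user}@{head_ip}:{remote_dir}", local_dir]


def pull_results(user, head_ip, remote_dir, local_dir):
    """Run rsync on the client machine; raises CalledProcessError on failure."""
    subprocess.run(build_rsync_cmd(user, head_ip, remote_dir, local_dir), check=True)


if __name__ == "__main__":
    # Hypothetical values: replace with your head node's user, address, and paths.
    pull_results("ubuntu", "10.0.0.1", "~/ray_results/", "./ray_results/")
```

The trailing slash on the remote directory tells rsync to copy its contents rather than the directory itself; TensorBoard can then be pointed at the local copy.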
