Is the data transfer through Ray client as efficient as inside a Ray cluster?

Hanyu · August 13, 2022, 2:31am

It seems that data transfer through the Ray client goes through a gRPC call, whereas transfer inside a Ray cluster uses multiple gRPC connections to maximize throughput (I saw this from here).
Additionally, is it possible that the Ray client introduces extra memory copies or serialization steps compared to transfer inside a Ray cluster? I know that the Plasma Object Store implements zero-copy reads for numpy arrays.
So I’m wondering whether there is a difference in efficiency between the two. Do you have any analysis or benchmarks on this?
Thanks!

Chen_Shen · August 15, 2022, 4:06pm

Hi @Hanyu,

Exactly as you mentioned: the Ray client is not as efficient as object transfer inside a Ray cluster, and it does incur extra copies and serialization.
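
To make the "extra copy" point concrete, here is a minimal standard-library sketch of the general difference between serializing a buffer in-band (its bytes are copied into the stream) and passing it out-of-band with pickle protocol 5 (only a reference travels in the stream). It illustrates the concept only. It is not the Ray client's code path and it measures nothing about Ray; the 1 MiB payload is an arbitrary assumption.

```python
import pickle

# Hypothetical 1 MiB payload standing in for a large array's memory.
payload = bytearray(b"x" * (1024 * 1024))

# In-band: the payload bytes are copied into the pickle stream itself.
in_band = pickle.dumps(payload, protocol=5)

# Out-of-band: the stream only records "take the next buffer";
# the buffer itself is handed to buffer_callback without being copied.
buffers = []
out_of_band = pickle.dumps(
    pickle.PickleBuffer(payload),
    protocol=5,
    buffer_callback=buffers.append,
)

# The receiver must be given the same buffers to rebuild the object.
restored = pickle.loads(out_of_band, buffers=buffers)

print("in-band stream size:    ", len(in_band))
print("out-of-band stream size:", len(out_of_band))
print("restored buffer size:   ", memoryview(restored).nbytes)
```

The in-band stream is at least as large as the payload, while the out-of-band stream stays a few bytes long. Any transport that has to push the full payload through an extra serialized message pays for a copy like the first one.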

This is because the Ray client is designed for submitting a script to, or doing interactive development against, a remote cluster, where network quality is unpredictable and performance is not a high-priority requirement.

If performance and overhead matter to you, I’d suggest not using the Ray client. Instead, either connect to Ray directly on the head node or use Ray’s job submission (see "Ray Job Submission" in the Ray 1.13.0 docs).
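
A rough sketch of those two alternatives, in case it helps. The helper names (`dashboard_url`, `run_driver_on_head_node`, `submit_as_job`), the script name `my_script.py`, and the host are made up for illustration. The sketch assumes the dashboard listens on its default port 8265 and that the driver in the first function is started on the head node itself.

```python
def dashboard_url(host: str, port: int = 8265) -> str:
    """Build the HTTP address of the Ray dashboard (8265 is assumed as the default port)."""
    return f"http://{host}:{port}"


def run_driver_on_head_node() -> int:
    """Run this on the head node: the driver attaches to the local cluster directly."""
    import ray  # imported lazily so the helpers above work without Ray installed

    ray.init(address="auto")  # attach to the already running cluster on this machine

    @ray.remote
    def payload_size(data: bytes) -> int:
        return len(data)

    ref = ray.put(b"x" * 1024)  # stored in the cluster's object store
    return ray.get(payload_size.remote(ref))


def submit_as_job(host: str, entrypoint: str = "python my_script.py") -> str:
    """Ship a script to the cluster so the driver runs there, not on your laptop."""
    from ray.job_submission import JobSubmissionClient

    client = JobSubmissionClient(dashboard_url(host))
    # working_dir is uploaded to the cluster; "./" is a placeholder here.
    return client.submit_job(entrypoint=entrypoint, runtime_env={"working_dir": "./"})
```

In both cases the driver process runs inside the cluster, so objects move through the normal Ray object transfer path rather than through the Ray client connection.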

Hanyu · August 16, 2022, 3:15am

I see. Thank you so much!
