Question about Ray for HPC

baran · March 19, 2021, 11:13am

Is Ray developed for doing shared memory computing or can I use it for distributed memory computing ? (there will be multiple nodes)

If you have any experience with MPI:
Can it be used as an alternative for Message Passing Interface (MPI)? If I develop my program with Ray, would its speed be significantly slower than an equivalent MPI program? Or will it be similar? Is Ray a suitable tool for high performance computing?

Alex · March 20, 2021, 11:14pm

Is Ray developed for doing shared memory computing or can I use it for distributed memory computing ? (there will be multiple nodes)

Both. Ray’s object store is distributed memory, but we implement the shared memory optimization whenever possible. If you call ray.get on a large tensor, it will get transferred once per node, then multiple workers on the same node will access the same object in shared memory. See https://docs.ray.io/en/master/serialization.html#numpy-arrays for more details.

Can it be used as an alternative for Message Passing Interface (MPI)?

Ray is meant to provide a simpler API than MPI, but given a well designed program, you should be able to achieve similar performance for many workloads.

baran · March 21, 2021, 11:17am

Thank you for the reply.

Topic		Replies	Views
Does Ray support multi node message passing like MPI? If so does it support HPC schedulers like Slurm or PBS? Ray Core	0	361	August 23, 2021
Multiple Ray instances on one node accessing shared memory Ray Core	2	973	November 30, 2022
Question on scalability Ray Core	1	291	August 3, 2021
Ray.util.collective uses for what circumstance? Ray Core	4	316	May 27, 2022
Working with a large dataset Ray Core	2	1086	December 16, 2021

Question about Ray for HPC

Related topics