Difference between remote calls and multiprocessing in Ray

gstepan · January 29, 2021, 6:56pm

Just had some quick questions about the difference between running Ray through separate processes with remote functions and running it through the multiprocessing Pool. Why does the multiprocessing Pool not use remote functions? Also, why does the multiprocessing Pool only allow me to run as many processes as the total number of CPUs across the allocated cluster nodes, while I can spin up hundreds of processes in a for loop with calls to a remote function?

Alex · January 29, 2021, 7:43pm

They’re pretty much the same! Ray’s multiprocessing API is really just a thin wrapper around remote functions/actors.

The main reason we have the multiprocessing API is just to make it super easy to move your multiprocessing code over.
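A minimal sketch of what moving over can look like, assuming your code only uses the common `Pool` methods. The names `square` and `run_pool` are hypothetical. The import falls back to the standard library when Ray is not installed, just to show that the rest of the code stays the same either way.

```python
try:
    # Ray's drop-in replacement for the standard-library Pool.
    from ray.util.multiprocessing import Pool
except ImportError:
    # Fallback so the sketch still runs without Ray installed.
    from multiprocessing import Pool


def square(x):
    return x * x


def run_pool():
    # processes=2 is an arbitrary choice for this example.
    pool = Pool(processes=2)
    try:
        # map returns the results in input order.
        return pool.map(square, range(5))
    finally:
        # Stop accepting work, then wait for the workers to finish.
        pool.close()
        pool.join()


if __name__ == "__main__":
    print(run_pool())
```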

If you’re not super tied to the multiprocessing API, you could also consider using a Ray utility like the actor pool (ActorPool) instead; see "Using Actors" in the Ray v2.0.0.dev0 docs.
