GIl and race conditions with Ray multiprocessing

Wall-E · April 26, 2023, 3:45pm

I am parallelizing functions that read the same 3D numpy array (a series of images) using Ray multiprocessing Pool().

I share the 3D array images with ray.put() as follows:

images_ref = ray.put(images)
myfunc_partial = functools.partial(myfunc, obj_ref=images_ref, arg2=blah2)

with Pool(processes=ncpus) as pool:
   pool.map(myfunc_partial, params_list)  #params_list has at least 10 elements

Insise myfunc_partial, images is processed after calling images = ray.get(obj_ref)

First of all, is that the proper way to share my 3D Numpy array across the 10 processes so that there are no copies of the array?

If so, is the GIL stepping in by preventing the mutliple processes to read the shared-memory array at the same time?

sangcho · April 27, 2023, 3:35pm

GIL is only applied within the same process. Ray’s Shared memory objects are read-only, so it doesn’t require any explicit lock. So all the processes can concurrently access the buffer without any synchornization

Topic		Replies	Views
Ray Serve Parallelism Python GIL vs Java Ray Serve	2	1067	August 5, 2021
Loop multiprocessing and shared memory using ray Ray Core	1	632	March 10, 2021
Multiprocess safe Shared object Ray Core	5	458	September 29, 2021
Question about multiprocessing large array using ray.remote Ray Core	4	975	May 10, 2022
Shared numpy array Ray Core	1	1249	July 19, 2022

GIl and race conditions with Ray multiprocessing

Related topics