Zero-copy deserialization with recursive dictionaries/lists

Drew_Barclay · April 25, 2021, 10:56am

Hi there,

I’m curious about the limits of Ray’s zero-copy deserialization. My current understanding is that if a function returns a large numpy array (over 100KB by default), it is put in the Plasma object store so that other executing tasks can use it without copying any memory.

Does this guarantee apply to Numpy arrays stored in dictionaries? What about dictionaries of dictionaries?

As per the docs at Serialization — Ray v2.0.0.dev0, "You can often avoid serialization issues by using only native types (e.g., numpy arrays or lists/dicts of numpy arrays and other primitive types), or by using Actors to hold objects that cannot be serialized." But it’s not clear to me whether this covers nested (recursive) lists/dictionaries.

Would zero-copy deserialization work with the following? Or, how can I tell whether zero-copy deserialization occurred so I can test it myself?

a)

def a(x):
    return {'a': large_numpy_array(x)}

b)

def b(x):
    return {'a': {'b': large_numpy_array(x)}}

c)

def c(x):
    return {'a': [{'b': large_numpy_array(x)}], 'c': large_numpy_array(x)}

ericl · August 3, 2021, 8:58pm

Yep, Ray supports all of these cases with zero-copy. There’s an easy way to tell:

import numpy as np
import ray

# Store a nested dict in the object store, then read it back.
x_id = ray.put({"x": {"y": np.zeros(5)}})
x = ray.get(x_id)
print(x["x"]["y"].flags)
C_CONTIGUOUS : True
F_CONTIGUOUS : True
OWNDATA : False
WRITEABLE : False
ALIGNED : True
WRITEBACKIFCOPY : False
UPDATEIFCOPY : False

WRITEABLE : False is a sure sign that we’re using zero-copy memory from Plasma.
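
To apply the same check to every array in a nested result, such as cases a) to c) from the question, the flag inspection can be wrapped in a small recursive walker. This is only a sketch: `iter_arrays`, `looks_zero_copy`, `report`, and `check_with_ray` are hypothetical helper names, not part of Ray. The "read-only and does not own its buffer" test is a heuristic based on the flags shown above, so a read-only view created some other way would also pass it.

```python
import numpy as np


def iter_arrays(obj, path=""):
    """Yield (path, array) for every numpy array nested in dicts/lists/tuples."""
    if isinstance(obj, np.ndarray):
        yield path, obj
    elif isinstance(obj, dict):
        for key, value in obj.items():
            yield from iter_arrays(value, f"{path}[{key!r}]")
    elif isinstance(obj, (list, tuple)):
        for index, value in enumerate(obj):
            yield from iter_arrays(value, f"{path}[{index}]")


def looks_zero_copy(arr):
    """Heuristic: the array is read-only and does not own its buffer."""
    return (not arr.flags.writeable) and (not arr.flags.owndata)


def report(obj):
    """Map the path of each nested array to whether it looks zero-copy."""
    return {path: looks_zero_copy(arr) for path, arr in iter_arrays(obj)}


def check_with_ray():
    """Round-trip case c) from the question through the object store."""
    import ray  # imported here so the helpers above work without Ray installed

    ray.init(ignore_reinit_error=True)
    value = {"a": [{"b": np.zeros(1_000_000)}], "c": np.zeros(1_000_000)}
    return report(ray.get(ray.put(value)))
```

Calling `check_with_ray()` on a machine with Ray installed returns one entry per array path. If the behaviour described in this thread holds, each entry should be `True`, while `report` on the same dictionary before `ray.put` gives `False` for every path because freshly allocated arrays own their data.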
