How to keep the data in GPU memory after remote call?

zjacob · March 17, 2021, 5:49am

I have a large data array in GPU memory, and at each call of a remote function I would like to replace one row of it while leaving all the other rows unchanged. The GPU kernel is launched from a Ray remote function. However, after each call the data array is reset in GPU memory, so the rows replaced in earlier calls are all zero.

import ray
from numba import cuda
import numpy as np

ray.init(ignore_reinit_error=True)

# copy newly calculated 'new' to 'temp' (on GPU memory) in row ii
@cuda.jit
def copy_(length, new, temp, ii):
    i = cuda.grid(1)

    # loop through each spatial grid
    if i < length:
        temp[ii, i] = new[i]

@ray.remote(num_gpus=1)
def solver(length, new, ii):
    temp = cuda.device_array([5, length])
    new = cuda.to_device(new)

    # Configure the blocks
    threadsperblock = 32

    # configure the grids
    blockspergrid = (length + (threadsperblock - 1)) // threadsperblock

    copy_[blockspergrid, threadsperblock](length, new, temp, ii)

    return temp.copy_to_host()

length = 10

new = np.arange(length)

# specify the index to replace, if next time ii = 1, the copied ii=0 row will be zero
ii = 0
re = solver.remote(length, new, ii)
ray.get(re)

The code above resets the data array on every call, because ‘temp’ is allocated again each time the remote function runs. I want the array ‘temp’ to stay in GPU memory between calls.

If I instead pass the data array ‘temp’ to the remote function as an argument, the call fails because the device array cannot be pickled.

Is there a way to do this?
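
One possible direction, sketched here under assumptions rather than taken from the thread: keep ‘temp’ inside a Ray actor. An actor is a stateful worker process, so the device array can be allocated once and reused by later calls, and only the host array ‘new’ and the row index cross the process boundary. The names TempHolder, replace_row, fetch and run_with_ray are hypothetical, and the NumPy branch is only a stand-in so the sketch can be tried on a machine without numba or a GPU.

```python
import numpy as np

try:
    from numba import cuda
except ImportError:  # numba is optional for trying this sketch
    cuda = None

if cuda is not None:
    # Same kernel as in the question: copy 'new' into row ii of 'temp'.
    @cuda.jit
    def copy_(length, new, temp, ii):
        i = cuda.grid(1)
        if i < length:
            temp[ii, i] = new[i]


class TempHolder:
    """Hypothetical holder that owns 'temp' for its whole lifetime."""

    def __init__(self, rows, length):
        self.length = length
        self.on_gpu = cuda is not None and cuda.is_available()
        host = np.zeros((rows, length))
        # Allocate once; every later call reuses this same buffer.
        self.temp = cuda.to_device(host) if self.on_gpu else host

    def replace_row(self, new, ii):
        if self.on_gpu:
            threadsperblock = 32
            blockspergrid = (self.length + threadsperblock - 1) // threadsperblock
            d_new = cuda.to_device(new)
            copy_[blockspergrid, threadsperblock](self.length, d_new, self.temp, ii)
        else:
            self.temp[ii, :] = new  # CPU stand-in for the kernel
        return ii  # only a small value goes back to the caller

    def fetch(self):
        # Copy to the host only when the caller actually asks for the data.
        return self.temp.copy_to_host() if self.on_gpu else self.temp.copy()


def run_with_ray():
    """Wrap the holder as a Ray actor; assumes ray is installed and a GPU is free."""
    import ray

    ray.init(ignore_reinit_error=True)
    Holder = ray.remote(num_gpus=1)(TempHolder)
    holder = Holder.remote(5, 10)
    new = np.arange(10)
    ray.get(holder.replace_row.remote(new, 0))
    ray.get(holder.replace_row.remote(new + 1, 1))
    return ray.get(holder.fetch.remote())
```

Calling run_with_ray() should return an array in which rows 0 and 1 both hold the values written by the two separate calls, since both calls run in the same actor process and write into the same buffer. Whether this fits a given workload depends on how many GPUs and actors are involved, so treat it as a starting point.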

Dmitri · March 18, 2021, 2:51am

cc @Clark_Zinzow @ericl
