Parallel Detectron2(Pytorch) inference with GPU

james811223 · May 25, 2022, 6:15pm

How severe does this issue affect your experience of using Ray?

High: It blocks me to complete my task.

Problem: I’m unable to parallelize a function.

What the function does:

Some stuff
Load image from AWS S3.
Preprocess image
Make inference with a detectron2 model. (GPU)
Apply rules to the output of the model inference.
Return data

Does anyone know how to parallelize this function?

Mingwei · May 25, 2022, 6:44pm

Do you want to have parallelized execution of many instances of this function over many images, or parallelize certain steps within this function?

Mingwei · May 25, 2022, 10:16pm

Assume the function takes an S3 URL, and returns data. You probably can parallelize running multiple functions on a list of S3 URLs (s3_url_list):

@ray.remote
def process_image(s3_url):
    ......

results = [process_image.remote(url) for url in s3_url_list]

james811223 · May 26, 2022, 12:57pm

I need to parallelized the whole function. I tried many different ways. The processes just don’t recognize the GPUs.

james811223 · May 26, 2022, 1:10pm

Here’s my code:

Imports .....

def fun_a_for_fun_to_parallel(...):
    ...

def fun_b_for_fun_to_parallel(...):
    ...

def fun_c_for_fun_to_parallel(...):
    ...

@ray.remote
def fun_to_parallel(...):
    stuff...
    function call with model inference...
    stuf...
    return ...

One of the other scripts being imported into main script above:

from detectron2.config import get_cfg
from detectron2.engine import DefaultPredictor
more imports

cfg = get_cfg()
more configs
model = DefaultPredictor(cfg)

def fun(...):
    stuff
    outputs = model(im)
    stuff
    return ...

I’ve also tried replicating the model and assigned them to different GPUs(cuda:0, 1, 2…), which didn’t work.

I tried setting ngpus to like 2, 3, 4… for ray init.

Mingwei · May 26, 2022, 6:31pm

Have you tried setting num_gpus=1 when converting a function to Ray remote function? e.g.

@ray.remote(num_gpus=1)
def fun_to_parallel(...):
    ...

More documentations are at GPU Support — Ray 1.12.1

Mingwei · May 26, 2022, 9:46pm

Another way to convert a function to Ray remote function, without @ray.remote decorator, is by calling ray.remote(func).options(num_cpus=xx).remote(args...)

james811223 · June 8, 2022, 3:00pm

I tried using Ray Serve, and it’s now working. Thanks @Mingwei for your help!

Sujit_Kumar · August 16, 2022, 1:11pm

@Mingwei @james811223
I am unable to use multi gpu while doing inference. I have raised an issue at Issue on page /serve/getting_started.html · Issue #27905 · ray-project/ray · GitHub
can you please help?

Topic		Replies	Views
Using ray with transformers pipeline for inference Ray Core	0	514	August 19, 2021
Run Python function in parallel on GPU Ray Core	10	4761	January 28, 2022
Ray multiprocessing with multi pytorch model inference Ray Core	1	599	October 18, 2023
Does ray support multi GPU inference with TensorRT? Ray Serve	1	1286	January 5, 2022
Use Ray to parallelize tasks	3	436	February 22, 2021

Parallel Detectron2(Pytorch) inference with GPU

Related topics