Does ray support multi GPU inference with TensorRT?

Hi @jinfagang, yes! You can use the TensorRT Python API inside a Ray Serve deployment just as you would in a local script, because Ray Serve supports arbitrary Python code. To use multiple GPUs, make sure to allocate them to the deployment via the @serve.deployment(ray_actor_options={"num_gpus": 2}) syntax.