Hi, I’m trying to use TensorRT-LLM (https://github.com/NVIDIA/TensorRT-LLM) to deploy my model with Ray. However, I keep getting stuck on installing TensorRT-LLM, as it always produces errors related to mpi4py, and trying to compile tensorrt-llm from source always results in CUDA errors such as libcud*.dll not found. Can anyone who has tried TensorRT-LLM on Ray help me out? Thanks.
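For context, even a basic environment sanity check like the one below fails for me (a minimal sketch; the library names are Linux-style, and on Windows the CUDA runtime would be something like cudart64_*.dll instead):

```python
# Minimal sanity check before trying tensorrt_llm itself.
# Assumes a Linux environment; adjust library names for Windows.
import ctypes

# mpi4py is a dependency of tensorrt_llm; this import is where my install fails.
from mpi4py import MPI
print("MPI rank:", MPI.COMM_WORLD.Get_rank())

# Check that the CUDA runtime is visible to the dynamic loader.
# Raises OSError if the CUDA libraries aren't on LD_LIBRARY_PATH.
ctypes.CDLL("libcudart.so")
print("CUDA runtime found")

import tensorrt_llm
print("tensorrt_llm version:", tensorrt_llm.__version__)
```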
I am running into a similar issue. @rifkybujana, were you able to get around that?
Nope. Instead, I just use the officially released TensorRT support for RayLLM.
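For anyone landing here, a rough sketch of what wrapping TensorRT-LLM in a plain Ray Serve deployment can look like (this is not the official RayLLM integration; it assumes tensorrt_llm's high-level `LLM` API, and names like the model path are placeholders):

```python
# A minimal sketch of serving a model with Ray Serve + TensorRT-LLM's
# high-level LLM API. Not the official RayLLM integration; exact
# parameter names may differ across tensorrt_llm versions.
from ray import serve
from starlette.requests import Request


@serve.deployment(ray_actor_options={"num_gpus": 1})
class TRTLLMDeployment:
    def __init__(self, model_path: str):
        # Import inside the actor so the heavy CUDA/MPI init happens on the GPU worker.
        from tensorrt_llm import LLM, SamplingParams

        self.llm = LLM(model=model_path)
        self.sampling = SamplingParams(max_tokens=128)

    async def __call__(self, request: Request) -> str:
        prompt = (await request.json())["prompt"]
        outputs = self.llm.generate([prompt], self.sampling)
        return outputs[0].outputs[0].text


# Hypothetical local model path for illustration.
app = TRTLLMDeployment.bind(model_path="/models/my-model")
# serve.run(app)  # then POST {"prompt": "..."} to the Serve HTTP endpoint
```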