1. Severity of the issue: (select one)
None: I’m just curious or want clarification.
2. Environment:
Ray version: 2.48.0
Python version: python3.9
OS:
3. What happened vs. what you expected:
I have several NPU cards and hope to use the Compiled Graph feature on VLLM to replicate PP parallelism. I would like to know if the community currently has plans to implement NPU as a backend for aDAG.
As of now, there is no official support for NPU as a backend in Ray Compiled Graphs (aDAG/Compiled Graph) in the mainline Ray documentation or API references. The official docs and API guides do not mention NPU support—only GPU (CUDA/NCCL) and CPU transports are described, and the current transport options are “auto”, “cpu”, “accelerator”, “nccl”, and “shm” for tensor transport, with “accelerator” referring to GPU/XLA or vendor-provided backends, but not specifically NPU. The PRs and issues you referenced (e.g., #51032, #51574, #55428, #55381) indicate ongoing community and contributor interest, but these features are not yet documented as available or stable in the official Ray releases or user guides.
If you are interested in NPU support, you may want to follow those PRs/issues for updates or contribute to the discussion. For now, Ray Compiled Graphs are officially supported for GPU and CPU, and NPU support is experimental or under development, not part of the stable API or documentation. Would you like more detail on the current state of accelerator support in Ray Compiled Graphs?