Ray Compiled Graphs (also referred to as aDAG) currently have no official NPU backend in the mainline Ray documentation or API reference. The official docs and API guides describe only GPU (CUDA/NCCL) and CPU transports. The current tensor transport options are “auto”, “cpu”, “accelerator”, “nccl”, and “shm”; “accelerator” refers to GPU/XLA or vendor-provided backends, not specifically to NPU. The PRs and issues you referenced (e.g., #51032, #51574, #55428, #55381) show ongoing community and contributor interest, but these features are not yet documented as available or stable in official Ray releases or user guides.
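To make the transport options concrete, here is a minimal sketch of passing a tensor between two actors in a Compiled Graph. It assumes a recent Ray release in which `with_tensor_transport` is available on DAG nodes, plus PyTorch. `pick_transport`, `run_once`, `Producer`, and `Consumer` are hypothetical names used only for illustration, and the list of transport names is copied from the answer above rather than from a specific Ray version, so check the docstring in your installed release.

```python
# Transport names as listed in the answer above. This is an assumption:
# the accepted values may differ between Ray versions.
LISTED_TRANSPORTS = ("auto", "cpu", "accelerator", "nccl", "shm")


def pick_transport(requested: str) -> str:
    """Hypothetical helper: keep a listed transport name, else fall back to "auto"."""
    name = requested.lower()
    if name in LISTED_TRANSPORTS:
        return name
    # Names such as "npu" are not among the listed options, so let Ray decide.
    return "auto"


def run_once(transport: str = "auto", n: int = 4) -> float:
    """Build and run a two-actor Compiled Graph that passes a torch tensor."""
    # Imported lazily so pick_transport works without Ray installed.
    import ray
    from ray.dag import InputNode

    @ray.remote
    class Producer:
        def make(self, size: int):
            import torch
            return torch.ones(size)

    @ray.remote
    class Consumer:
        def total(self, tensor) -> float:
            return float(tensor.sum())

    producer, consumer = Producer.remote(), Consumer.remote()
    with InputNode() as size:
        # The transport hint is attached to the edge that carries the tensor.
        tensor = producer.make.bind(size).with_tensor_transport(
            transport=pick_transport(transport)
        )
        dag = consumer.total.bind(tensor)

    compiled = dag.experimental_compile()
    try:
        return ray.get(compiled.execute(n))
    finally:
        compiled.teardown()


if __name__ == "__main__":
    print(pick_transport("nccl"))  # nccl
    print(pick_transport("npu"))   # auto
```

Calling `run_once()` with the default "auto" lets Ray choose the transport from where the sender and receiver run. Any other value needs matching hardware and actor resources (for example, GPUs for "nccl"), so treat those paths as untested here.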
If you need NPU support, follow those PRs and issues for updates or join the discussion there. For now, Ray Compiled Graphs are officially supported on GPU and CPU only; NPU support is experimental or under development and is not part of the stable API or documentation. Would you like more detail on the current state of accelerator support in Ray Compiled Graphs?
Sources:
- https://github.com/ray-project/ray/blob/master/python/ray/experimental/channel/torch_tensor_type.py
- https://discuss.ray.io/t/how-to-make-a-multi-input-node-in-dag/16073