Ray Data

Ray Data LLM APIs Ray Data has a LLM module that enables efficient batch inference with large language models (LLMs) using Ray Data. It integrates with inference engines like vLLM and OpenAI-compatible APIs, allowing users to process LLM requests in parallel, optimize resource usage, and configure model parallelism for larger models.

Topic	Replies	Views	Activity
About the Ray Data category Ray Data	1	735	April 14, 2025
Ray data `ReadParquet->SplitBlocks(2)` shows failure even though the entire ray job is successful Ray Data	1	7	June 3, 2025
How to use the same set of actors in multiple non-adjacent processing steps Ray Data	0	12	May 30, 2025
Why UDF time is larger than Remote wall time with concurrency=1? Ray Data	0	7	May 29, 2025
Does map_batches avoid saturating the inference engine? Ray Data LLM APIs	1	24	May 25, 2025
Does RayData Support multi-node vllm inference Ray Data LLM APIs	2	42	May 23, 2025
Ray Column With Custom Python Dataclass Type Ray Data	3	205	May 22, 2025
Join tasks getting stuck in PENDING_NODE_ASSIGNMENT Ray Data	7	45	May 21, 2025
Distributed training with different number of batches Ray Data	0	20	May 11, 2025
Ray Data job hangs Ray Data	2	45	April 3, 2025
When using ray data, how can we provision two GPUs and share them across multiple tasks? Ray Data	6	38	April 2, 2025
Async and dataset transformation Ray Data	5	45	April 1, 2025
Problems with Ray Datasets Library Ray Data	3	353	March 31, 2025
The requested parallelism is too high Ray Data	0	21	March 26, 2025
Map parquet columns causes decoding error with binary data Ray Data	3	88	March 24, 2025
Reading data from hdfs meets Segmentation fault Ray Data	1	39	March 24, 2025
Streaming write using ray's write_parquet for llm inference Ray Data	1	30	March 17, 2025
Why does Ray Data execute 3 blocks first on MAC Ray Data	1	20	March 11, 2025
How to change GPU assignment strategy, from greedy to balanced? Ray Data	6	42	March 11, 2025
What’s the migration path for ray.data.datasource.tfrecords_datasource.TFRecordDatasource? Ray Data	2	24	March 10, 2025
Why Ray Data read tfrecord so slow Ray Data	2	115	March 6, 2025
What's the migration path for ray.data.aggregate's Max, Mean, Min, and Std functions? Ray Data	2	33	March 6, 2025
Ray.data.filter() much slower than without filter Ray Data	8	205	March 6, 2025
Metadata fetching seems to be a sequential run Ray Data	1	51	March 1, 2025
Node fault tolerance in Ray Data Ray Data	2	52	January 10, 2025
Ray read_iceberg doesn't scale at large iceberg table Ray Data	0	66	November 27, 2024
How to auto assign actors to different GPUs in ray.data.map_batches Ray Data	2	46	November 26, 2024
Loading Geotiff Images Into Ray Dataset Ray Data	3	286	November 22, 2024
Example Image Writing Code: 'list' object has no attribute '__array_interface__' Ray Data	3	49	November 20, 2024
`map_batches` fails with Huggingface NER pipeline Ray Data	0	42	November 19, 2024