I’m working on a Ray Data pipeline where I need to use a pool of Ray actors (specifically LLMs, each requiring a GPU) in multiple, non-adjacent processing steps. My pipeline looks like this:
- LLM Generation: Use the LLM pool to generate some initial data.
- Calculation: Perform calculations on the generated data without needing the LLMs.
- LLM Generation: Use the same LLM pool to generate further data based on the results of Step 2.
The standard `map_batches` approach, which works well when the LLMs are only used in a single step, doesn’t seem directly applicable here. I need a way to maintain and reuse the LLM actor pool across these separated steps.
(Merging all the steps into a single `map_batches` call seems to hurt performance, presumably because it reduces the level of concurrency.)
My current proposed solution is to launch the LLM pool with Ray Serve. This would let me treat the LLMs as a service and call them from the different stages of the Ray Data pipeline.
Is there any better way?