How to "bucket" batch requests on Serve?

amiasato · September 28, 2023, 1:13pm

Hello everyone!

I’d like to know if there’s a way to “bucket” batch requests on serve. Something along the lines of the following snippet:

@serve.batch(bucket_by="batch_param")
async def batch_process(self, samples: list[np.ndarray], batch_param: float):
    batch = build_batch(samples)
    return process_batch(batch, batch_param)

async def process(self, sample: np.ndarray, param: float):
    await batch_process(sample, param)

In which batch queues would be bucketed by param, i.e. requests with different param values would be queued in different batches.

Thanks!

shrekris · October 13, 2023, 4:52pm

Hi @amiasato! There’s no first-class way to do this on Serve, but I’m curious to hear more about your use case. Could you submit a feature request on the GitHub repo, so we can discuss it there and consider adding it?

Topic		Replies	Views
How to View Results of Post Request with Ray Serve Batching? Ray Serve	1	391	February 7, 2022
Batching when using non python client Ray Serve	1	411	March 24, 2021
Batching doesn't work: requests are processed one by one Ray Serve	2	573	June 19, 2021
How to post data to dynamic batch directly？ Ray Serve	1	32	October 24, 2024
Caching and batching with serve Ray Serve	1	492	April 17, 2021

How to "bucket" batch requests on Serve?

Related topics