Cannot pickle BatchInferModel when ds.map_batches(BatchInferModel)

How severely does this issue affect your experience of using Ray?

  • High: It blocks me from completing my task.

I read the tutorial at https://docs.ray.io/en/latest/data/pipelining-compute.html#example-pipelined-batch-inference, in which the model is initialized in BatchInferModel's __init__() and inference is performed in __call__().

I wrote a similar test. In __init__():
self.session = onnxruntime.InferenceSession("/path/to/model.onnx")
In __call__():
self.session.run(...)
When I run:
ds.map_batches(BatchInferModel)

ERROR:
cannot pickle 'onnxruntime.capi.onnxruntime_pybind11_state.InferenceSession' object
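For context, here is a minimal, self-contained sketch of the pattern described above, modeled on the linked docs example. The model path, the dummy dataset, and the assumption that the batch arrives as a NumPy array are illustrative rather than taken from the post, and `compute="actors"` follows the older Ray Data API used in those docs:

```python
import numpy as np
import onnxruntime
import ray

class BatchInferModel:
    def __init__(self):
        # Session is created in __init__, following the docs example.
        # Model path is a placeholder.
        self.session = onnxruntime.InferenceSession("/path/to/model.onnx")

    def __call__(self, batch: np.ndarray) -> np.ndarray:
        # Feed the batch to the model's first input and return the first output.
        input_name = self.session.get_inputs()[0].name
        return self.session.run(None, {input_name: batch})[0]

# Dummy dataset for illustration; depending on the Ray version the batch
# may arrive as an ndarray or as a dict of columns.
ds = ray.data.from_numpy(np.random.rand(8, 3).astype(np.float32))
ds = ds.map_batches(BatchInferModel, compute="actors", batch_size=4)
```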

This error seems to occur because the model is initialized on one worker, serialized with pickle, and then transferred to the other workers.
How should I write this so that initialization is performed on each worker separately?

Hi @Jiayi_Li, your usage looks right. The issue is that InferenceSession itself is not picklable. You may take a look at this for a workaround: Fixes #643, implements __getstate__ in python API by xadupre · Pull Request #800 · microsoft/onnxruntime · GitHub
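One common way to apply the linked PR's idea manually is a small wrapper that drops the session during pickling and rebuilds it from the model path after unpickling. This is only a sketch under assumptions; `PicklableSession` and its attributes are hypothetical names, not part of onnxruntime's API:

```python
import onnxruntime

class PicklableSession:
    """Hypothetical wrapper: pickles the model path instead of the session."""

    def __init__(self, model_path: str):
        self.model_path = model_path
        self.session = onnxruntime.InferenceSession(model_path)

    def __getstate__(self):
        # Exclude the unpicklable InferenceSession; keep only the path.
        return {"model_path": self.model_path}

    def __setstate__(self, state):
        # Rebuild the session on the receiving worker after unpickling.
        self.model_path = state["model_path"]
        self.session = onnxruntime.InferenceSession(self.model_path)

    def run(self, output_names, input_feed):
        # Delegate to the real session.
        return self.session.run(output_names, input_feed)
```

The same idea can be applied inside BatchInferModel itself: store only the model path in __init__ and create the session lazily on the first __call__, so nothing unpicklable ever has to cross a process boundary.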