Does preprocessor get "glued" to the trained model artifact?

jinnovation · October 10, 2022, 3:31pm

When using Ray AIR, does the preprocessor that gets passed into, say, TensorflowTrainer via the preprocessor kwarg get “attached” to the final trained model, e.g. the one accessible via ray.tune.Tuner.fit().get_best_result()? The Ray AIR documentation, e.g. the batch inference section, suggests that it is (given that the test dataset used for prediction is split from the same dataset as the train dataset, but used w/o preprocessing), but does not explicitly call this out.

If it is, at a high level, what are the underlying mechanisms used for doing so, and how do they differ across frameworks, e.g. TF, PT, XGB?

rliaw · October 10, 2022, 11:32pm

Hey John, thanks a bunch for opening this!

It doesn’t get attached to the model per se but rather to the Checkpoint that is generated. All checkpoints have a get_preprocessor method (see the Ray AIR API reference for Ray 2.0.0), which helps you achieve what you’re looking for.

This would be agnostic to all frameworks (since the Checkpoint holds the framework-specific model as a blob)
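
To make the idea concrete, here is a rough sketch of a checkpoint that stores a fitted preprocessor next to an opaque model blob. It is not Ray's actual implementation: `ToyCheckpoint`, `StandardScaler`, and the dict-based "model" are hypothetical names made up for illustration, and only the `get_preprocessor` method name mirrors the one mentioned above.

```python
import json
from dataclasses import dataclass
from typing import Dict, List


class StandardScaler:
    """Hypothetical preprocessor: learns mean/std on fit, applies them on transform."""

    def __init__(self, mean: float = 0.0, std: float = 1.0) -> None:
        self.mean = mean
        self.std = std

    def fit(self, values: List[float]) -> "StandardScaler":
        n = len(values)
        self.mean = sum(values) / n
        var = sum((v - self.mean) ** 2 for v in values) / n
        self.std = var ** 0.5 or 1.0  # fall back to 1.0 for constant input
        return self

    def transform(self, values: List[float]) -> List[float]:
        return [(v - self.mean) / self.std for v in values]


@dataclass
class ToyCheckpoint:
    """Hypothetical container: the model is just bytes, whatever the framework."""

    model_blob: bytes
    preprocessor_state: bytes

    @classmethod
    def from_parts(cls, model: Dict[str, float], pre: StandardScaler) -> "ToyCheckpoint":
        state = {"mean": pre.mean, "std": pre.std}
        return cls(json.dumps(model).encode(), json.dumps(state).encode())

    def get_preprocessor(self) -> StandardScaler:
        # Rebuild the fitted preprocessor from its saved state.
        return StandardScaler(**json.loads(self.preprocessor_state))

    def get_model(self) -> Dict[str, float]:
        return json.loads(self.model_blob)


# Fit on training data, then bundle the model and preprocessor together.
scaler = StandardScaler().fit([1.0, 2.0, 3.0, 4.0])
model = {"weight": 2.0, "bias": 0.5}
ckpt = ToyCheckpoint.from_parts(model, scaler)

# Later, the same fitted statistics come back out of the checkpoint.
restored = ckpt.get_preprocessor()
```

The checkpoint never needs to know which framework produced the model bytes, which is why the pairing can be framework-agnostic.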

jinnovation · October 11, 2022, 12:51pm

Thanks Richard. It sounds in that case like it’s the user’s responsibility to replicate the preprocessing logic appropriately at serving time, e.g. using TorchServe custom handlers. Is that a fair assessment?

rliaw · October 11, 2022, 5:47pm

Yep that’s right. We’ve been in conversation with @Keshi_Dai1/@Keshi_Dai about having this attach to TF models automatically.
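
As a rough illustration of what replicating the preprocessing at serving time can look like, the sketch below wraps a model in a handler that applies the training-time transform before every prediction. Everything here is hypothetical (`MinMaxScaler`, `toy_model`, `ServingHandler`); it is not the TorchServe handler API or anything from Ray, just the general pattern.

```python
from typing import Callable, List


class MinMaxScaler:
    """Hypothetical preprocessor whose bounds were learned on the training data."""

    def __init__(self, lo: float, hi: float) -> None:
        self.lo = lo
        self.hi = hi

    def transform(self, values: List[float]) -> List[float]:
        span = (self.hi - self.lo) or 1.0  # guard against a zero-width range
        return [(v - self.lo) / span for v in values]


def toy_model(features: List[float]) -> List[float]:
    """Stand-in for a trained model that expects inputs scaled to [0, 1]."""
    return [2.0 * x + 1.0 for x in features]


class ServingHandler:
    """Applies the same preprocessing used in training before calling the model."""

    def __init__(
        self,
        preprocessor: MinMaxScaler,
        predict_fn: Callable[[List[float]], List[float]],
    ) -> None:
        self.preprocessor = preprocessor
        self.predict_fn = predict_fn

    def handle(self, raw_inputs: List[float]) -> List[float]:
        return self.predict_fn(self.preprocessor.transform(raw_inputs))


scaler = MinMaxScaler(lo=0.0, hi=10.0)
handler = ServingHandler(scaler, toy_model)

raw = [0.0, 5.0, 10.0]
served = handler.handle(raw)   # preprocessing applied first
unprocessed = toy_model(raw)   # what happens if the step is forgotten
```

Here `served` and `unprocessed` differ for the same raw inputs, which is the training/serving skew that keeping the preprocessor with the checkpoint is meant to help avoid.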

jinnovation · October 11, 2022, 6:05pm

“We’ve been in conversation with @Keshi_Dai1/@Keshi_Dai about having this attach to TF models automatically.”

Doesn’t surprise me. I presume that one of the primary blockers to this would be implementing the attaching mechanism in a framework-agnostic way, rather than making something special-cased for TF specifically?

rliaw · October 11, 2022, 6:21pm

Yeah, this will need to be designed in a framework-agnostic way, but with hooks for TF.

jinnovation · October 11, 2022, 6:23pm

Anywhere I or other users could go to track the progress of that work/discussion? Or has this mostly been informal so far?

rliaw · October 11, 2022, 6:46pm

Mostly informal. We have to find funding for that project. I think we will ask you guys to put together some rough requirements on the usage before we write any lines of code.

Keshi_Dai1 · October 20, 2022, 2:53pm

@rliaw, could you please elaborate on the point about “the user’s responsibility to replicate the preprocessing logic appropriately at serving time”? Is this specific to “gluing the preprocessor to the trained TF model artifact”, or do users still need to do this when they use Ray AIR’s Checkpoint?

I would like to understand the routes (with and without Ray Serve) via Checkpoint in Ray AIR for keeping the feature transformation logic consistent between training and serving. Thank you!
