Benchmarks for Ray Data?

Similar to my other question regarding benchmarks for Ray Serve, are there benchmarks, either published or in the works, that compare Ray Data to TF Transform, Dataflow, or other preprocessing solutions?

Hi @jinnovation, great question, and good timing. Benchmarking is one of the key focus areas for Ray AIR with Ray Dataset in Ray 2.1 and 2.2, and we're actively testing and improving it on a weekly basis.

You should hear from us soon :)

