I’d like to convert a PyTorch dataset to a Ray dataset.
Every batch has three keys: id (int), image (ndarray), and mask (ndarray).
However, I can’t find an out-of-the-box solution in Ray Data for reading two images per sample from disk.
Currently, I see two options:
Read a CSV with the ids and use a transform step to read the images for each id
Build a custom Datasource that reads the two images from disk
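
To make the first option concrete, here is a rough sketch of how I imagine it. It assumes the CSV has an `id` column and that the files live at `images/<id>.png` and `masks/<id>.png`; those paths and the helper names (`read_image`, `load_pair`, `build_dataset`) are made up for illustration. It also assumes Pillow is installed for decoding. I have not checked how well this performs compared to a custom Datasource.

```python
import os

# Hypothetical directory layout: images/<id>.png and masks/<id>.png
IMAGE_DIR = "images"
MASK_DIR = "masks"


def read_image(path):
    """Decode one image file into an ndarray (assumes NumPy and Pillow are installed)."""
    # Imported lazily so load_pair can be exercised without these packages.
    import numpy as np
    from PIL import Image

    return np.asarray(Image.open(path))


def load_pair(row, image_dir=IMAGE_DIR, mask_dir=MASK_DIR, reader=read_image):
    """Turn a CSV row containing an 'id' into a row with id, image, and mask."""
    sample_id = int(row["id"])
    filename = f"{sample_id}.png"
    return {
        "id": sample_id,
        "image": reader(os.path.join(image_dir, filename)),
        "mask": reader(os.path.join(mask_dir, filename)),
    }


def build_dataset(csv_path):
    """Read the ids from a CSV, then load both images per row in a map step."""
    import ray

    return ray.data.read_csv(csv_path).map(load_pair)
```

Calling `build_dataset("ids.csv")` (file name is just an example) should then give a dataset whose rows carry the same three keys as my PyTorch batches. The `reader` argument is only there so the loading logic can be swapped or tested without touching disk.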