How severe does this issue affect your experience of using Ray?
- Medium: It contributes to significant difficulty to complete my task, but I can work around it.
I’m trying to migrate a geospatial ML inference pipeline to use ray.
The first step in our pipeline is loading a bunch of geotiff files that represent satellite images. Where I’m stuck is trying to figure out the best way to get the geotiffs in the ray dataset
So far:
I tried using ray.data.read_images but it relies on pil/pillow and pil says that they support tifs (Image file formats - Pillow (PIL Fork) 10.2.0 documentation) but pil doesn’t seem to be very robust at opening geotiffs (is it possible to open a geotiff file in python without using gdal? - Geographic Information Systems Stack Exchange). I tried opening 4 different geotiffs and only 1 worked with pil and it wasn’t clear why.
I tried using gdal to open the geotiff and then store it in a python object and use ray’s from_items loader but it had trouble serializing the gdal opened image.
Do you have any tips on how to load geotiffs into a ray dataset?