When using Ray's Data and Train modules together, if the dataset hasn't been materialized, Ray provides streaming batch execution (using the `read_*` and `map` methods), where batches are sent to training as soon as they are ready. There's also the option (using just a `read_*` method) of passing a `collate_fn` to `iter_torch_batches`. I'm curious whether the `collate_fn` used there is processed in parallel across the Ray cluster, and how that approach differs from the first one I mentioned.