With Ray datasets, how do I handle skipping over–e.g, the first 500K batches in a dataset.
The use for this is, when I’m doing a run on spot instances and then the run stops because the spot instance shuts down.
Ideally, I’d store the number of training steps (let’s call it N) that have occurred so far, and then skip over N batches.