Infinite loop inside SampleBatch._get_slice_indices

Maxime_Riche · April 16, 2021, 2:39pm

You can get this infinite loop while working with R2D2 (with DQN+LSTM), and when forgetting to change the trainer from DQNTrainer to R2D2Trainer, then you can get an infinite loop inside SampleBatch._get_slice_indices

Debugging that is a bit painful.
In my debugging setup, I had: slice_size = 1 and self.seq_lens containing 2s.

Maybe this assert would work:
assert self.seq_lens[idx] < slice_size

class SampleBatch(dict):
[...]
    def _get_slice_indices(self, slice_size):
        i = 0
        slices = []
        if self.seq_lens is not None and len(self.seq_lens) > 0:
            start_pos = 0
            current_slize_size = 0
            idx = 0
            while idx < len(self.seq_lens):
                seq_len = self.seq_lens[idx]
                current_slize_size += seq_len
                # Complete minibatch -> Append to slices.
                if current_slize_size >= slice_size:
                    slices.append((start_pos, start_pos + slice_size))
                    start_pos += slice_size
                    if current_slize_size > slice_size:
                        overhead = current_slize_size - slice_size
                        start_pos -= seq_len - overhead
                        idx -= 1
                    current_slize_size = 0
                idx += 1
        else:
            while i < self.count:
                slices.append((i, i + slice_size))
                i += slice_size
        return slices

bill-anyscale · April 16, 2021, 7:57pm

Hey Maxime, could you clarify your question? I’m not 100% sure I understand the issue you’re facing.

Maxime_Riche · April 17, 2021, 11:16am

I think it is more an issue than a question but I am not sure. So maybe I should post it on GitHub instead.

sven1977 · April 21, 2021, 11:46am

Hey @Maxime_Riche , no worries, thanks for the suggestion. I’ll create a PR. Makes sense that it gets stuck in that loop when we mix up the trainers. We should at least give a meaningful error.

Thanks for the catch!

sven1977 · April 22, 2021, 9:44am

PR: [RLlib] Discussion 1759: SampleBatch._get_slice_indices stuck for R2D2 when using incorrect Trainer. by sven1977 · Pull Request #15451 · ray-project/ray · GitHub

Topic		Replies	Views
Understanding seq_lens RLlib	1	897	November 4, 2022
Need help with debugging: Getting batch size of 0 RLlib	1	308	October 8, 2021
Changing add_time_dimension logic RLlib	9	367	July 6, 2023
Errors during training using BC with custom rnn RLlib	2	339	March 7, 2023
When are MARL replay buffers zero padded? RLlib	8	517	October 12, 2021

Infinite loop inside SampleBatch._get_slice_indices

Related Topics