GRU hidden_state tensor batch dimension is incompatible with sample_batch

When I return `[self.conv.weight.new(1, self.cell_size).zero_().squeeze(0)]` from the `get_initial_state` function and remove the hacky batch-fixing code, it results in this error: `Input batch size 32 doesn't match hidden0 batch size 4`

When I add back the batch-fixing code

        # Pad h_in with zero rows so its batch dimension matches the input's.
        if h_in.shape[0] != x.shape[0]:
            missing_h = self.conv.weight.new(x.shape[0] - h_in.shape[0], h_in.shape[1]).zero_()
            h_in = torch.vstack((h_in, missing_h))

and keep the `squeeze`, it seems to run. But that still leaves my original question: why is `seq_len` four 8s rather than thirty-two 1s, given that the batch size seems to be determined by `len(seq_len)` rather than `sum(seq_len)`?
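For context, here is a minimal plain-PyTorch sketch (with made-up dimensions chosen to match the error message; `obs_dim` and `cell_size` are assumptions) of why the GRU's hidden state is sized by the number of sequences rather than by the number of timesteps. A flat batch of 32 transitions grouped into 4 sequences of length 8 becomes a `(4, 8, obs_dim)` input, and the initial hidden state then needs one row per sequence:

```python
import torch
import torch.nn as nn

# 32 transitions grouped into 4 sequences of length 8, i.e. seq_len = [8, 8, 8, 8].
num_seqs, max_seq_len, obs_dim, cell_size = 4, 8, 16, 64

gru = nn.GRU(input_size=obs_dim, hidden_size=cell_size, batch_first=True)

# The flat batch of 32 rows is reshaped to (num_seqs, max_seq_len, obs_dim)...
x = torch.zeros(num_seqs * max_seq_len, obs_dim).view(num_seqs, max_seq_len, obs_dim)

# ...so the initial hidden state has one row per *sequence*, not per timestep:
h0 = torch.zeros(1, num_seqs, cell_size)  # (num_layers, num_seqs, cell_size)

out, h_n = gru(x, h0)
print(out.shape)  # torch.Size([4, 8, 64])
print(h_n.shape)  # torch.Size([1, 4, 64])
```

So if the initial state is built with 32 rows (one per transition) while the input has been chunked into 4 sequences, the two batch dimensions disagree, which is consistent with the `32` vs `4` mismatch in the error above.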