RLPredictor with attention net

evo11x · November 23, 2022, 7:43pm

High: It blocks me to complete my task.

I have this code

 num_transformers = 1
 attention_dim = 64
 memory = 50
 state = [
   np.zeros([memory, attention_dim], np.float32)
   for _ in range(num_transformers)
 ]

predictor = RLPredictor.from_checkpoint(Checkpoint.from_directory(checkpoint_path))
action = predictor.predict(obs, state)

I get this error:

TypeError: predict() takes 2 positional arguments but 3 were given

Lars_Simon_Zehnder · November 23, 2022, 10:16pm

@evo11x predict() is defined in the Predictor class. It takes only a single argument.

evo11x · November 23, 2022, 10:35pm

With a single argument, only the observation I get an error with invalid seq lens

Lars_Simon_Zehnder · November 24, 2022, 3:57pm

The observations need to have a further dimension for the sequence length. If you have a sequence length of 4 you need a further dimension and along that you stack 4 observation tensors.

evo11x · November 26, 2022, 10:20am

What 4 observations tensors? Do you have an example?

Thanks!

Lars_Simon_Zehnder · November 26, 2022, 3:55pm

I think what is needed is the time dimension: (BATCH_SIZE, TIME_DIM: SEQ_LEN, OBS_DIM_1, OBS_DIM_2, etc.)

Topic		Replies	Views
Why couldn't I run rllib/examples/attention_net.py properly! RLlib	4	277	June 18, 2023
Multiple observations including RNN's RLlib	3	319	May 18, 2022
[RLlib] Restoring a GTrXLNet or use_attention=True fails RLlib	1	741	June 3, 2021
Understanding seq_lens RLlib	1	1150	November 4, 2022
Assert seq_lens is not None -> PPOTrainer RLlib	4	1412	October 14, 2021

RLPredictor with attention net

Related topics