Yes, masking the forward works, but I'm not using dueling DQN. Basically, `DistributionalQTFModel` does `input → forward() → model_out`. Now, `model_out` contains the masked Q values, as I want. But if `q_hiddens` is specified, or `use_noisy` is `True`, other layers will be added on top of `model_out`, which I guess will break the model, since they will process the masked Q values and produce new values (also, I guess that layers taking `tf.float32.min` values as input will behave very badly).
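For reference, here is a minimal sketch of the masking pattern I mean, loosely based on RLlib's parametric-actions example. The observation keys `action_mask` and `real_obs`, and the `MaskedQModel` name, are assumptions for illustration; import paths vary across Ray versions:

```python
import tensorflow as tf
from ray.rllib.models.tf.fcnet import FullyConnectedNetwork
from ray.rllib.agents.dqn.distributional_q_tf_model import DistributionalQTFModel


class MaskedQModel(DistributionalQTFModel):
    """Sketch: push Q logits of invalid actions down to tf.float32.min."""

    def __init__(self, obs_space, action_space, num_outputs,
                 model_config, name, **kw):
        super().__init__(obs_space, action_space, num_outputs,
                         model_config, name, **kw)
        # Internal net over the "real" part of the observation
        # (assumed dict obs space with "real_obs" / "action_mask" keys).
        self.internal_model = FullyConnectedNetwork(
            obs_space.original_space["real_obs"], action_space,
            num_outputs, model_config, name + "_internal")

    def forward(self, input_dict, state, seq_lens):
        action_mask = input_dict["obs"]["action_mask"]
        logits, _ = self.internal_model(
            {"obs": input_dict["obs"]["real_obs"]})
        # log(0) -> -inf, clamped to tf.float32.min so masked actions
        # can never win the argmax.
        inf_mask = tf.maximum(tf.math.log(action_mask), tf.float32.min)
        # model_out already contains the masked values here; any extra
        # q_hiddens / noisy layers stacked on top would then consume these
        # tf.float32.min entries, which is the problem I'm describing.
        return logits + inf_mask, state
```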