Is a TorchPolicy or the DL model in this policy automatically set to evaluation mode (model.eval()) during the default evaluation phase?

LukasNothhelfer · June 6, 2021, 3:23pm

For example, I have a model that includes torch.nn.Dropout layers that behave differently during training and evaluation. In Torch, I need to use model.train() or model.eval() to set which mode the model should be in. Is this also done by RLlib if I use a custom_eval_fn or the default routine?

mannyv · June 6, 2021, 10:36pm

Hi @LukasNothhelfer,

The Torch trainer switches between eval and train modes in compute_actions:

ray-project/ray/blob/e80095591cc30bc3a047614892c0a7bd25254e74/rllib/policy/torch_policy.py#L277-L281

    # Switch to eval mode.
    if self.model:
        self.model.eval()

    if self.action_sampler_fn:

and in learn_on_batch:

ray-project/ray/blob/e80095591cc30bc3a047614892c0a7bd25254e74/rllib/policy/torch_policy.py#L449-L451

    if self.model:
        self.model.train()
    # Callback handling.

If you customize the policy in the standard way, for example with “with_updates”, then the mode should be switched for you. You can add breakpoints or print statements at those lines if you want to be extra sure.
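As a rough illustration of that kind of check, here is a minimal sketch in plain PyTorch, with no RLlib involved. `TinyModel` is a hypothetical stand-in for a custom model with a Dropout layer, and the print statement in `forward` is just one way to see which mode is active. It only demonstrates what `model.eval()` and `model.train()` change; it does not reproduce RLlib's own call sequence.

```python
import torch
import torch.nn as nn


class TinyModel(nn.Module):
    """Hypothetical stand-in for a custom model that contains Dropout."""

    def __init__(self):
        super().__init__()
        self.linear = nn.Linear(4, 4)
        self.dropout = nn.Dropout(p=0.5)

    def forward(self, x):
        # nn.Module.training is True after .train() and False after .eval().
        print("forward called, training mode:", self.training)
        return self.dropout(self.linear(x))


torch.manual_seed(0)
model = TinyModel()
x = torch.ones(2, 4)

# Eval mode: Dropout is a no-op, so repeated calls give identical outputs.
model.eval()
eval_out_1 = model(x)
eval_out_2 = model(x)
same_in_eval = torch.equal(eval_out_1, eval_out_2)
print("identical outputs in eval mode:", same_in_eval)

# Train mode: Dropout randomly zeroes activations and rescales the rest.
model.train()
train_out = model(x)
print("dropout submodule in training mode:", model.dropout.training)
```

The same `self.training` print could be placed in the forward pass of a custom model to see which mode it is in when it is called during sampling, evaluation, and learning.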

LukasNothhelfer · June 6, 2021, 11:46pm

+1 for sharing code references, thx
