[RLlib] Why is TF's value_out initialization 100x smaller than torch's?

Hey, does anyone know why the value_out std initialization in tf/fcnet.py is 0.01, while the torch std initialization in torch/fcnet.py is 1.0?
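For reference, here's a minimal sketch of what an normc-style initializer does (assuming the row-normalization scheme RLlib's torch normc_initializer uses; the helper name and layer sizes are illustrative, not RLlib's actual code):

```python
import torch

def normc_init_(tensor: torch.Tensor, std: float = 1.0) -> torch.Tensor:
    # "normc"-style init (a sketch of the scheme behind normc_initializer):
    # sample N(0, 1), then rescale each output row of the (out, in) weight
    # matrix so its L2 norm equals `std`.
    with torch.no_grad():
        tensor.normal_(0.0, 1.0)
        tensor *= std / torch.sqrt(tensor.pow(2).sum(dim=1, keepdim=True))
    return tensor

# With std=0.01 the value head starts out predicting ~0; with std=1.0
# its initial outputs are ~100x larger for the same activations.
w_tf_like = normc_init_(torch.empty(1, 256), std=0.01)
w_torch_like = normc_init_(torch.empty(1, 256), std=1.0)
```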

Great question, @raoul-khour-ts! There is no good reason for these defaults to differ between tf and torch. We should fix this. …

@sven1977 If there is a reason to change the initializer, maybe the better option would be to switch to the Xavier (Glorot) initializer as the default?
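If we did switch, the torch side could presumably use PyTorch's built-in Xavier init; a hypothetical sketch (the hidden size is made up, not RLlib's actual code):

```python
import torch.nn as nn

value_out = nn.Linear(256, 1)  # hypothetical hidden size of 256
# Xavier/Glorot: scales weights by fan-in/fan-out so activation variance
# is roughly preserved from layer to layer.
nn.init.xavier_uniform_(value_out.weight)
nn.init.zeros_(value_out.bias)
```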

Also a good point (for CNNs, that's what we already use). The thing is, I don't want to change the default without at least running some benchmarks again.
I did change torch's vf layer to 0.01 (normc), matching how tf's fcnet works. The risk of that should be relatively low:
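For illustration, the resulting torch value branch would look roughly like this (a sketch reusing the hypothetical normc_init_ helper above, not the actual diff):

```python
import torch.nn as nn

hidden_size = 256  # illustrative, not RLlib's actual default
value_branch = nn.Linear(hidden_size, 1)
# normc with std=0.01: the value head starts out predicting ~0, so early
# updates aren't dominated by random value noise.
normc_init_(value_branch.weight, std=0.01)
nn.init.zeros_(value_branch.bias)
```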