Hi all! This is a question I posted on slack recently, which @sven1977 answered. Posting it here for broader reach.
I’ve been trying to understand how the config parameter
vf_share_layers affects learning, and I had a few questions. Would be grateful if someone could throw some light on any of these!
- While implementing a custom model , does toggling the value of
vf_share_layerschange learning behavior? If so, how? Asking because a Github search of the parameter showed me the parameter was used only in the existing models inside Rllib.
- When vf losses are high, how does disabling
vf_share_layersalleviate the issue?
- And why does
vf_loss_coeffneed to be tuned when
Here is the link to Sven’s answer, thanks again!