What is the use for Model Layer "_value_branch" for Gradient Free Optimization (ES, ARS)?

Zhao_Pengfei · September 11, 2021, 2:50pm

If I Initialize from a predefined model, I notice that there is an additional network for Value Function by default.

However ES/ARS never make use of the Value Function. So is this part of the model even needed or can it be safely deleted ?

If so then the original implementation in ray/es.py at master · ray-project/ray · GitHub should not update the parameters for the value function.

Possible script to run:

tune.run(ESTrainer, config={“env”: “CartPole-v0”,
“framework”: “torch”,
“num_workers”: 1,
“stepsize”: 0.01,
“model”: {
“fcnet_hiddens”: [],
}
},
stop={“training_iteration”: 10}
)

sven1977 · September 27, 2021, 8:33am

Hey @Zhao_Pengfei , great question. The answer is that there is no use for that branch
, it’s simply constructed b/c our default models all have that branch in them.

We are currently working on a new model builder API that would get rid of these unneeded value branches and give the user more control over what RLlib’s default models will look like.

Topic		Replies	Views
Value Branch In fcnet.py RLlib	1	463	September 12, 2021
How to have completely separate recurrent value function model? RLlib	0	17	November 14, 2024
Best way to have custom value state + LSTM RLlib	9	3067	April 10, 2022
Get value function values from IMPALA RLlib	4	498	June 30, 2021
Value function of recurrent state models RLlib	6	596	October 7, 2021

What is the use for Model Layer "_value_branch" for Gradient Free Optimization (ES, ARS)?

Related topics