I want to create a neural network with the same architecture as the policy network; the only difference is that I want to replace the softmax on the last layer of the network with tanh.
Any help will be greatly appreciated!
Hi @dev1dze,
Which algorithm are you using?
Hi Mannyv, and thanks for your reply!
I want to use PPO.
There is no softmax in the network used for the actor in PPO. The network outputs the activations of its last layer unmodified, as logits; no activation function is applied there.
The softmax is applied inside the Categorical action distribution used for Discrete action spaces. You could create a custom action distribution to use instead, but you would also have to define entropy, kl, log_prob, etc. for that distribution. RLlib does have a SquashedGaussian distribution you can use with Box action spaces, but not with Discrete ones.
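You can see this in plain PyTorch (outside RLlib): `torch.distributions.Categorical` accepts raw logits and normalizes them itself, so the network never needs to apply a softmax. A small illustration:

```python
import torch

# The actor network outputs raw logits; no activation on the last layer.
logits = torch.tensor([2.0, 0.5, -1.0])

# The softmax only happens inside the Categorical distribution.
dist = torch.distributions.Categorical(logits=logits)

print(dist.probs)                      # identical to torch.softmax(logits, dim=-1)
print(torch.softmax(logits, dim=-1))

action = dist.sample()                 # sampling and log_prob also work on logits
print(dist.log_prob(action))
```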
Thanks for your answer,
Let me try to explain it better: I am not trying to define a new policy network and use it to train the agent in place of the one the algorithm currently uses. I just want to use this network to learn a reward/cost function for the states and actions sampled in a trajectory/episode.
That is why I would like a reward network with the same architecture as the policy network (the only difference being no softmax on the output layer) and with a custom loss function, updated separately after the policy update, roughly like the sketch below.
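Something like this is what I have in mind (a minimal plain-PyTorch sketch; the hidden sizes just mirror a default two-layer MLP policy, and the loss and names are placeholders, not RLlib API):

```python
import torch
import torch.nn as nn

class RewardNet(nn.Module):
    """Same MLP shape as the policy network, but tanh on the output instead of softmax."""

    def __init__(self, obs_dim, act_dim, hiddens=(256, 256)):
        super().__init__()
        layers, in_size = [], obs_dim + act_dim
        for h in hiddens:
            layers += [nn.Linear(in_size, h), nn.Tanh()]
            in_size = h
        layers += [nn.Linear(in_size, 1), nn.Tanh()]  # tanh output layer
        self.net = nn.Sequential(*layers)

    def forward(self, obs, act):
        return self.net(torch.cat([obs, act], dim=-1)).squeeze(-1)

# Updated separately, after the policy update, with a custom loss.
reward_net = RewardNet(obs_dim=4, act_dim=2)
optimizer = torch.optim.Adam(reward_net.parameters(), lr=1e-4)

def my_reward_loss(pred, target):
    # Placeholder for the actual custom loss.
    return ((pred - target) ** 2).mean()

obs = torch.randn(32, 4)     # sampled states from the batch
act = torch.randn(32, 2)     # sampled actions (e.g. one-hot encoded for Discrete)
target = torch.randn(32)     # whatever target the custom loss needs

loss = my_reward_loss(reward_net(obs, act), target)
optimizer.zero_grad()
loss.backward()
optimizer.step()
```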
If you can point me to example code, or to the file where I should be looking, that would be a great help.
Thanks a lot!