Custom PyTorch model implementation for PPO training

overloader · May 23, 2023, 12:47am

Hello

Could someone provide an example of how to implement a custom CNN-LSTM model in RLlib for PPO training on an environment with a discrete action space? Or could someone link to tutorials on this?
I am mostly interested in how to produce the discrete-action output from the LSTM in the forward method and in value_function.
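
To make the question concrete, here is a minimal sketch in plain PyTorch of the kind of model being described. It is not RLlib's API: the class name `CnnLstmActorCritic`, the `initial_state` helper, the layer sizes, and the `(B, T, C, H, W)` observation layout are all assumptions chosen for illustration. The idea is that `forward` returns one logit per discrete action from the LSTM output, and `value_function` reuses the features cached by the last `forward` call. Adapting it to RLlib would still require wrapping it in whichever model interface your RLlib version expects.

```python
import torch
import torch.nn as nn


class CnnLstmActorCritic(nn.Module):
    """Hypothetical CNN-LSTM actor-critic (illustration only, not an RLlib class)."""

    def __init__(self, in_channels=3, num_actions=4, lstm_size=64):
        super().__init__()
        # Small CNN encoder applied to every frame independently.
        self.cnn = nn.Sequential(
            nn.Conv2d(in_channels, 16, kernel_size=3, stride=2, padding=1),
            nn.ReLU(),
            nn.Conv2d(16, 32, kernel_size=3, stride=2, padding=1),
            nn.ReLU(),
            nn.AdaptiveAvgPool2d(1),  # (N, 32, 1, 1) for any input resolution
            nn.Flatten(),             # (N, 32)
        )
        self.lstm = nn.LSTM(32, lstm_size, batch_first=True)
        # Discrete action space: one unnormalized logit per action.
        self.logits_head = nn.Linear(lstm_size, num_actions)
        # Critic head: one scalar value per time step.
        self.value_head = nn.Linear(lstm_size, 1)
        self._features = None  # LSTM output cached for value_function()

    def initial_state(self, batch_size):
        # Zero hidden and cell states, shape (num_layers, B, lstm_size).
        h = torch.zeros(1, batch_size, self.lstm.hidden_size)
        return h, h.clone()

    def forward(self, obs, state):
        # obs: (B, T, C, H, W); state: (h, c) tuple for the LSTM.
        b, t = obs.shape[:2]
        x = self.cnn(obs.flatten(0, 1))   # merge batch and time: (B*T, 32)
        x = x.reshape(b, t, -1)           # restore time axis: (B, T, 32)
        self._features, new_state = self.lstm(x, state)
        logits = self.logits_head(self._features)  # (B, T, num_actions)
        return logits, new_state

    def value_function(self):
        # Uses the features from the most recent forward() call.
        assert self._features is not None, "call forward() first"
        return self.value_head(self._features).squeeze(-1)  # (B, T)


model = CnnLstmActorCritic()
obs = torch.randn(2, 5, 3, 32, 32)  # 2 sequences of 5 RGB frames
logits, state = model(obs, model.initial_state(2))
values = model.value_function()
# A categorical distribution over the logits yields integer actions.
action = torch.distributions.Categorical(logits=logits).sample()
```

The logits are left unnormalized on purpose: a categorical distribution built from them applies the softmax internally, which is the usual way a PPO loss consumes discrete-action outputs.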

arturn · July 23, 2023, 11:28pm

Hi @overloader,

Please check out the newest version of RLlib and have a look at the following file:
