Writing custom RLModule for custom Algorithm

Lars_Simon_Zehnder · November 27, 2023, 5:09pm

Hi @zygis, and welcome to the forum!

Great question! The RLModule is part of our new stack and is meant to replace ModelV2 (not the Policy). The RLModule only needs a Policy if the algorithm samples with the RolloutWorker. In our new stack we are going to replace the RolloutWorker with the EnvRunner, which will no longer need a Policy. However, until the new stack is complete and fully tested, you still need to subclass TFPolicy or TorchPolicy to use the RLModule. Take a look at the PPO algorithm to see how to subclass the Policy when using the RLModule, i.e. which methods to implement, and take a look at PPO.get_default_policy_class().
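To make the last point a bit more concrete, here is a minimal sketch of the dispatch pattern that get_default_policy_class() follows, as far as I understand it: a classmethod on the algorithm that returns a policy class depending on the configured framework. To keep it runnable without Ray installed, all classes below are hypothetical stand-ins (TorchPolicyStandIn, TFPolicyStandIn, MyAlgorithmStandIn) and not the real RLlib classes, and the "framework" config key is assumed to behave as it does in PPO. In real code you would subclass the actual Algorithm and TorchPolicy or TFPolicy instead.

```python
# Stand-in classes only: they mimic the shape of the pattern without
# importing ray. None of them are real RLlib classes.


class TorchPolicyStandIn:
    """Hypothetical placeholder for a custom TorchPolicy subclass."""

    framework = "torch"


class TFPolicyStandIn:
    """Hypothetical placeholder for a custom TFPolicy subclass."""

    framework = "tf2"


class MyAlgorithmStandIn:
    """Hypothetical placeholder for a custom Algorithm subclass."""

    @classmethod
    def get_default_policy_class(cls, config):
        # Choose the policy class from the configured framework.
        # Assumption: the config exposes a "framework" key.
        if config["framework"] == "torch":
            return TorchPolicyStandIn
        return TFPolicyStandIn


policy_cls = MyAlgorithmStandIn.get_default_policy_class({"framework": "torch"})
print(policy_cls.__name__)
```

The policy class returned here is the place where, on the old sampling path, the RLModule would be used in place of a ModelV2.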
