How to implement Generalized State Dependent Exploration?

cwijesundara · November 21, 2024, 8:07pm

Hello! I was wondering whether it’s possible to implement gSDE in RLlib. Stable Baselines has it implemented, so I’d like to know how easy it would be to implement it similarly in RLlib.

Lars_Simon_Zehnder · November 25, 2024, 5:49pm

@cwijesundara Thanks for posting this question. To implement gSDE in RLlib, you could implement the state-dependent distribution as a TorchDistribution and write an RLModule that returns, under its action_dist_inputs key, the logits plus a state embedding. Make sure the RLModule.action_dist_cls attribute points to that state-dependent TorchDistribution (either by hard-coding it or by writing a custom ray.rllib.core.models.catalog.Catalog).
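As a rough illustration of the state-dependent distribution mentioned above, here is a minimal sketch of the gSDE noise mechanism in plain PyTorch. It is not RLlib code: the class name GSDENoise, its methods, and the log_std_init default are hypothetical, and wrapping this logic in a TorchDistribution subclass is left out. The sketch assumes the RLModule supplies an action mean and a state embedding (called latent here), and that in real training log_std would be a learnable parameter rather than a fixed tensor.

```python
import torch
from torch.distributions import Normal


class GSDENoise:
    """Hypothetical helper sketching gSDE noise; not part of RLlib."""

    def __init__(self, latent_dim, action_dim, log_std_init=-2.0):
        # One log-std per (latent feature, action dimension) pair.
        # Assumption: fixed here; in training this would be an nn.Parameter.
        self.log_std = torch.full((latent_dim, action_dim), log_std_init)
        self.resample()

    def resample(self):
        # Draw a new exploration matrix; gSDE does this only every few steps,
        # so the noise stays a deterministic function of the state in between.
        std = self.log_std.exp()
        self.exploration_matrix = Normal(torch.zeros_like(std), std).sample()

    def sample(self, mean, latent):
        # State-dependent noise: a linear function of the state embedding.
        return mean + latent @ self.exploration_matrix

    def distribution(self, mean, latent):
        # Marginal over exploration matrices: a Gaussian whose variance
        # depends on the state embedding; used for log-probabilities.
        variance = (latent ** 2) @ (self.log_std.exp() ** 2)
        return Normal(mean, torch.sqrt(variance + 1e-6))
```

With this sketch, calling sample twice on the same mean and latent returns the same action until resample is called, which is the property that distinguishes gSDE from independent per-step Gaussian noise.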

cwijesundara · December 18, 2024, 8:31pm

Hi @Lars_Simon_Zehnder ! Thanks for the reply! How would I approach this if I’m using the older rllib framework that doesn’t use RLModule?

Lars_Simon_Zehnder · December 19, 2024, 4:37pm

@cwijesundara For the old API stack, you would instead take a look at the Exploration class that underlies all exploration algorithms in RLlib’s old API stack. The same folder contains many examples that implement exploration strategies of varying complexity.

Please note, however, that the new API stack will very soon be the standard in RLlib, and at that point we will no longer support the old API stack.
