From stable-baselines3 to ray rl

upi · May 30, 2022, 7:40pm

How severe does this issue affect your experience of using Ray?

Medium: It contributes to significant difficulty to complete my task, but I can work around it.

Hi all. I have been trying out RL problems with the help of sb3. Given the interesting opportunities, ray has to offer I want to try it for a PPO problem. But I find it very complex:
My env has Dict observations, Discrete actions. I use a standardization tool for each key of the dict provided by sb3: VecNormalization wrapper. As a policy, I have a custom feature extractor for each key on the dict. Then I concat the result and feed a separated value and policy functions. (Can’t wait to take advantage of Attention and other resources built into Ray!).
How can I switch to ray? I can see that the way I work with sb3 is very different from what ray examples and doc show.

sven1977 · May 31, 2022, 7:54am

Hey @upi , thanks for the question!

Here are two example scripts describing the move from SB3 to RLlib:
rllib\examples\sb2rllib_rllib_example.py
rllib\examples\sb2rllib_sb_example.py

Hope these help. We should also add a migration guide on how to move to our docs.

upi · May 31, 2022, 7:48pm

Hey @sven1977 those scripts came in handly but won’t be enough to move a complex environment like mine but I will try it anyway. If I find a good way to go I will be pleased to share it with the Ray community

Topic		Replies	Views
Issues reproducing stable-baselines3 PPO performance with rllib RLlib	14	2463	March 16, 2022
Migrating from StableBaselines3, not able to reproduce results RLlib	1	98	April 14, 2024
Architecture of RLLib/ Migrating from SB3 RLlib	0	52	August 21, 2023
Reproducing results from stablebaselines 3 RLlib	2	651	August 6, 2021
Converstion to Ray 2.0 RLlib	1	290	October 25, 2022

From stable-baselines3 to ray rl

Related topics