[RLlib] Make it easier to play trained policies

RickLan · April 1, 2021, 5:24am

Hi @drozzy I feel your pain. I came from stable_baselines too. I just wrote a runnable script to try out a trained policy below. It’s for multi-agent but can be easily modified for single agent. I think the docs has an example for single agent, but I couldn’t remember where atm. Cheers,

Edit: there is definitely a higher learning curve for RLlib than stable_baselines, imho. For my research work, I wish I had started with RLlib than stable_baselines.

Edit 2: the single agent version is here: Getting Started with RLlib — Ray 3.0.0.dev0

Topic		Replies	Views
RLLib: How to use policy learned in tune.run()? RLlib	6	994	September 21, 2023
How to get and use a trained policy RLlib	0	487	September 8, 2024
Rollout/test a already trained policy employing PolicyServerInput and PolicyClient RLlib	1	299	October 30, 2021
How to deploy a trained Ray RLlib PPO policy/model in multi-agent-case? RLlib	5	831	November 10, 2021
How to compute actions with RLlib and Tune after training RLlib	6	520	July 15, 2025

[RLlib] Make it easier to play trained policies

Related topics