Yep! I think so for the most part - you can read more about it here: ray/doc/source/rllib/external-envs.rst at releases/2.47.1 · ray-project/ray · GitHub
Regarding weight versioning and off-policy data: RLlib’s external environment setup (via the RLlink protocol) supports both on-policy and off-policy data collection. RLlib can train on off-policy samples, though on-policy algorithms like PPO may see some degradation if the policy lag (how many weight updates old the collecting policy is relative to the learner) grows large.
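Here is a minimal sketch of the weight-versioning idea: stamp each trajectory with the version of the weights it was collected under, and let the learner decide how much policy lag it will train on. The names (`weights_version`, `select_trainable`, `MAX_LAG`) are illustrative, not part of the RLlink protocol or the RLlib API.

```python
# Illustrative only: each trajectory carries the version of the weights that
# produced it, so the learner can filter or down-weight stale data.
MAX_LAG = 2  # for PPO-style on-policy training, keep the tolerated lag small


def select_trainable(trajectories, current_version, max_lag=MAX_LAG):
    """Split trajectories into fresh (within lag tolerance) and stale."""
    fresh, stale = [], []
    for traj in trajectories:
        lag = current_version - traj["weights_version"]
        (fresh if lag <= max_lag else stale).append(traj)
    return fresh, stale
```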
For high scalability (100+ games), batching trajectories on the client and only periodically refreshing the client’s copy of the weights is standard practice, and RLlib is designed to handle that kind of asynchronous, parallel data ingestion (this discussion might be helpful even if it is a bit old). A rough sketch of the client-side pattern is below.
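This is only a sketch of the loop described above, not RLlib code: it runs many games, buffers finished episodes, ships them in batches, and refreshes weights on a timer. `Game`, `send_batch`, and `fetch_weights` are hypothetical stand-ins for whatever your RLlink client / game harness provides.

```python
import time

NUM_GAMES = 128          # e.g. 100+ parallel game instances
BATCH_SIZE = 32          # episodes per upload
WEIGHT_REFRESH_S = 30.0  # how often to pull new weights from the server


def run_client(games, policy, send_batch, fetch_weights):
    buffer = []
    last_refresh = time.monotonic()
    weights_version = 0

    while True:
        # Step every game with the current (possibly slightly stale) policy.
        for game in games:
            episode = game.step(policy)   # returns a finished episode or None
            if episode is not None:
                episode["weights_version"] = weights_version
                buffer.append(episode)

        # Ship episodes in batches instead of one at a time.
        if len(buffer) >= BATCH_SIZE:
            send_batch(buffer)
            buffer.clear()

        # Only occasionally sync weights; training tolerates some off-policy lag.
        if time.monotonic() - last_refresh > WEIGHT_REFRESH_S:
            policy, weights_version = fetch_weights()
            last_refresh = time.monotonic()
```

The key design choice is that weight pulls happen on a timer (or per N episodes) rather than per step, which keeps network traffic low while bounding how stale the acting policy can get.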