Is it possible to implement Circular Decision in RLlib?

Morphlng · November 21, 2024, 2:41am

Hi! I’m working on a path planning task using RL. At each timestep, we would like to “freeze” the simulator, and do a simulation on the agent side to produce a sequence of waypoints.

The RL action output is the relative offset from current position, i.e. (dx, dy). We would like to do a “circular decision”, so that we get [(dx1, dy1), (dx2, dy2), …] (maxium at 10), each one is the offset from last position.

I’m wondering if this is possible? What part of the RLlib workflow should I modify, or do I simply use gym.spaces.Sequence?

Topic		Replies	Views
Applying rllib to robotics problems RLlib	4	845	April 25, 2021
Training for turn-based sequential games RLlib	4	573	January 21, 2023
Implementing Jump Start Reinforcement Learning in RLLib RLlib	8	1147	May 27, 2022
Terminating Action in Offline RL RLlib	3	218	June 29, 2023
Continuous action space and custom model RLlib	4	1527	July 17, 2021

Is it possible to implement Circular Decision in RLlib?

Related topics