Terminating Action in Offline RL

kris · June 27, 2023, 12:30am

How severe does this issue affect your experience of using Ray?

Medium: It contributes to significant difficulty to complete my task, but I can work around it.

Hi,
I am just starting with Ray RLlib so forgive me for any novice mistakes. I am working on building a lot of custom methods to perform offline RL. My difficulty is in how to implement certain actions. Consider a toy example, with X states, based on the current observation, the agent can decide to continue onwards to the next state or to terminate. Perhaps think along the lines of medical testing where it is important to not needlessly perform testing when it is not necessary. How can this action of “terminate” be implemented into the possible action space in an offline setting?

Rohan138 · June 29, 2023, 12:26am

Can your agent choose to go to any of the states from any of the other states? If so, you could model this as an X+1 discrete action space problem, where the extra action is the terminal one.

kris · June 29, 2023, 1:22am

Thank you. No, it is a sequential chain of states with two actions (move forward and terminate). Such as state 1 → state 2 → state 3. State 2 can not go back to state 1. And after state 3 it must terminate.

Rohan138 · June 29, 2023, 1:42am

So a Discrete(2) action space? Basic Usage - Gymnasium Documentation

Topic		Replies	Views
Rllib extremely complex action space Possible? RLlib	1	256	May 4, 2022
Action Masking Model: Deterministic selection of the best action RLlib	0	10	August 11, 2024
Actions created by Policy being modified before input to environment RLlib	4	275	March 15, 2023
Negative actions RLlib	0	15	August 7, 2024
Skipping some actions RLlib	2	314	May 9, 2022

Terminating Action in Offline RL

Related topics