How severely does this issue affect your experience of using Ray?
- Medium: It contributes to significant difficulty in completing my task, but I can work around it.
I am just starting with Ray RLlib, so forgive any novice mistakes. I am building a number of custom methods for offline RL, and I am unsure how to implement certain actions. Consider a toy example with X states: based on the current observation, the agent can either continue to the next state or terminate. Think of medical testing, where it is important not to perform tests needlessly. How can this “terminate” action be included in the action space in an offline setting?
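To make the toy example concrete, here is a minimal sketch of the kind of logged data I have in mind. It assumes a two-action discrete space (0 = continue, 1 = terminate), a chain of states indexed from 0, and a per-step cost of -1 for continuing. The names `CONTINUE`, `TERMINATE`, and `log_episode`, the reward values, and the transition keys are all hypothetical. Nothing here is RLlib API; it is plain Python.

```python
import random

# Hypothetical encoding of the two actions in a discrete action space.
CONTINUE, TERMINATE = 0, 1


def log_episode(num_states, stop_prob, rng):
    """Roll out a random behavior policy on a chain of `num_states` states.

    Returns a list of transition dicts. The episode ends when the agent
    picks TERMINATE or reaches the last state of the chain.
    """
    transitions = []
    state = 0
    while True:
        # Random behavior policy: stop with probability `stop_prob`.
        action = TERMINATE if rng.random() < stop_prob else CONTINUE
        if action == TERMINATE:
            # Stopping ends the episode immediately; the state does not change.
            next_state, reward, done = state, 0.0, True
        else:
            next_state = state + 1
            reward = -1.0  # assumed cost of one more test
            done = next_state >= num_states - 1
        transitions.append(
            {
                "obs": state,
                "action": action,
                "reward": reward,
                "next_obs": next_state,
                "terminated": done,
            }
        )
        if done:
            return transitions
        state = next_state


if __name__ == "__main__":
    rng = random.Random(0)
    for step in log_episode(num_states=5, stop_prob=0.3, rng=rng):
        print(step)
```

In this sketch, “terminate” is an ordinary action index, and its effect shows up only through the `terminated` flag on the logged transition. What I am unsure about is whether this is the right way to represent it for offline training in RLlib.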