Hey,
is there a specific reason why CQL is available only for a continuous action space? I have an offline RL usecase where I have only access to a fixed MDP dataset with discrete actions and no access to a simulator. Can I somehow use the Rllib implementation for CQL for this use case?
1 Like
At the moment our CQL doesn’t support discrete spaces. You can extend either the CQL torch or tf policy to implement similar logic as the SAC tf/torch policy in order to implement discrete spaces though.
You’d have to follow one of our custom policy examples to actually implement this.
1 Like
thanks for the answer, that makes things clear to me!
Hi @Tobias_Kropmanns, I wonder have you implemented CQL for discrete action? If so, would you be able to share the code? Thanks!
I am still need to have CQL for discrete action! Has anyone figured it out?
Hey, you may have a look at d3rlpy; there is discrete cql implemented
Hi @Tobias_Kropmanns, thanks for your reply. My pipeline is entirely based on RLlib, so I would prefer to find a solution here. However, I have found it difficult to customize the current RLlib algorithms to build a discrete CQL.