CQL for discrete action space

Hey,

is there a specific reason why CQL is available only for a continuous action space? I have an offline RL usecase where I have only access to a fixed MDP dataset with discrete actions and no access to a simulator. Can I somehow use the Rllib implementation for CQL for this use case?

1 Like

At the moment our CQL doesn’t support discrete spaces. You can extend either the CQL torch or tf policy to implement similar logic as the SAC tf/torch policy in order to implement discrete spaces though.

You’d have to follow one of our custom policy examples to actually implement this.

1 Like

thanks for the answer, that makes things clear to me!

Hi @Tobias_Kropmanns, I wonder have you implemented CQL for discrete action? If so, would you be able to share the code? Thanks!

I am still need to have CQL for discrete action! Has anyone figured it out? :slight_smile:

Hey, you may have a look at d3rlpy; there is discrete cql implemented

Hi @Tobias_Kropmanns, thanks for your reply. My pipeline is entirely based on RLlib, so I would prefer to find a solution here. However, I have found it difficult to customize the current RLlib algorithms to build a discrete CQL.