How severe does this issue affect your experience of using Ray?
- High: It blocks me to complete my task.
I would like to save for each episode the observation space and action space with timestep granularity.
The way I’m trying to achieve this is using the on_postprocess_trajectory
callback. There, I have both postprocessed_batch
and original_batches
that provide access to a SampleBatch
object. In that object, I have what seems to be actions, but these actions are transformed.
Is there an easy way to get the environment representation of the action?