Saving episode trajectories during training

mzat · July 13, 2023, 3:10pm

How severe does this issue affect your experience of using Ray?

High: It blocks me to complete my task.

I would like to save for each episode the observation space and action space with timestep granularity.
The way I’m trying to achieve this is using the on_postprocess_trajectory callback. There, I have both postprocessed_batch and original_batches that provide access to a SampleBatch object. In that object, I have what seems to be actions, but these actions are transformed.
Is there an easy way to get the environment representation of the action?

Topic		Replies	Views
Post process trajectory with full episode RLlib	1	407	October 17, 2023
Save played trajectories in memory RLlib	1	432	August 17, 2022
RLlib Batch Postprocessing has steps from other trajectories RLlib	5	369	April 22, 2024
Skipping some actions RLlib	2	320	May 9, 2022
Log action sequence per episode RLlib	2	25	July 15, 2024

Saving episode trajectories during training

Related topics