Control sampling in action masking environment

PhilippWillms · March 24, 2024, 4:46pm

HI,
I am using ray 2.10 now and toch 2.21, further following the guide to implement action_masking model as outlined in the action_masking_example.

In first experiments, I get unexpectedöy many failed trials / died workers. This is due to expections raised by logic I encoded in the step() function. However, testing separately for envrionment verification seems o.k.

→ Hence, my question: How can I get to the env state or sequence of actions which was taken until the env crashed?

This would support reproduction of the error and hence finding the loose end in the env logic.

Topic		Replies	Views
Action masking redux RLlib	7	43	March 5, 2025
Log action sequence per episode RLlib	2	24	July 15, 2024
Action masking Problem RLlib	0	357	July 11, 2022
`training_step()` fails with custom environment RLlib	1	292	August 1, 2023
Questions and Confusion: Getting started with RLlib Configure Algorithm, Training, Evaluation, Scaling	0	40	February 19, 2025

Control sampling in action masking environment

Related topics