This issue—where MARWIL appears to run but the dataset progress stays at 0.00 row/s—often indicates a problem with how the Parquet file encodes nested structures, especially with Dict action spaces. If the action column contains dicts with numpy arrays or non-JSON-serializable types, Ray Data may silently fail to read or parse the rows, resulting in no data being processed. This is a common pitfall when saving nested dicts with numpy arrays directly to Parquet, as Parquet expects flat, serializable data types (Ray Data Parquet limitations).
To resolve this, ensure that before writing to Parquet, all nested values in your action dicts are converted to native Python types (e.g., lists for Box, ints for Discrete), and avoid numpy arrays or objects. You can preprocess your data like this before saving:
def to_serializable_action(action):
    # Cast numpy types to native Python so Parquet stores a proper struct:
    # Discrete -> int, Box -> list of floats.
    return {
        "rotate": int(action["rotate"]),
        "thrust": [float(x) for x in action["thrust"]],
    }

for row in data:
    row["actions"] = to_serializable_action(row["actions"])
Then write the dataset as before. This ensures Ray Data can read and batch the rows, and MARWIL will process them. Would you like more detail or a code snippet for re-writing your dataset?