Initial action for Dict action space

mannyv · July 23, 2021, 1:06pm

rollout.py is used to generate rollouts on a policy it does not do any training so I don’t think this is where you problem lies, unless I misunderstand your question.

The first place I would look, and maybe you already have is in the reset function of your environment. This is where the first observation will come from. Is it somehow returning something different for the observation then step is?

If I were you I would also be concerned with those nan’s.

Are you handling the combination of Discrete and Continuous actions in a special way? I do not remember seeing rllib handle mixed action spaces but in all honesty it could be there and I have not encountered it.

Manny

Topic		Replies	Views
[RLlib] Is it possible to change action_space during training? RLlib	1	402	March 22, 2022
[rllib] wrong action dimensions when using dictionary action space RLlib	3	555	July 15, 2021
Action masking error RLlib	9	1687	February 6, 2023
Rllib with Tuple action space RLlib	1	570	December 14, 2022
RLlib and gym.space RLlib	4	717	November 14, 2021

Initial action for Dict action space

Related topics