How do the bounds on an observation space affect convergence?

Hey, I’m an RL beginner and have a general RL question.

Suppose I have an N-dimensional continuous observation space created using a gym.spaces.Box object. I can specify the low and high for each element. Let's say that in practice, every element of the observation will range from -5 to 5.
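For concreteness, something like this (N=4 is just a placeholder):

```python
import numpy as np
from gym.spaces import Box

# Bounds that match the actual data range
obs_space_tight = Box(low=-5.0, high=5.0, shape=(4,), dtype=np.float32)

# Much wider bounds than the data will ever use
obs_space_wide = Box(low=-1000.0, high=1000.0, shape=(4,), dtype=np.float32)
```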

How does it affect convergence if I create the Box with the correct bounds, i.e. [-5, 5], versus a much wider range, say [-1000, 1000]? If it does matter, what is the theory/reasoning behind it?

Thanks!

Observations should be normalized anyway (RLlib can mean/std-filter them for you automatically if you enable its `MeanStdFilter` observation filter).
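If you want RLlib to do that filtering, it's a single config key. A minimal sketch using the classic config dict; PPO, the stop criterion, and Pendulum-v1 are just placeholders:

```python
from ray import tune

tune.run(
    "PPO",
    stop={"timesteps_total": 100_000},
    config={
        "env": "Pendulum-v1",  # placeholder env
        # Running mean/std normalization of observations
        # (the default is "NoFilter", i.e. no normalization):
        "observation_filter": "MeanStdFilter",
    },
)
```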

Note that the Box bounds themselves are just metadata: they don't change the values your environment actually emits, and they only affect training if something (like a normalization wrapper) uses them. But generally speaking, the parameters and gradients of a neural network scale with its inputs and outputs, so larger inputs are simply compensated by smaller weights when optimizing for the same objective. A lot of other things play into this dynamic, though, for example gradient clipping or the chosen floating-point precision. You only have to deal with any of this if you don't normalize your observations, which RLlib, by default, doesn't.
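If you'd rather normalize yourself, a small gym.ObservationWrapper that uses the Box bounds is enough. A sketch assuming the true bounds are known and finite; Pendulum-v1 is again just a placeholder:

```python
import gym
import numpy as np

class RescaleObservation(gym.ObservationWrapper):
    """Linearly map observations from [low, high] to [-1, 1]."""

    def __init__(self, env):
        super().__init__(env)
        self.low = env.observation_space.low
        self.high = env.observation_space.high
        self.observation_space = gym.spaces.Box(
            low=-1.0, high=1.0,
            shape=env.observation_space.shape,
            dtype=np.float32,
        )

    def observation(self, obs):
        # Scale to [0, 1], then shift to [-1, 1].
        scaled = (obs - self.low) / (self.high - self.low)
        return (2.0 * scaled - 1.0).astype(np.float32)

env = RescaleObservation(gym.make("Pendulum-v1"))
```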

Cheers