Observation and Reward Normalization

The observation and rewards need to be normalized if the the observation values are over 1000 and rewards sometimes over 100 ? Or rllib normalizes the observation and rewards ?

  • None: Just asking a question out of curiosity