How to normalize reward in PPO with new API stack?

mizhou0309 · June 4, 2025, 1:49am

**1. Severity of the issue: **
Medium: Significantly affects my productivity but can find a workaround.

2. Environment:

Ray version: 2.46.0

3. What happened:
I searched for the entire official document and also lots FAQ, but can’t find any example or instruction to normalize reward with new API. How to do this?

vittor · March 15, 2026, 5:08pm

Hi @mizhou0309, would you mind sharing your workaround for this problem? I am currently using environment wrappers, but honestly I don’t like this option that much.

Topic		Replies	Views
Normalize reward RLlib	4	2423	June 4, 2025
How to do the reward normalization in RLlib's PPO RLlib	2	3382	December 14, 2021
Wrong rewards: is there some reward normalization in PPO? Ray Tune	2	468	January 30, 2022
Proper implement of reward scaling in PPO RLlib	0	366	December 17, 2020
Unexpected dramatic drop in reward RLlib	8	1075	November 13, 2023

How to normalize reward in PPO with new API stack?

Related topics