How to normalize rewards in PPO with the new API stack?

**1. Severity of the issue:**
Medium: Significantly affects my productivity but can find a workaround.

**2. Environment:**

  • Ray version: 2.46.0

**3. What happened:**
I searched the entire official documentation and many FAQs, but couldn't find any example or instructions for normalizing rewards with the new API stack. How can this be done?
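
To make the goal concrete, this is roughly the behavior I'm after, written as a library-agnostic sketch in plain Python. `RunningRewardNormalizer` is a hypothetical helper name, not an RLlib API, and it assumes simple running mean/std normalization of per-step rewards; what I can't work out is where something like this should plug into the new API stack.

```python
import math


class RunningRewardNormalizer:
    """Hypothetical helper (not part of RLlib): normalizes rewards using a
    running mean and standard deviation tracked with Welford's algorithm."""

    def __init__(self, epsilon: float = 1e-8):
        self.count = 0
        self.mean = 0.0
        self.m2 = 0.0  # running sum of squared deviations from the mean
        self.epsilon = epsilon  # avoids division by zero when std is 0

    def update(self, reward: float) -> None:
        # Welford's online update of the mean and squared deviations.
        self.count += 1
        delta = reward - self.mean
        self.mean += delta / self.count
        self.m2 += delta * (reward - self.mean)

    @property
    def std(self) -> float:
        # Population standard deviation of all rewards seen so far.
        if self.count == 0:
            return 0.0
        return math.sqrt(self.m2 / self.count)

    def normalize(self, reward: float) -> float:
        # Center and scale using the current running statistics.
        return (reward - self.mean) / (self.std + self.epsilon)

    def __call__(self, reward: float) -> float:
        # Update the statistics with this reward, then normalize it.
        self.update(reward)
        return self.normalize(reward)
```

Usage would look like `normalizer = RunningRewardNormalizer()` followed by `r_norm = normalizer(r)` for every reward coming out of the environment, before the reward is used for training.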