Normalizing observations in PPO+LSTM

Rohan138 · May 23, 2023, 1:08am

No, you shouldn’t have to modify your spaces, RLlib will handle that. For compute_single_action, make sure you use algo.compute_single_action and not the policy. method, since the former will automatically handle the filters. See: How to correctly apply observation normalization?

Topic		Replies	Views
How to correctly apply observation normalization? RLlib	2	1471	November 19, 2022
Normalizing Observations Configure Algorithm, Training, Evaluation, Scaling	5	1348	December 22, 2022
MeanStdFilter Observation filter also normalizes action mask RLlib	3	1011	December 21, 2022
[Rllib] compute_single_action() with an LSTM-PPO trainer fails RLlib	1	968	February 3, 2023
Inconsistent actions from Algorithm.compute_single_action RLlib	3	390	June 14, 2023

Normalizing observations in PPO+LSTM

Related topics