Freezing observation filter for a policy server

antoine-galataud · June 13, 2022, 9:50am

I noticed that when a policy is trained with "observation_filter": "MeanStdFilter", the filter's mean and standard deviation keep being updated while the policy is served (i.e. by a policy server): every call to ExternalEnv.get_action updates them.
I could reproduce this with Ray 1.6.0 and 1.13.0.

Is there a way to freeze the filter so as to get fully deterministic behavior?
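
To make the behavior concrete, here is a minimal, self-contained sketch of a running mean/std filter. It is not RLlib code: the RunningMeanStd class and its update flag are hypothetical names used only for illustration. With updates enabled, the same observation is normalized differently on each call; with updates disabled (the "frozen" behavior asked about), the output is repeatable.

```python
import math


class RunningMeanStd:
    """Hypothetical stand-in for a mean/std observation filter (not RLlib code)."""

    def __init__(self):
        self.n = 0       # number of observations seen so far
        self.mean = 0.0  # running mean
        self.m2 = 0.0    # running sum of squared deviations (Welford's method)

    def __call__(self, x, update=True):
        # Only touch the statistics when updates are enabled.
        if update:
            self.n += 1
            delta = x - self.mean
            self.mean += delta / self.n
            self.m2 += delta * (x - self.mean)
        # Sample standard deviation; fall back to 1.0 until two samples exist.
        std = math.sqrt(self.m2 / (self.n - 1)) if self.n > 1 else 1.0
        return (x - self.mean) / (std + 1e-8)


filt = RunningMeanStd()

# "Training" phase: statistics are updated on every observation.
for obs in [1.0, 2.0, 3.0, 4.0]:
    filt(obs)

# "Serving" with a frozen filter: same input, same output.
frozen_a = filt(10.0, update=False)
frozen_b = filt(10.0, update=False)

# "Serving" with a live filter: each call shifts the statistics,
# so the same input is normalized differently each time.
live_a = filt(10.0)
live_b = filt(10.0)

print(frozen_a == frozen_b)  # frozen filter is repeatable
print(live_a == live_b)      # live filter is not
```

The design choice in this sketch is to keep the statistics and the normalization in one object and gate only the update step, so a serving path can reuse the statistics learned during training without changing them.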

christy · June 14, 2022, 3:42am

Hi Antoine! Welcome to the Ray community! Thank you for the question!

Would you like to ask your question in RLlib Office Hours? Just add the Discuss link for your question to this doc: RLlib Office Hours - Google Docs

Thanks! Hope to see you there!
