Hello, I have a registered custom environment and an environment creation function I pass to the algorithm config during initialization, similar to:
config = PPOConfig().environment(env='MyEnv', env_config={'beta': 1})
algo = config.build()
MyEnv has an attribute “beta”, which I would like to set dynamically during training (it changes how the rewards are calculated).
I have a loop over training epochs during which I run algo.train(). I’d like to change “beta” on all workers periodically, but not necessarily every epoch. How can I do this?
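To make the intended behavior concrete, here is a minimal sketch of what I am after, written without RLlib so it runs on its own. Everything in it is a hypothetical stand-in: the toy `MyEnv`, the `compute_reward` method, the `beta_for_epoch` schedule, and the plain list `envs` that plays the role of the environment copies living on the workers. It is only meant to illustrate the update pattern, not how RLlib does it.

```python
class MyEnv:
    """Toy stand-in for the real environment (hypothetical, no RLlib dependency)."""

    def __init__(self, env_config):
        # beta is read from env_config, mirroring env_config={'beta': 1}
        self.beta = env_config.get("beta", 1)

    def compute_reward(self, base_reward, penalty):
        # Assumption for illustration: beta scales a penalty term in the reward
        return base_reward - self.beta * penalty


def beta_for_epoch(epoch, update_every=10, step=0.5, initial=1.0):
    """Hypothetical schedule: beta grows by `step` every `update_every` epochs."""
    return initial + step * (epoch // update_every)


# Stands in for the environment copies held by the workers
envs = [MyEnv({"beta": 1}) for _ in range(3)]

for epoch in range(25):
    # In the real loop, algo.train() would be called here
    if epoch % 10 == 0:  # update periodically, not every epoch
        new_beta = beta_for_epoch(epoch)
        for env in envs:  # the step I need an RLlib equivalent for
            env.beta = new_beta
```

In this sketch the inner `for env in envs` loop is trivial because all environments live in one process. The open question is what replaces that loop when the environments are spread across RLlib workers.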
Thanks.