[RLlib] How does setting `training_enabled` to False in external_env._ExternalEnvEpisode trigger no training?

Hi Ray Team,

My team is currently exploring the feasibility of using the policy serving pattern to conduct large-scale inference. We noticed that the `training_enabled` parameter in `external_env._ExternalEnvEpisode()` may be a way to achieve this by repurposing the training setup for inference only. But looking at the API, we can't trace how `training_enabled` is used during training to determine whether policy updates are disabled. We have searched the entire Ray repo for the keyword, but still don't have any clue.

Would you mind shedding some light on where and how `training_enabled` is used? Some insights on performing large-scale inference would be helpful as well.
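For reference, here is a minimal sketch of how we are setting the flag from the client side. It assumes a `PolicyServerInput` is already listening (the address is a placeholder, and the observation is a stand-in for our own environment's data):

```python
from ray.rllib.env.policy_client import PolicyClient

# Assumes a PolicyServerInput is already listening at this address
# (placeholder host/port).
client = PolicyClient("http://localhost:9900", inference_mode="remote")

# We expected training_enabled=False to make the server skip policy
# updates for this episode's experiences.
episode_id = client.start_episode(training_enabled=False)

obs = [0.0, 0.0, 0.0, 0.0]  # placeholder observation for our env
action = client.get_action(episode_id, obs)
client.log_returns(episode_id, reward=0.0)
client.end_episode(episode_id, obs)
```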

Thank you,
Heng

Actually, you are right. I think setting this does nothing except publish this information in the "infos" dict returned by the `step` method on the server side. So the server then still has to respect that information, which afaik it doesn't do (it simply ignores it).
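To illustrate: the episode's send logic only tags the per-step info dict, roughly like this (a paraphrased sketch, not the exact RLlib source):

```python
# Paraphrased sketch (not the exact RLlib source) of how the flag
# travels: the episode only tags the info dict that the server-side
# env returns from step(); nothing downstream checks it.

def build_step_item(obs, reward, done, info, training_enabled):
    """Mimics what _ExternalEnvEpisode queues up for the server's step()."""
    item = {"obs": obs, "reward": reward, "done": done, "info": dict(info)}
    if not training_enabled:
        # The only effect of training_enabled=False today:
        item["info"]["training_enabled"] = False
    return item

item = build_step_item(obs=[0.0], reward=1.0, done=False, info={},
                       training_enabled=False)
assert item["info"] == {"training_enabled": False}
```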

This will require a fix on the PolicyServer side.
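What the fix would have to do, roughly: filter the tagged timesteps out of the sample batch before it reaches the learner. A purely hypothetical sketch on a plain dict-of-columns batch (none of these names exist in RLlib):

```python
# Hypothetical sketch of the missing PolicyServer-side fix: drop any
# timestep whose info dict carries training_enabled=False before the
# batch reaches the learner. `batch` is assumed to be a dict of
# equal-length columns, one of which is "infos".

def drop_non_training_steps(batch):
    keep = [
        i for i, info in enumerate(batch["infos"])
        if info.get("training_enabled", True)  # train by default
    ]
    return {key: [col[i] for i in keep] for key, col in batch.items()}

batch = {
    "obs": [[0.0], [1.0], [2.0]],
    "actions": [0, 1, 0],
    "infos": [{}, {"training_enabled": False}, {}],
}
filtered = drop_non_training_steps(batch)
assert len(filtered["obs"]) == 2  # the tagged step was dropped
```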

Thanks for raising this @heng2j! 🙂

No problem @sven1977. Glad we were able to bring this to the team's attention. I will file an official GitHub issue in the repo.

Hi @sven1977, here is the bug report for this issue
