Hello,
I’ve implemented my simulator using the ExternalEnv API, i.e. my simulator env runs the episode loop itself: it queries the policy to obtain actions, steps the simulation, and logs the returns (see the code snippet below).
@override(ExternalMultiAgentEnv)
def run(self):
    obs = self.reset()  # reset the external simulator
    eid = self.start_episode()
    while True:
        action = self.get_action(eid, obs)  # query the policy for the next action
        # action = {obs.agent_id: 101}
        obs, reward, done, info = self.step(action)  # advance the external simulator
        self.log_returns(eid, reward, info)
        if done:
            self.end_episode(eid, obs)
            obs = self.reset()
            eid = self.start_episode()
My problem is that once the external simulator env has sampled enough steps and RLlib starts its first training step, the simulator env crashes: get_action raises an Empty exception after a 60-second timeout (“queue empty”). My guess is that the simulator env keeps querying the policy for an action while RLlib is already busy with the training step, so no action is put on the queue before the timeout expires. What can I do to prevent this problem?
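To make the discussion concrete: the only workaround I can think of is to catch the timeout and simply ask again, roughly like the sketch below. This assumes get_action really raises a plain queue.Empty on timeout (that is what the traceback suggests); retries and _get_action_with_retry are just names I made up for illustration, not part of the RLlib API.

import queue

def _get_action_with_retry(self, eid, obs, retries=5):
    # Hypothetical helper: retry get_action when the action queue times out,
    # e.g. because the policy is currently busy with a training step.
    for _ in range(retries):
        try:
            return self.get_action(eid, obs)
        except queue.Empty:
            continue  # no action arrived within the timeout, ask again
    raise RuntimeError("policy did not return an action after several retries")

This avoids the crash, but it feels like a hack. Is retrying like this actually safe, or is there a proper way (e.g. a config option or a longer timeout) to handle the case where training blocks the action queue?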