I’m trying to train a single agent on multiple different environments simultaneously, using a custom simulator via ExternalEnv. I know that with gym.Env you can access env_config.worker_index, which allows you to implement this. I’m wondering if there is something similar for ExternalEnv, so that I can use the rollout worker indices to assign a different environment to each worker and run them all together.
Note: Each rollout worker will have a different setting of the same simulator.
I think I found the answer to this question. Please let me know if I got the concept wrong, but at least the code is working as intended.
At ray/rllib-env.rst at master · ray-project/ray · GitHub, there’s an example of how to “wrap” your env_config with EnvContext so that you can access worker_index and vector_index. The following is the example code from the link:
```python
from ray.tune.registry import register_env

def env_creator(env_config):
    return MyCustomSimulator(env_config)  # return an env instance

register_env("my_env", env_creator)
```
Honestly, I don’t really understand why having a function env_creator would cause env_config to be wrapped with EnvContext. My understanding is that some method in the registry wraps it when a callable is passed in? (If you know the reason, please point me to the code.)
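From what I can tell (this is my own reading of RLlib’s internals, not something the docs state), it isn’t the registry itself but the rollout worker that does the wrapping: before it calls your registered creator, it builds an EnvContext from the raw env_config dict plus its own index, roughly like this:

```python
# Paraphrased sketch of what the rollout worker does internally
# before calling your registered env_creator (not the exact RLlib source):
from ray.rllib.env.env_context import EnvContext

raw_env_config = {"some_key": "some_value"}  # whatever you passed as "env_config"
worker_index = 1                             # this particular worker's index

env_context = EnvContext(raw_env_config, worker_index=worker_index)
# env_creator(env_context) is then called, so your creator receives an
# EnvContext instead of the plain dict you supplied.
```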
However, you can now access the worker index with env_config.worker_index. For example:
```python
from ray import tune
from ray.rllib.env.external_env import ExternalEnv

class MyCustomSimulator(ExternalEnv):
    def __init__(self, env_config):
        ...
        print(env_config.worker_index)
        ...

tune.run(
    "DQN",
    config={
        "env": "my_env",
        "env_config": {...},
        "num_workers": 6,
    },
)
```
This will print out the numbers 1 through 6, though not necessarily in order. Anyway, the number is all I needed.
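To tie this back to the original goal of giving each rollout worker a different setting of the same simulator, a minimal sketch could look like the following (the WORKER_SETTINGS dict, its keys, and the placeholder spaces are all made up for illustration):

```python
import gym
from ray.rllib.env.external_env import ExternalEnv

# Hypothetical per-worker simulator settings, keyed by rollout worker
# index (1..num_workers); the names and values are illustrative only.
WORKER_SETTINGS = {
    1: {"sim_speed": 1.0},
    2: {"sim_speed": 2.0},
    3: {"sim_speed": 3.0},
}

class MyCustomSimulator(ExternalEnv):
    def __init__(self, env_config):
        # Pick this worker's setting based on its rollout worker index.
        self.setting = WORKER_SETTINGS.get(env_config.worker_index, {})
        super().__init__(
            action_space=gym.spaces.Discrete(2),       # placeholder space
            observation_space=gym.spaces.Discrete(2),  # placeholder space
        )

    def run(self):
        # Drive the external simulator configured with self.setting here.
        ...
```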
There’s one thing to be careful about, however: env_config in your MyCustomSimulator class is now of type EnvConfigDict.
Perfect. Yes, env_config is actually not just a dict, but an EnvContext object (from ray.rllib.env.env_context import EnvContext). It’s a (config) dict for the env, but it also has the following properties:

```python
self.worker_index = worker_index
self.num_workers = num_workers
self.vector_index = vector_index
self.remote = remote
```
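So inside your env you can use it both ways, e.g. (a small illustration; the "sim_speed" key is made up):

```python
from ray.rllib.env.env_context import EnvContext

ctx = EnvContext({"sim_speed": 2.0}, worker_index=3)
assert ctx["sim_speed"] == 2.0  # plain dict access still works
assert ctx.worker_index == 3    # plus the extra attributes listed above
```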
Is there also a way to access the Trainer’s (or the rollout worker’s) config, e.g. the discount factor gamma, in a custom env?
Hey @klausk55, sorry for the late response.
No, there actually is no way to access the Trainer’s config from inside your environment. This is by design, I believe, to keep the environment an independent entity that has no knowledge of where and by whom it’s being looped through.
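One workaround (my own suggestion, not an RLlib feature): since you write both the trainer config and env_config, you can duplicate whatever the env needs into env_config yourself:

```python
config = {
    "env": "my_env",
    "gamma": 0.95,
    # Duplicate any trainer settings the env needs into env_config,
    # so the env can read them via env_config["gamma"]:
    "env_config": {"gamma": 0.95},
    "num_workers": 6,
}
```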
Just add worker_index to your env’s constructor and Ray will pass the index to it:

```python
def __init__(self, worker_index, config=None):
    ...
```