`RolloutWorker` does not properly initialize`policy_map`

avnishn · March 9, 2022, 10:43pm

Ah so the problem here is in how you are trying to get your policy map.

The worker is now a remote object, so you can’t directly get any of its attributes.

You can however remotely call any of its functions, and we wrote a function on the worker called apply(worker, fn, *args, **kwargs), for the exact use case that you have here.

github.com

ray-project/ray/blob/c844c706bffd7868fb904384902f3b0b7d25f2a1/rllib/evaluation/rollout_worker.py#L1595

      
        
                if self.env is not None:
                    self.async_env.stop()
                # Close all policies' sessions (if tf static graph).
                for policy in self.policy_map.values():
                    sess = policy.get_session()
                    # Closes the tf session, if any.
                    if sess is not None:
                        sess.close()
            
            
@DeveloperAPI
            def apply(
                self,
                func: Callable[["RolloutWorker", Optional[Any], Optional[Any]], T],
                *args,
                **kwargs,
            ) -> T:
                """Calls the given function with this rollout worker instance.
            
            
    Useful for when the RolloutWorker class has been converted into a
                ActorHandle and the user needs to execute some functionality (e.g.
                add a property) on the underlying policy object.

so how you can use this function is by doing something like this:

worker = RolloutWorker.as_remote().remote(env_creator=lambda x: gym.make("MountainCarContinuous-v0"), policy_spec=RandomPolicy)
worker.apply.remote(lambda _worker: print(_worker.policy_map))

The reason I didn’t do something like this:
policy_map = ray.get(worker.apply.remote(lambda _worker: _worker.policy_map)), which is how I would normally retrieve attributes of a rollout worker,
is because the policy_map object has as threading.RLock() object stored under the attribute self._lock. This lock can’t be serialized by ray, so it, and the entire policy_map cannot be transferred between ray actors via ray.get calls. We require this lock for some of our algorithms.

Lemme know if you have any more questions.

And if you have any questions about ray remote actors (e.g. how to interact with them, and what operations are possible with them) you can refer to this this doc page.

Topic		Replies	Views
RLLib Rollout Worker Init Configure Algorithm, Training, Evaluation, Scaling	2	192	March 13, 2024
Get_objects of worker.py timeout Ray Tune	3	456	June 19, 2022
ValueError: RolloutWorker has no `input_reader` object! RLlib	8	559	March 6, 2024
Remote Rollout Workers use local module from repo instead of "pip-installed" module RLlib	2	455	August 9, 2022
A RolloutWorker died computing advantages RLlib	0	27	July 31, 2024

`RolloutWorker` does not properly initialize`policy_map`

Related topics