Remote Rollout Workers use local module from repo instead of "pip-installed" module

klausk55 · July 29, 2022, 1:48pm

Hello,

I have the following situation:
In my local repo I develop a Python package my_package and build a wheel from it, which I install via pip on my machine (…/site-packages/my_package). I also have a main script that imports my_package (or parts of it) and runs my RLlib training with Ray remote (rollout) workers (i.e. num_workers > 0).
Now suppose I change the source code of a module (e.g. config) in …/site-packages/my_package while the source code in my repo folder stays unchanged. The Ray remote workers then do rollouts using the source code from my repo folder, while other parts of my RLlib training use the altered source code (config) in …/site-packages/my_package. The mismatch between the two versions leads to an exception and the death of all workers.
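
One way to check which copy of a package a given process picks up is to print where the import system resolves it. The following is a minimal sketch using only the standard library; module_origin is a hypothetical helper name, not part of Ray or RLlib, and "json" stands in for my_package so the snippet runs anywhere.

```python
import importlib.util
import sys


def module_origin(name):
    """Return the file a top-level module name resolves to, or None if not found."""
    spec = importlib.util.find_spec(name)
    if spec is None:
        return None
    return spec.origin


if __name__ == "__main__":
    # Replace "json" with "my_package" to see whether the repo folder or
    # site-packages wins in this process.
    print(module_origin("json"))
    # The first entries of sys.path usually explain which copy is found first.
    print(sys.path[:3])
```

Calling the same helper from the driver and from inside a remote task should show whether the two sides resolve the package to different locations.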

I guess the problem is that the Ray remote workers cannot import my_package properly. It seems that the dependencies are not resolved correctly on my single-node cluster.
I found this in the docs and tried it in my main script as follows:

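...
import ray
import my_package  # the package whose code the workers should use

# Ship the imported module to all workers so they run the same code as the driver.
runtime_env = {"py_modules": [my_package]}
ray.init(log_to_driver=False, runtime_env=runtime_env)
...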

At first glance the issue is solved, but I do not know whether this is the recommended way to do it.

Lars_Simon_Zehnder · July 31, 2022, 8:41am

@klausk55, as far as I know this is the proposed way to do it.
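
As far as I know, py_modules also accepts a path to a local directory instead of a module object, which makes it explicit which copy gets shipped to the workers. A minimal sketch under that assumption; build_runtime_env is a hypothetical helper, not a Ray function, and it only builds the dict, so it runs without Ray installed.

```python
import importlib.util
import os


def build_runtime_env(package_name):
    """Build a runtime_env dict whose py_modules entry is the package's directory."""
    spec = importlib.util.find_spec(package_name)
    # Only packages (directories with submodules) have search locations.
    if spec is None or not spec.submodule_search_locations:
        raise ImportError(f"{package_name!r} is not an importable package")
    package_dir = os.path.abspath(list(spec.submodule_search_locations)[0])
    return {"py_modules": [package_dir]}
```

In the main script this would be passed as ray.init(runtime_env=build_runtime_env("my_package")). The directory it picks is whichever copy the driver itself resolves first, so the workers end up with the same code as the driver.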

klausk55 · August 9, 2022, 9:43am

Thanks for your feedback @Lars_Simon_Zehnder!
