'ModuleNotFoundError' when connecting to remote cluster

aeroaks · January 5, 2022, 11:50am

Hi,

I am trying to use ray for scaling out the ‘task’ in the my notebook. The library ‘mod_a’ is in development so we install that and another dependency ‘mod_b’ in pip’s ‘-e’ mode. The remote cluster is deployed on Azure and when we run test examples without external dependencies, then it works fine.

The connection to remote cluster is defined using the following statement:

ray.init(f"ray://127.0.0.1:{LOCAL_PORT}",
         runtime_env={
             "working_dir": "../src", # to access files
         })

When I try to to use a basic version of our library’s code in the notebook and try to put the object from ‘mod_a’ in the cluster, it throws the following error:

C:\ProgramData\Anaconda3\envs\icrm\lib\site-packages\ray\util\client\worker.py in put(self, val, client_ref_id)
    396                 "call 'put' on it (or return it).")
    397         data = dumps_from_client(val, self._client_id)
--> 398         return self._put_pickled(data, client_ref_id)
    399 
    400     def _put_pickled(self, data, client_ref_id: bytes):

C:\ProgramData\Anaconda3\envs\icrm\lib\site-packages\ray\util\client\worker.py in _put_pickled(self, data, client_ref_id)
    405         if not resp.valid:
    406             try:
--> 407                 raise cloudpickle.loads(resp.error)
    408             except (pickle.UnpicklingError, TypeError):
    409                 logger.exception("Failed to deserialize {}".format(resp.error))

ModuleNotFoundError: No module named 'mod_b'

The mod_b is installed in editable mode in the local conda’s pip environment from where the jupyter kernel is running.

When using py_modules to define mod_a, mod_b as dependencies, the put command complains about ModuleNotFoundError for the dependencies of the mod_b.

How do I resolve this error?

Topic		Replies	Views
ModuleNotFoundError from the cluster Ray Core	4	2590	January 5, 2022
Couldn't run in remote cluster Ray Core	3	436	July 13, 2023
"ModuleNotFoundError: No module named in" when connecting in client mode Ray Tune	3	2334	November 15, 2021
[Core] [Client] Cloud pickle library missing in ray client dependency Ray Client	2	1099	April 23, 2021
Ray local cluster error Ray Core	0	325	April 4, 2022

'ModuleNotFoundError' when connecting to remote cluster

Related topics