Ray Python parallel processing of a deep learning model across multiple Docker containers

Hi,
I have 3 Docker containers, each containing a deep learning model (TensorFlow).
In each container I have a batch inference job over, say, 1000 images.
I have created a batch inference actor, and it parallelizes the batches over multiple CPUs, roughly like the sketch below.
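
A minimal sketch of what I mean (the model file name, the number of actors, and the `batches` list are placeholders for my real setup):

```python
import ray

ray.init()  # local Ray instance inside this container

@ray.remote(num_cpus=1)
class BatchInference:
    def __init__(self, model_path):
        import tensorflow as tf
        self.model = tf.keras.models.load_model(model_path)

    def predict(self, batch):
        return self.model.predict(batch)

# "model.h5" and `batches` are placeholders for the real model and image batches
actors = [BatchInference.remote("model.h5") for _ in range(4)]
futures = [actors[i % len(actors)].predict.remote(b) for i, b in enumerate(batches)]
results = ray.get(futures)
```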

Which scenario would be best suited for this task?

  1. Each Docker container runs its own separate Ray cluster (started from the Python code inside the container).

  2. A Ray cluster is configured locally on the host machine, and each Docker container connects to it.

  3. The Ray cluster runs as a separate Docker container, and each of the three containers connects to it.

  4. One of the three containers runs Ray (the head node), and all three containers connect to it.

Thanks

I think the most common pattern is 4. Take a look at this doc: Launching an On-Premise Cluster — Ray 3.0.0.dev0. When you deploy via Docker containers, make sure all the necessary ports are open: Configuring Ray — Ray 2.0.0
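
To make option 4 concrete, here is a rough sketch; the hostname `ray-head` and the ports are illustrative, adjust them to your network setup:

```python
import ray

# In the "head" container:   ray start --head --port=6379 --ray-client-server-port=10001
# In each worker container:  ray start --address=ray-head:6379
# Application code in any container then connects via Ray Client:
ray.init(address="ray://ray-head:10001")

@ray.remote
def whoami():
    import socket
    return socket.gethostname()

# The task may be scheduled on any node in the cluster
print(ray.get(whoami.remote()))
```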


Thanks, this helps!

I just realized that the dependencies in these three Docker containers would be different, e.g. TensorFlow, PyTorch, and some other framework.

On top of that, what if a 4th container with yet another set of dependencies is added?

I believe the Ray cluster would need those dependencies as well, right?
Is there a standard way to handle this? 🙂

It is recommended to have the same dependencies on all head and worker nodes. Alternatively, you can use runtime environments to sync dependencies per job, task, or actor: Environment Dependencies — Ray 3.0.0.dev0
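
For example, per-task (or per-actor) runtime environments let conflicting dependency sets coexist on the same cluster; the pinned versions below are just illustrative, and the hostname is from the earlier sketch:

```python
import ray

ray.init(address="ray://ray-head:10001")

@ray.remote(runtime_env={"pip": ["torch==2.2.2"]})
def torch_version():
    import torch
    return torch.__version__

@ray.remote(runtime_env={"pip": ["tensorflow==2.15.0"]})
def tf_version():
    import tensorflow as tf
    return tf.__version__

# Each task runs in its own isolated environment, installed on demand
print(ray.get([torch_version.remote(), tf_version.remote()]))
```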