Question about logging.info in ray actor

I’ve been experimenting with different parallelization packages, including multiprocessing, billiard, and Ray.

Both multiprocessing and billiard (a fork of multiprocessing) provide a Pool object for mapping a function over a list of items. However, logs emitted in the child processes never reach the log monitor, while those emitted in the parent do. (I am using logging.info to track the status of my code in the child processes.) It may be because the logging object holds a process lock that is shared between processes, which could cause a deadlock and leave some child processes unable to exit cleanly. The good news is that logging inside a Ray Actor somehow does not have this problem.
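Roughly, the setup is like the sketch below (the function name `work` and the inputs are illustrative, not my actual code): logging.info in the parent always shows up, but the records emitted inside the Pool workers may never reach the configured handlers.

```python
import logging
import multiprocessing as mp

logging.basicConfig(level=logging.INFO)

def work(x):
    # Runs in a child process; depending on the start method and handlers,
    # this record may never reach the parent's log output.
    logging.info("processing %s", x)
    return x * x

if __name__ == "__main__":
    logging.info("starting pool in the parent process")  # this always appears
    with mp.Pool(4) as pool:
        results = pool.map(work, range(8))
    logging.info("results: %s", results)
```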

I wonder what the fundamental difference is between a process and a Ray actor. What is the benefit of using Ray over the traditional multiprocessing approaches in Python (e.g., multiprocessing / os.fork …)? Also, how does Ray handle the logging issue mentioned above?

Hey @jeffrey82221, good questions.

I wonder what the fundamental difference is between a process and a Ray actor.

They’re really quite similar. Ray launches each actor in a separate process (tasks are also run in separate processes).
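A quick way to see this for yourself is to compare PIDs from the driver, an actor, and a task (a small illustrative check; the names `PidActor` and `pid_task` are just for this sketch):

```python
import os
import ray

ray.init()

@ray.remote
class PidActor:
    def pid(self):
        return os.getpid()

@ray.remote
def pid_task():
    return os.getpid()

actor = PidActor.remote()
print("driver pid:", os.getpid())            # the process running this script
print("actor pid: ", ray.get(actor.pid.remote()))  # a dedicated worker process
print("task pid:  ", ray.get(pid_task.remote()))   # also a worker process
```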

What is the benefit of using Ray over the traditional multiprocessing approaches in Python (e.g., multiprocessing / os.fork …)?

One advantage is that Ray can scale out to multiple machines. This includes the ray.util.multiprocessing module (see the sketch below). Another is that you can build a larger distributed application using Ray tasks/actors/libraries, whereas multiprocessing is usually used to parallelize a single computation.
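For example, the ray.util.multiprocessing Pool keeps the familiar Pool.map interface, but the workers are Ray actors and can run across a cluster (the `square` function here is just a placeholder):

```python
from ray.util.multiprocessing import Pool

def square(x):
    return x * x

if __name__ == "__main__":
    pool = Pool()  # starts Ray locally if it isn't already running
    print(pool.map(square, range(8)))
```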

Regarding the logging issue, Ray pipes all the stdout/stderr of its worker processes to the driver by default (unless you set ray.init(log_to_driver=False)), which might explain the difference.
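So a minimal sketch like the one below (actor and method names are illustrative) should show the worker's logging.info output in the driver's console, because the worker process's stdout/stderr is streamed back to the driver:

```python
import logging
import ray

ray.init()  # log_to_driver=True is the default

@ray.remote
class Worker:
    def __init__(self):
        # Configure logging inside the worker process itself.
        logging.basicConfig(level=logging.INFO)

    def run(self, x):
        logging.info("worker processing %s", x)  # shows up in the driver's output
        return x * x

w = Worker.remote()
print(ray.get(w.run.remote(3)))
```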