Worker pool size in debug log not correct

Hi,
I started Ray on my local Mac, which has 12 logical cores.
By setting

export RAY_BACKEND_LOG_LEVEL=debug

I can see more logs in /tmp/ray/session_latest/logs/raylet.out
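
Equivalently, a sketch of setting it from Python (this assumes the raylet launched by ray.init() inherits the environment of the calling process, and the variable must be set before Ray starts):

import os

# Must be set before Ray starts; assumes the raylet launched by
# ray.init() inherits the environment of the calling process.
os.environ["RAY_BACKEND_LOG_LEVEL"] = "debug"

import ray

ray.init()  # logs then appear in /tmp/ray/session_latest/logs/raylet.out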

The weird thing is that the number of workers in the worker pool doesn’t seem to change when worker processes are created.

[2022-01-08 19:59:00,872 I 14563 5040023] worker_pool.cc:449: Started worker process of 1 worker(s) with pid 14571, the token 1
[2022-01-08 19:59:00,873 D 14563 5040023] worker_pool.cc:257: Starting new worker process, current pool has 0 workers

Should I expect the number to increase to 12?

> The weird thing is that the number of workers in the worker pool doesn’t seem to change when worker processes are created.

Those log entries indicate that the current pool size is 0 and that one new worker process is in the middle of starting, so I don’t think the two log statements are inconsistent with each other.

> Should I expect the number to increase to 12?

I believe that prestarting num_cpus workers at node startup time only happens if you start Ray on a single node (your local Mac) and if no runtime environment is given at start time. I assume you’re not giving a runtime environment in ray.init()?
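
For reference, a minimal sketch of the two cases (the runtime_env value below is just an illustrative example):

import ray

# No runtime environment given: eligible for prestarting workers.
ray.init()

# With a runtime environment, workers are launched into that environment
# on demand, so (per the above) no prestart happens. Illustrative value:
# ray.init(runtime_env={"pip": ["requests"]})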

Also, are you seeing those log entries at node startup time (i.e., at ray.init()), or when submitting a task?

> Those log entries indicate that the current pool size is 0 and that one new worker process is in the middle of starting, so I don’t think the two log statements are inconsistent with each other.

That makes sense.

> I believe that prestarting num_cpus workers at node startup time only happens if you start Ray on a single node (your local Mac)

If I start Ray with ray start on a cluster (two nodes), do you mean that no worker processes are prestarted?

> I assume you’re not giving a runtime environment in ray.init()?

No

> Also, are you seeing those log entries at node startup time (i.e., at ray.init()), or when submitting a task?

At node startup time, i.e., at ray.init().
When submitting a task, raylet.out has the following logs:

[2022-01-10 11:56:30,493 D 14563 5040023] cluster_task_manager.cc:478: Queuing and scheduling task d92fce596cade5cd17b0cea61222523a1df0bddd01000000
[2022-01-10 11:56:30,493 D 14563 5040023] cluster_task_manager.cc:77: Scheduling pending task d92fce596cade5cd17b0cea61222523a1df0bddd01000000
[2022-01-10 11:56:30,493 D 14563 5040023] scheduling_policy.cc:113: Node 734144275483469687 is available for request  {(10000) (0) (0) (0) }  []
 with critical resource utilization 0 based on local view  {
	CPU: (120000:120000)
	MEM: (107432263680000:107432263680000)
	GPU: (0:0)
	OBJECT_STORE_MEM: (53716131840000:53716131840000)
	-1:(10000:10000)
}

[2022-01-10 11:56:30,493 D 14563 5040023] cluster_resource_scheduler.cc:262: Scheduling decision. forcing spillback: 0. Best node: 734144275483469687 0078b396f8673cd20ba4618b5619014b663ff36296adcf957b05bce5, is infeasible: 0
[2022-01-10 11:56:30,493 D 14563 5040023] cluster_task_manager.cc:146: No args, task can be dispatched d92fce596cade5cd17b0cea61222523a1df0bddd01000000
[2022-01-10 11:56:30,493 D 14563 5040023] cluster_task_manager.cc:568: RayTask d92fce596cade5cd17b0cea61222523a1df0bddd01000000 has args of size 0
[2022-01-10 11:56:30,493 D 14563 5040023] cluster_task_manager.cc:570: Size of pinned task args is now 0
[2022-01-10 11:56:30,493 D 14563 5040023] worker_pool.cc:1086: Pop worker for task d92fce596cade5cd17b0cea61222523a1df0bddd01000000 task name {type=PythonFunctionDescriptor, module_name=__main__, class_name=, function_name=fun, function_hash=15b1bb967e104b9ab04cc38f4714aba6}
[2022-01-10 11:56:30,493 D 14563 5040023] cluster_task_manager.cc:256: Dispatching task d92fce596cade5cd17b0cea61222523a1df0bddd01000000 to worker 7dfc1e0e15023caed4a34f99120ff5e38bd6e62f33bab81d5237d2bc
[2022-01-10 11:56:30,502 D 14563 5040023] node_manager.cc:997: [Worker] Message NotifyDirectCallTaskUnblocked(12) from worker with PID 14577
[2022-01-10 11:56:30,502 D 14563 5040023] dependency_manager.cc:148: Canceling get request for worker 7dfc1e0e15023caed4a34f99120ff5e38bd6e62f33bab81d5237d2bc
[2022-01-10 11:56:30,506 D 14563 5040023] node_manager.cc:1888: Finished task d92fce596cade5cd17b0cea61222523a1df0bddd01000000
[2022-01-10 11:56:30,506 D 14563 5040023] dependency_manager.cc:94: Canceling wait request for worker 7dfc1e0e15023caed4a34f99120ff5e38bd6e62f33bab81d5237d2bc
[2022-01-10 11:56:30,506 D 14563 5040023] dependency_manager.cc:148: Canceling get request for worker 7dfc1e0e15023caed4a34f99120ff5e38bd6e62f33bab81d5237d2bc
[2022-01-10 11:56:30,507 D 14563 5040023] node_manager.cc:997: [Worker] Message NotifyUnblocked(10) from worker with PID nil
[2022-01-10 11:56:30,507 D 14563 5040023] dependency_manager.cc:148: Canceling get request for worker 01000000ffffffffffffffffffffffffffffffffffffffffffffffff
[2022-01-10 12:02:43,534 I 14563 5040023] node_manager.cc:628: Sending Python GC request to 13 local workers to clean up Python cyclic references.
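
For reference, the submitted task was a trivial no-argument function along these lines (reconstructed from the function_name=fun and args-of-size-0 entries above):

import ray

ray.init()

@ray.remote
def fun():  # matches function_name=fun in the pop-worker log entry
    return 1

ray.get(fun.remote())  # no arguments, hence "RayTask ... has args of size 0"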

My current understanding is that no prestarting takes place on a multi-node cluster until the first task is submitted, at which point num_cpus workers (in your case, 12) will be prestarted. On a single-node (local) cluster, this prestarting happens at node startup time (technically, when your driver/client connects to the cluster).

FYI, I think I had the ordering of these log statements mixed up:

> Those log entries indicate that the current pool size is 0 and that one new worker process is in the middle of starting, so I don’t think the two log statements are inconsistent with each other.

It actually looks like the “current pool size” may lag behind the number of started worker processes: starting a worker process is asynchronous, and the pool size is only updated when a process eventually finishes starting. So it’s possible that the launch of all 12 processes is initiated (each hitting that “current pool size” log statement) before any of the 12 processes finishes starting and the pool size is updated.
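
As a toy model of that lag (this is not Ray’s actual implementation, just an illustration):

import threading
import time

pool_size = 0  # analogous to the "current pool has N workers" counter
lock = threading.Lock()

def start_worker_process():
    # The log statement reads the counter before the new process has registered.
    print(f"Starting new worker process, current pool has {pool_size} workers")

    def register_when_started():
        global pool_size
        time.sleep(0.1)  # simulate asynchronous process startup
        with lock:
            pool_size += 1

    threading.Thread(target=register_when_started).start()

# All 12 launches can log "0 workers" before any process finishes starting.
for _ in range(12):
    start_worker_process()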

Do you see a bunch of “Started worker process of” log messages? Running cat /tmp/ray/session_latest/logs/raylet.out | grep "Started worker process of" should give a good indication; you should see 12 of those.

> Do you see a bunch of “Started worker process of” log messages? Running cat /tmp/ray/session_latest/logs/raylet.out | grep "Started worker process of" should give a good indication; you should see 12 of those.

Yes

That’s my guess, then: the worker pool is correctly prestarting workers, but because the current pool size isn’t updated until the worker processes have finished starting, it keeps logging a worker pool size of 0.

Would changing this to “current pool has 0 workers already running, and 10 workers starting” be more informative?


> Would changing this to “current pool has 0 workers already running, and 10 workers starting” be more informative?

Yes, that would be better.
