How to assign actors to specific machines?

godsakurapeng · January 8, 2024, 8:11am

I’m not sure if this is a bug.
I have 4 machines with two gpu. At the same time, I have 5 tasks that require [2,2,2,1,1] gpu.I have set num_gpus.
The situation I envision is that three machines deploy two-gpu tasks, and one machine deploys two one-gpu tasks.
However, in fact, this allocation seems to be random. Sometimes it is allocated according to the scenario I expected. Sometimes two one-gpu tasks are distributed on two machines, and a two-gpu task is also allocated on two machines.

During initialization, you can see that two one-gpu tasks are distributed on two machines.

Because of the problem below, there are no pictures here. I’ll see if I can add them in the comment area.

Sorry, new users can only put one embedded media item in a post.

When I call a two-gpu GPU infer task, I can see that the task is running on two machines.
Infer is much slower due to the need for communication between machines

So, can ray assign actors to specific machines? Or is there any other way? I’m looking forward to the response

godsakurapeng · January 8, 2024, 8:12am

(add pictures

Jules_Damji · January 8, 2024, 10:51pm

@godsakurapeng Consider using placement groups to gang schedule them for resource affinity.

Some simple example code: misc-code/py/ray/placement_groups at master · dmatrix/misc-code · GitHub

Topic		Replies	Views
How to distribute actors to multiple GPUs Ray Core	6	1101	May 5, 2022
How to assign a specific actor to a specific GPU Ray Core	15	1478	February 16, 2021
How to auto assign actors to different GPUs in ray.data.map_batches Ray Data	2	50	November 26, 2024
How can I assign a ray actor to a specific gpu?	1	58	September 4, 2024
Spread accross several fractional GPUs or 1< num_gpus < 2 Ray Core	1	338	February 13, 2024

How to assign actors to specific machines?

Related topics