Dataset and task compute pipelining

I was wondering how I could use dataset pipelining and task compute pipelining.

I currently have two tasks (each using one CPU) and I want each of them to run on a specific node without explicitly pinning them.

E.g., I would want my dataset pipeline to feed task1 on node 1, and then the results of task1 to feed task2 on node 2.

I have used the SPREAD scheduling strategy and a placement group with two bundles (each with one CPU), but if I run the tasks in a loop I sometimes see task1 on node 2 and task2 on node 1.
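
For reference, here is a minimal sketch of roughly what I have; the task bodies and the loop are placeholders:

import ray
from ray.util.placement_group import placement_group

ray.init()

# Two 1-CPU bundles; the SPREAD strategy places the bundles on different
# nodes, but which physical node hosts bundle 0 vs. bundle 1 is not guaranteed.
pg = placement_group([{"CPU": 1}, {"CPU": 1}], strategy="SPREAD")
ray.get(pg.ready())

@ray.remote(num_cpus=1)
def task1(batch):
    ...  # placeholder: first pipeline stage

@ray.remote(num_cpus=1)
def task2(result):
    ...  # placeholder: second pipeline stage

for batch in range(10):
    r1 = task1.options(placement_group=pg,
                       placement_group_bundle_index=0).remote(batch)
    task2.options(placement_group=pg,
                  placement_group_bundle_index=1).remote(r1)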

I was wondering how I could get the scheduler to do this without using NodeAffinitySchedulingStrategy.

cc @jjyao @Clark_Zinzow, could you address this question?

Just wanted to follow up to request support for this.

@abhullar If I understand you correctly, what you did and saw is expected:

  • Without pinning tasks to nodes, using SPREAD makes the scheduler spread the tasks across the two nodes you provisioned. There is no guarantee that task1 will always run on node1 and task2 on node2.
  • If you do want that assignment, you can use NodeAffinitySchedulingStrategy with soft set to False. In this case the assignment is rigid and less tolerant of node failures (see the sketch after this list).
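
For example, here is a minimal sketch of pinning each stage to a fixed node; the node-picking logic and the task bodies are just illustrative:

import ray
from ray.util.scheduling_strategies import NodeAffinitySchedulingStrategy

ray.init()

# Pick two alive nodes; deciding which one counts as "node 1" is up to you.
node_ids = [n["NodeID"] for n in ray.nodes() if n["Alive"]]
node1, node2 = node_ids[0], node_ids[1]

@ray.remote(num_cpus=1)
def task1(batch):
    ...  # placeholder: first pipeline stage

@ray.remote(num_cpus=1)
def task2(result):
    ...  # placeholder: second pipeline stage

for batch in range(10):
    # soft=False: the task fails rather than being rescheduled elsewhere
    # if the target node goes away.
    r1 = task1.options(
        scheduling_strategy=NodeAffinitySchedulingStrategy(node1, soft=False)
    ).remote(batch)
    task2.options(
        scheduling_strategy=NodeAffinitySchedulingStrategy(node2, soft=False)
    ).remote(r1)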

Is it possible to have multiple tasks where some use SPREAD and some use NodeAffinitySchedulingStrategy?

@jianxiao
I'm not able to import NodeAffinitySchedulingStrategy. I installed Ray using:

pip install -U "ray[default]"
# installed version is ray, version 1.12.1

I don’t see NodeAffinitySchedulingStrategy in ray/util/scheduling_strategies.py.

It was introduced in PR Node affinity scheduling strategy (#23381) (ray-project/ray@95714cc), so it's not in 1.12 yet. Can you try the master branch? If you'd rather wait, the good news is that the 1.13 release is coming soon and will include it.


@abhullar Yes, you can submit those tasks as two batches, something like:

import ray
from ray.util.scheduling_strategies import NodeAffinitySchedulingStrategy

@ray.remote
def my_func(idx):
    ...

# using SPREAD in the first batch
[my_func.options(scheduling_strategy="SPREAD").remote(i) for i in range(10)]

# using node affinity in the second batch
# The destination_node_id can be found by calling ray.nodes()
[my_func.options(
    scheduling_strategy=NodeAffinitySchedulingStrategy(destination_node_id, soft=False)
).remote(i) for i in range(5)]