Is there a way to schedule a task according to data affinity? if not is this something that is planned?
Say a task has dependency on 4 objects (equally sized for the sake of the example) 3 are located on node 1 and another one on node 2.
According to the whitepaper if the task originated from node 2 and the node has enough resources it will be scheduled on the node 2 and will transfer the 3 objects from node 1 to the node’s plasma.
In this case it might be beneficial to schedule the task on node 1 so it will transfer only 1 object between the nodes.