How severe does this issue affect your experience of using Ray?
- Medium: It contributes to significant difficulty to complete my task, but I can work around it.
Hello,
I’ve an On-premise ray cluster (i.e. provider.type: local
). I’d like to declare different worker node types in the cluster (e.g. “CPU only nodes” and “GPU nodes”). I tried adding an available_node_type
section in the cluster configuration as follows.
# Rest excluded for brevity
provider:
type: local
head_ip: 10.1.0.1
worker_ips: [10.1.0.2,10.1.0.3,10.1.0.4]
# Rest excluded for brevity
available_node_types:
head_node:
min_workers: 0
max_workers: 0
resources: {"CPU": 2}
cpu_node:
min_workers: 1
max_workers: 1
resources: { "CPU": 6}
gpu_node:
min_workers: 2
max_workers: 2
resources: { "CPU": 6, "GPU": 1}
head_node_type: head_node
# Rest excluded for brevity
When I ran ray up
to start the cluster, I got the following error:
The field available_node_types is not supported for on-premise clusters.
Is there way to declare different node types on on-premise
clusters?
The workaround that I’m thinking about ATM is to create a Ray cluster for each worker node type and have something external (to Ray) to schedule the workloads to the correct Ray cluster.
Thanks