How severe does this issue affect your experience of using Ray?
- Medium: It contributes to significant difficulty to complete my task, but I can work around it.
I am deploying models using Ray Serve and Kuberay on k8s cluster with autoscaling enabled on nodes. Each Node is having 8 CPU and 16 GB of memory, and at max we can have 10 nodes. My issue is when I am configuring head worker with 6 CPU it’s not able to deploy head on none of the node. I suspect it’s taking sum of head and worker resource limits for deployment, because head with 5 CPU and worker with 2 CPU is working. Same I am observing with memory configurations.
Is it doing it by design(if yes then can we know why), or it’s a known bug? What is the workaround here if I want to have my head and worker with high resource configurations.
Here are the configurations which I am trying to set
resources:
rayHead:
limits:
cpu: "6"
memory: "8Gi"
requests:
cpu: "6"
memory: "8Gi"
rayWorker:
limits:
cpu: "5"
memory: "6Gi"
requests:
cpu: "5"
memory: "6Gi"
minWorkerReplica: 0
maxWorkerReplica: 5
The configuration which is working currently,
resources:
rayHead:
limits:
cpu: "5"
memory: "8Gi"
requests:
cpu: "5"
memory: "8Gi"
rayWorker:
limits:
cpu: "2"
memory: "4Gi"
requests:
cpu: "2"
memory: "4Gi"
minWorkerReplica: 0
maxWorkerReplica: 5