How severe does this issue affect your experience of using Ray?
- High: It blocks me to complete my task.
This is the only Terraform script I could find that gives a complete example of running distributed training workloads with GPUs on AWS:
Are there other examples you are aware of that describe how to use Terraform to set up Ray with autoscaling on AWS?
I also found this related example for EKS: data-on-eks/ai-ml/ray/terraform at main · awslabs/data-on-eks · GitHub
(We are unable to use Kubernetes for now.)
Thank you so much!