i’d just have a quick q re the autoscaling behavior described here: A Glimpse into the Ray Autoscaler by Ameer Haj Ali - YouTube
There you’re saying that the autoscaler would calculate how much resources would be needed exactly for running a particular function. If i setup a ray cluster with 0 workers and i only run a function like you’re describing there — should i expect to see an instantaneous provisioning of just enough machines to provide 6 CPUs ?
Re the time it takes to bring up the machines – i guess this is all gated only by how fast the underlying IaaS can bring up VMs, right?
Besides that, which role (if any) does the ratio parameter play here? Can it limit the number of VMs getting provisioned concurrently? E.g. if i have a loop of 1000, will the autoscaler request 1000 VMs right away as soon as the control flow enters the loop?