Autoscaling behavior

mbehrendt · September 13, 2021, 1:51pm

i’d just have a quick q re the autoscaling behavior described here: A Glimpse into the Ray Autoscaler by Ameer Haj Ali - YouTube

There you’re saying that the autoscaler would calculate how much resources would be needed exactly for running a particular function. If i setup a ray cluster with 0 workers and i only run a function like you’re describing there — should i expect to see an instantaneous provisioning of just enough machines to provide 6 CPUs ?

Re the time it takes to bring up the machines – i guess this is all gated only by how fast the underlying IaaS can bring up VMs, right?

Besides that, which role (if any) does the ratio parameter play here? Can it limit the number of VMs getting provisioned concurrently? E.g. if i have a loop of 1000, will the autoscaler request 1000 VMs right away as soon as the control flow enters the loop?

@Ameer_Haj_Ali

Ameer_Haj_Ali · September 13, 2021, 2:09pm

Yes. But it takes time to bring up those machines. Also, if the headnode can fit the 6 CPUs it will not start any worker nodes.

Keep in mind that the scaling is exponential with the number of running nodes. so if 10 nodes are running, it will add another 10, then another 20, then another 40, etc…
If you want this to go faster, you need to specify upscaling_speed:99999 in the cluster yaml and then it will scale up instantly.

more info on upscaling_speed: Cluster YAML Configuration Options — Ray v2.0.0.dev0

Topic		Replies	Views
Controlling Scaling based on jobs in queue Ray Core	2	285	April 8, 2021
Autoscaler scales cluster up and down all the time RLlib	6	453	May 12, 2021
Autoscaling not working with ray.util.multiprocessing Kubernetes	5	775	June 17, 2021
Autoscaling - Adding new worker nodes - stopped? Ray Clusters	0	350	July 15, 2021
Autoscaler launches extra nodes Ray Clusters	0	376	June 14, 2023

Autoscaling behavior

Related topics