Ray OOM issue when Ray Serve is launched

yin_Eric · November 22, 2022, 9:03am

I always hit this OOM problem when I launch Ray Serve to set up an ML backend service. Where does the "7.68G" memory limit come from, and how can I raise this threshold shown in the Ray dashboard?

yin_Eric · November 22, 2022, 9:04am

The Ray dashboard shows a memory limit of 7.68 GB. How can I increase this value?

yin_Eric · November 22, 2022, 9:06am

The ML code is just like this, with a higher num_replicas set. Does the OOM happen because more replicas occupy much more memory?

ClarenceNg · December 14, 2022, 6:46am

The specific error you are seeing here (RayOutOfMemoryError) is being removed in Ray 2.2 - you may want to give that version a try

ClarenceNg · December 14, 2022, 6:50am

As for the error you are seeing, it seems the node does not have enough memory to serve 2 replicas - you may want to increase the memory of the node or add another node
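
To make the sizing argument above concrete, here is a rough back-of-the-envelope sketch in plain Python (it does not call Ray). The 7.68 GB figure is the limit reported in this thread; the 4 GB per-replica footprint, the 20% headroom, and the helper names `max_replicas` and `nodes_needed` are assumptions for illustration only, so measure the real footprint of one replica before relying on numbers like these.

```python
import math

GB = 10 ** 9  # decimal gigabytes, used here only for illustration


def max_replicas(node_memory_bytes, per_replica_bytes, headroom_fraction=0.2):
    """Estimate how many replicas fit on one node.

    headroom_fraction is the share of node memory left free for everything
    that is not a replica (a hypothetical 20% by default).
    """
    if per_replica_bytes <= 0:
        raise ValueError("per_replica_bytes must be positive")
    if not 0 <= headroom_fraction < 1:
        raise ValueError("headroom_fraction must be in [0, 1)")
    usable = node_memory_bytes * (1 - headroom_fraction)
    return int(usable // per_replica_bytes)


def nodes_needed(num_replicas, replicas_per_node):
    """Estimate how many equally sized nodes are needed for num_replicas."""
    if replicas_per_node <= 0:
        raise ValueError("a single replica does not fit on one node")
    return math.ceil(num_replicas / replicas_per_node)


if __name__ == "__main__":
    node_memory = 7.68 * GB   # limit reported in the dashboard in this thread
    per_replica = 4 * GB      # assumed footprint of one model replica
    fit = max_replicas(node_memory, per_replica)
    print("replicas that fit on one node:", fit)
    print("nodes needed for 2 replicas:", nodes_needed(2, fit))
```

Under these assumed numbers only one replica fits on the node, so two replicas would need either a larger node or a second node, which matches the suggestion above.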
