What's the best way to wait for an actor to fully exit before creating a new one?

rkn · September 17, 2021, 10:39pm

For example, suppose I want to replace one actor with another (both use a GPU), and I want to make sure the first actor is fully dead and no longer holding GPU memory before the second one starts.

rkn · September 17, 2021, 10:40pm

Note that you can do ray.kill(actor_handle), but ray.kill returns before the actor process finishes exiting. You can check with the following script.

import os
import ray
ray.init()

@ray.remote
class Actor:
    def get_pid(self):
        return os.getpid()

a = Actor.remote()
pid = ray.get(a.get_pid.remote())

ray.kill(a)
os.kill(pid, 0)  # Signal 0 only checks existence: raises ProcessLookupError if the process is gone, otherwise does nothing
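
The same `os.kill(pid, 0)` check can be turned into a polling wait. The following is only a minimal sketch, not a Ray API: `pid_exists` and `wait_for_pid_exit` are hypothetical helper names, and it assumes a POSIX system with the driver running on the same machine as the actor process. It also cannot detect PID reuse, and a zombie process that has not been reaped still counts as existing.

```python
import os
import time


def pid_exists(pid: int) -> bool:
    """Return True if a process with this PID exists on the local machine (POSIX)."""
    try:
        os.kill(pid, 0)  # signal 0 sends nothing; it only checks existence/permission
    except ProcessLookupError:
        return False  # no such process
    except PermissionError:
        return True  # process exists but belongs to another user
    return True


def wait_for_pid_exit(pid: int, timeout: float = 30.0, interval: float = 0.1) -> bool:
    """Poll until the process is gone. Return True if it exited before the timeout."""
    deadline = time.monotonic() + timeout
    while pid_exists(pid):
        if time.monotonic() >= deadline:
            return False  # still alive when the timeout expired
        time.sleep(interval)
    return True
```

In the script above, this would mean calling `wait_for_pid_exit(pid)` after `ray.kill(a)` and creating the replacement actor only if it returns `True`.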

Chen_Shen · September 18, 2021, 2:42am

cc @yic is this something you are aware of?

yic · September 21, 2021, 10:12pm

Actor killing is currently asynchronous, and there is no option to make it synchronous. One workaround is to add a method to the actor that releases all the GPU resources it holds, call that method first, and only then kill the actor and start the new one. That way, you can be sure no actor is still using the resource when the new one is created.

If you think synchronous exit is an important feature, you can submit an issue requesting it.
