Slow Actor start time due to import overhead of dependencies

owal-mike · May 25, 2021, 5:04pm

Hi,

I’m working on a realtime video analytics application that depends on the scientific computing stack (numpy, scipy, opencv, torch, sklearn, matplotlib, etc.). It spawns multiple actors for each stream, and we’re running into issues with startup time, which appears to be caused by the import time of these libraries. Because each session has an actor that spins up other actors for ingest and processing, the cost multiplies, and it takes around 15 seconds for the initial ray.get(actor.method.remote()) to return.
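One way to check that imports really are the bottleneck is to time them in a fresh interpreter. The following is a minimal sketch using only the standard library; the helper names `time_import` and `time_imports` are made up for illustration and are not part of Ray.

```python
import importlib
import sys
import time


def time_import(module_name):
    """Return the seconds spent importing module_name.

    Returns None if the module is already in sys.modules, because a
    cached import would report a misleadingly small number.
    """
    if module_name in sys.modules:
        return None
    start = time.perf_counter()
    importlib.import_module(module_name)
    return time.perf_counter() - start


def time_imports(module_names):
    """Time each import in order and return {module_name: seconds or None}."""
    return {name: time_import(name) for name in module_names}


if __name__ == "__main__":
    # Replace with the heavy dependencies, e.g. ["numpy", "scipy", "torch"].
    for name, seconds in time_imports(["json", "decimal"]).items():
        label = "already imported" if seconds is None else f"{seconds:.3f}s"
        print(f"{name}: {label}")
```

Order matters here: a library imported later may look cheap only because an earlier one already pulled in its dependencies. For a per-module breakdown, running the interpreter with `python -X importtime` prints the import time of every module it loads.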

Is there a way to get around this by potentially preallocating a process pool for the actors with the initial set of dependencies loaded?

sangcho · May 26, 2021, 6:37am

Have you looked into Actor pools https://docs.ray.io/en/master/actors.html?highlight=actor%20pool#actor-pool?

owal-mike · May 26, 2021, 1:55pm

Yes, but it’s not the easiest solution in my case because the actors I’m using are stateful and get assigned to individual video streams.

I’m really looking for a way to have actors preallocated the way tasks are (as explained in the “Using Actors” page of the Ray v2.0.0.dev0 docs).

If not I’ll probably have to preallocate the actors myself at startup and have some extra logic to manage assigning free ones to new streams.
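A rough sketch of that bookkeeping, written against a generic factory so it runs without Ray. `WarmPool` and its methods are hypothetical names, not a Ray API. Since the actors are stateful, this version never reuses one: releasing a stream hands the old worker back to the caller and creates a fresh replacement to keep the pool warm.

```python
from collections import deque


class WarmPool:
    """Keeps `size` pre-created workers ready and assigns them to stream ids."""

    def __init__(self, factory, size):
        self._factory = factory  # zero-argument callable that creates one worker
        self._free = deque(factory() for _ in range(size))
        self._assigned = {}  # stream_id -> worker

    def acquire(self, stream_id):
        """Return the worker for stream_id, taking a warm one if available."""
        if stream_id in self._assigned:
            return self._assigned[stream_id]
        # Fall back to a cold start only when the pool is exhausted.
        worker = self._free.popleft() if self._free else self._factory()
        self._assigned[stream_id] = worker
        return worker

    def release(self, stream_id):
        """Unassign the stream's worker and replace it with a fresh one.

        The used worker is returned so the caller can shut it down.
        """
        worker = self._assigned.pop(stream_id)
        self._free.append(self._factory())
        return worker

    def free_count(self):
        return len(self._free)
```

With Ray, the factory could be something like `lambda: StreamActor.remote()`, where `StreamActor` is an assumed `@ray.remote` class defined in a module that imports the heavy dependencies. Because actor creation is asynchronous, calling a cheap method on each new actor and waiting on it with `ray.get` at startup would be one way to make sure the imports have actually finished before a stream arrives. The worker returned by `release` could then be terminated with `ray.kill`.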

sangcho · May 29, 2021, 8:36am

Actually, tasks are not preallocated in this case. What’s happening is that there are pre-created workers, and when an actor is created, one of those workers is chosen and initialized as that actor.

I don’t think there’s a clear way to make this pre-import work right now. I recommend doing what you mentioned, but feel free to file an API request on our GitHub issues page!
