Multiple async actors vs single Actor / plain asyncio

ingandreaguidi · March 20, 2023, 4:04pm

How severe does this issue affect your experience of using Ray?

Medium: It contributes to significant difficulty to complete my task, but I can work around it.

Hello, first of all thank you for this great framework.
I would like to understand whether having multiple async Actors that process elements from a list vs having a single actor OR using plain async/await for that provides any real performance improvement, given that spawning more processes does not bring any value with event loop and coroutines.

My initial guess was that having multiple actors could help with achieving something close to how Rust’s Tokio works like, but I might be 100% wrong.

Jules_Damji · March 20, 2023, 5:56pm

Good question. I did some experiments but it was more about using async Actors vs sync actors vs threading. Not sure if that addresses this question.

cc: @cade

ingandreaguidi · March 20, 2023, 6:17pm

@Jules_Damji Thank you for your response. Actually, I had already found your article and I thought it was really interesting, my point though was more focused on understanding whether to rewrite some async/await code by using 1 or more Async Actors; I’m trying to understand which gain I might obtain, besides the builtin maximum concurrency handling (which is really nice, btw).

The code I have is all I/O bound code (network requests), and I am running it on a single multi-core Ubuntu server. Open to all kinds of suggestions from you all!

cade · March 20, 2023, 6:33pm

Hi @ingandreaguidi! This use-case feels well-served by a single Ray asyncio actor.

It’s hard to answer the root question in a general way; under high-load the single event loop will have a limit in throughput and/or latency in processing tasks. Once your event loop is saturated (can no longer increase throughput), or the latency variation is unacceptably high, you should move to multiple asyncio actors and shard the input data over them to better parallelize the processing. This should increase your throughput linearly (by the number of actors), and will then be limited by the number of cores in your CPU.

If you want high confidence, I would run a load test against a single async actor to determine the maximum throughput a single actor can sustain with reasonable latency.

Jules_Damji · March 20, 2023, 6:48pm

Excellent @cade. yes, I would agree for I/O bound, it seems like async is the best choice.

@ingandreaguidi, let us know how you fare with @cade’s suggestion.

Topic		Replies	Views
Ray ActorPool with 2 actors for Tensorflow resent-50 prediction is not performance better than single actor pool Ray Core	0	309	December 11, 2021
[Core] Why actors are executed sequentially? Ray Core	5	286	September 27, 2023
How to increase ray performance for cpu and io bound operations in a task Ray Core	9	1008	August 9, 2021
Running methods with actors is slower than running normal methods Ray Core	10	697	May 24, 2021
Actor spawning method Ray Core	1	328	June 21, 2022

Multiple async actors vs single Actor / plain asyncio

Related topics