Best practices for unit testing actors in a large, complex codebase with many dependencies?

jhallard · February 3, 2023, 5:56pm

I have some questions about best practices and limitations for unit testing Ray Actors, especially in non-toy examples.
For any sufficiently complex system, unit testing involves mocking large portions of that system, including environment variables, constants, network requests, and especially external dependencies like databases. However, when unit testing Ray Actors, the only way I have found for mocks to actually apply is to do the following:

Run in local mode when initializing a local Ray cluster
Create actors in a “JIT” fashion by forgoing the @ray.remote() decorator and instead turning your classes into actors at the last minute (ray.remote(**options)(ActorType).remote(**kwargs))
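As a minimal sketch of the “JIT” pattern described above: the class stays a plain Python class, so a test can construct it directly with mocks, and it only becomes an actor when wrapped. The names InventoryActor, fetch_count and make_actor are hypothetical and used only for illustration; ray.remote is the only Ray call used when wrapping, and it is imported lazily so the plain-class test does not require Ray.

```python
from unittest import mock


class InventoryActor:
    """Plain class (hypothetical example): no @ray.remote decorator applied."""

    def __init__(self, db):
        self.db = db  # external dependency, injected so tests can pass a mock

    def count(self, sku):
        return self.db.fetch_count(sku)


def make_actor(cls, options=None, **actor_kwargs):
    """Hypothetical helper: turn `cls` into a Ray actor at the last minute."""
    import ray  # lazy import: tests of the plain class do not need Ray

    # With options, ray.remote(**options) returns a decorator; with none,
    # ray.remote is applied to the class directly.
    actor_cls = ray.remote(**options)(cls) if options else ray.remote(cls)
    return actor_cls.remote(**actor_kwargs)


def test_count_uses_db():
    # The undecorated class is exercised directly, so the mock applies.
    db = mock.Mock()
    db.fetch_count.return_value = 3
    actor = InventoryActor(db)
    assert actor.count("sku-1") == 3
    db.fetch_count.assert_called_once_with("sku-1")
```

Under this sketch, production code or a local-mode test would call something like make_actor(InventoryActor, options={"num_cpus": 1}, db=real_db) after ray.init(...), and then invoke methods with .remote() as usual.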

Some questions about this

Is my understanding of local mode correct? From what I can tell, when not running in local mode, actors are pickled, sent via the GCS to a worker, and run in another process. Because the actor runs in that separate process, it never picks up any of the mocks or global fixtures that one generally relies on when unit testing.
If this is the case, and given that local mode appears to be widely used for unit testing (based on the Ray GitHub issues that mention it), why is local mode being deprecated? It seems like the only reasonable way to test a sufficiently complex system where mocks are required, especially when integrating actors into an existing complex codebase.
Does the Ray team (or anyone else here who has built a sufficiently complex system with actors) have any other tips and tricks for unit testing, and specifically for mocking subcomponents of actors? We would love to run in non-local mode to better test the full system and to have access to functionality like killing actors, the Ray state APIs, etc., but that seems impossible without writing full integration tests that forgo the mocks.
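The process-boundary effect behind the first question can be reproduced without Ray. The sketch below is only an analogy, not Ray's actual mechanism: it patches a module-level constant in the test process (string.digits stands in for an application constant) and then reads the same constant from a fresh Python interpreter, which plays the role of a separate worker process. The helper names read_here and read_in_child are hypothetical.

```python
import string
import subprocess
import sys
from unittest import mock


def read_here():
    # Runs in the test process, so it sees whatever the test has patched.
    return string.digits


def read_in_child():
    # A fresh interpreter imports its own copy of `string`; patches made in
    # the parent process are not part of that new process.
    code = "import string; print(string.digits)"
    result = subprocess.run(
        [sys.executable, "-c", code],
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout.strip()


with mock.patch("string.digits", "MOCKED"):
    seen_here = read_here()          # the patched value
    seen_in_child = read_in_child()  # the real value from the child process
```

The patch is visible only inside the process that applied it, which is consistent with mocks not reaching actors that run in a separate worker process.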

Chen_Shen · February 13, 2023, 7:26pm

Hi @jhallard

This is a great question. The problem with local mode is that we don't have the staffing to maintain it, and today it is missing a few critical features (such as runtime-env, or features like iterators).

The best practice the Ray team has adopted today is actually to run integration tests that start multiple nodes on the same cluster, without mocking. I guess you are already doing that today, but ray/test_basic.py at master in the ray-project/ray GitHub repository could be a starting point.
