Increasing number of Actor doesn't decrease the running time

trunhng · August 11, 2024, 10:58am

How severe does this issue affect your experience of using Ray?

Low: It annoys or frustrates me for a moment.

Greeting!
I’ve been trying to implement the MuZero agent. My implementation is strongly based on the muzero-general.
Unfortunately, when using Ray for running self-play games in parallel within test mode, the running time of my script was roughly the same no matter of how many self-play Actors I used. Since I’m totally new to Ray, I’m not sure if the way I was doing with it is correct.

Please help me out! Thank you in advanced and have a nice day!

Here is how I’m doing it

checkpoint = torch.load(os.path.join(self.config.log_dir, 'model.checkpoint'))
self_play_workers = [
    SelfPlay.remote(deepcopy(self.game), checkpoint, self.config, self.config.seed + 10 * i)
    for i in range(self.config.workers)
]
histories = []

for _ in tqdm(range(math.ceil(self.config.tests / self.config.workers)), desc=f'Testing'):
    hs = [
        ray.get(worker.play.remote(
            0,  # select actions with max #visits
            self.config.opponent,
            self.config.muzero_player,
            self.config.render)
        ) for worker in self_play_workers
    ]
    for h in hs:
        histories.append(h)

trunhng · August 12, 2024, 8:40am

For those who might come across at the same situation, as mentioned by Ray’s document, it is not recommended to call ray.get() inside a loop.
Thus, I have fixed my code into

checkpoint = torch.load(os.path.join(self.config.log_dir, 'model.checkpoint'))
self_play_workers = [
    SelfPlay.remote(deepcopy(self.game), checkpoint, self.config, self.config.seed + 10 * i)
    for i in range(self.config.workers)
]
histories = []

for _ in range(math.ceil(self.config.tests / self.config.workers)):
    histories += [
        worker.play.remote(
            0,  # select actions with max #visits
            self.config.opponent,
            self.config.muzero_player,
            self.config.render
        ) for worker in self_play_workers
    ]
histories = ray.get(histories)

And it worked!

Topic		Replies	Views
Delay ray.get() seems cannot speed up for actors Ray Core	2	439	June 9, 2022
Confused with coreworker and worker Ray Core	3	660	August 7, 2022
Actor spawning method Ray Core	1	301	June 21, 2022
Inconsistency when configuring selfplay with shared parameters Configure Algorithm, Training, Evaluation, Scaling	3	359	December 2, 2022
maximize the parallelization efficiency using Python ray ActorPool?	4	701	November 15, 2022

Increasing number of Actor doesn't decrease the running time

Related topics