Launching subprocesses inside a ray.remote is 10x slower

frib · February 14, 2024, 2:07am

Launching subprocesses with asyncio.create_subprocess_exec inside a ray.remote task is about 10x slower than launching them directly from the driver process. For example:

import asyncio
import ray
from tqdm.asyncio import tqdm_asyncio


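async def arun():
    # Spawn 3000 short-lived bash processes concurrently.
    # create_subprocess_exec returns once each child is started;
    # the children are not awaited here, so this times launching only.
    await tqdm_asyncio.gather(*[
        asyncio.create_subprocess_exec("bash", "-c", "sleep 0.0001")
        for _ in range(3000)
    ])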


@ray.remote
def run():
    asyncio.run(arun())


ray.init()

asyncio.run(arun())  # directly in the driver: ~3s, ~1k it/s
ray.get(run.remote())  # inside a Ray task: ~30s, ~100 it/s (10x slower)

It’s even worse when the task is heavier (e.g. you get a 30x slowdown when cat-ing a file to /dev/null).
I’m using Ray 2.9.2 and Python 3.10.11 on Ubuntu 22.04.
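
To narrow down where the time goes, the spawn phase and the wait-for-exit phase could be timed separately. The following is only a minimal sketch: `spawn_and_wait` and its `cmd` parameter are hypothetical names introduced here, and the default command simply mirrors the one in the example above. It uses only the standard library, so the same function can be called both directly and from inside the ray.remote task to compare the two sets of numbers.

```python
import asyncio
import time


async def spawn_and_wait(n, cmd=("bash", "-c", "sleep 0.0001")):
    """Return (spawn_seconds, wait_seconds) for n concurrent subprocesses.

    Hypothetical helper: `cmd` defaults to the command from the post.
    """
    t0 = time.perf_counter()
    # Phase 1: start all children; each call returns once its process exists.
    procs = await asyncio.gather(*[
        asyncio.create_subprocess_exec(*cmd) for _ in range(n)
    ])
    t1 = time.perf_counter()
    # Phase 2: wait for every child to exit so none are left unreaped.
    await asyncio.gather(*[p.wait() for p in procs])
    t2 = time.perf_counter()
    return t1 - t0, t2 - t1


if __name__ == "__main__":
    spawn_s, wait_s = asyncio.run(spawn_and_wait(100))
    print(f"spawn: {spawn_s:.3f}s, wait: {wait_s:.3f}s")
```

If the spawn phase accounts for most of the difference, the overhead is in process creation itself rather than in the work the children do.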

The use case: I want to train a model to write correct code, and during eval I generate many programs and run them against test cases.
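
For concreteness, the evaluation loop in this use case might look roughly like the sketch below. Everything in it is an assumption made for illustration: the names `run_case` and `evaluate`, the `(source, stdin_text, expected)` tuple format, the use of the current Python interpreter to run the generated program, and the concurrency limit are all hypothetical and not taken from my actual code.

```python
import asyncio
import sys


async def run_case(source, stdin_text, expected, timeout=5.0):
    """Run one generated program on one test case; True if stdout matches.

    Hypothetical helper: assumes the generated program is Python source.
    """
    proc = await asyncio.create_subprocess_exec(
        sys.executable, "-c", source,
        stdin=asyncio.subprocess.PIPE,
        stdout=asyncio.subprocess.PIPE,
        stderr=asyncio.subprocess.DEVNULL,
    )
    try:
        out, _ = await asyncio.wait_for(
            proc.communicate(stdin_text.encode()), timeout
        )
    except asyncio.TimeoutError:
        # Kill programs that hang, then reap them.
        if proc.returncode is None:
            proc.kill()
        await proc.wait()
        return False
    return proc.returncode == 0 and out.decode().strip() == expected.strip()


async def evaluate(cases, max_parallel=8):
    """Run all cases with at most max_parallel subprocesses alive at once."""
    sem = asyncio.Semaphore(max_parallel)

    async def guarded(case):
        async with sem:
            return await run_case(*case)

    return await asyncio.gather(*(guarded(c) for c in cases))


if __name__ == "__main__":
    cases = [
        ("print(int(input()) * 2)", "21\n", "42"),  # correct program
        ("print('wrong')", "1\n", "2"),             # wrong output
    ]
    print(asyncio.run(evaluate(cases)))
```

The semaphore bounds how many children exist at the same time, which differs from the benchmark above, where all 3000 launches are issued at once.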

I don’t understand why it slows down, and I would really appreciate an explanation of why this happens and how to fix it!
