Best way to scale ingestion of IoT sensor streams with Ray?

aria_12 · December 27, 2025, 9:37pm

Hi everyone, I’m working on a project where dozens of ESP32-based sensors stream motion/telemetry data to a central service, and I’m evaluating options to scale ingestion and processing efficiently. For context, I’ve been exploring this ESP32 motion detector example (https://www.theengineeringprojects.com/2022/03/iot-based-motion-detection-with-email-alert-using-esp32.html), which shows how devices can send frequent HTTP events, and I’d love to architect a scalable pipeline around that.

I’ve also seen Arduino forum threads and Raspberry Pi community projects where sensor telemetry is pushed to cloud endpoints or message brokers, and some IoT discussions about using lightweight event buses (MQTT, WebSockets). In Ray, what pattern works best for ingesting and processing a high volume of small sensor events (e.g., actor pools, streaming ingestion with Ray Serve, queues) while keeping latency low? Any lessons learned or starter examples would be great!

RunLLM · December 27, 2025, 9:38pm

For high-volume, low-latency ingestion of small sensor events in Ray, the recommended pattern is async actors or actor pools. The workload is I/O-bound, so a single async actor can interleave many concurrent events on its event loop, and a pool of such actors spreads the load across CPU cores and nodes. Ray Serve also works for HTTP-based ingestion and adds autoscaling and request routing, but for pure event streaming (especially non-HTTP), async actors are often more efficient. Ray’s built-in queue (ray.util.queue.Queue) is available but is generally slower and less scalable than actor-based approaches for this use case. Ray Data is preferred for batch or offline processing; for real-time streaming, async actors or Serve deployments with async endpoints are better suited.
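For the HTTP case, a rough sketch of a Serve deployment with an async endpoint might look like the following. The names `parse_event`, `SensorIngest`, and `deploy`, the `sensor_id`/`motion` payload fields, and the replica limits are all assumptions made for illustration, not something taken from the ESP32 example. `serve` is imported inside `deploy` only so the plain class can be exercised without a running cluster.

```python
import time


def parse_event(payload) -> dict:
    """Validate one sensor reading and stamp it with a receive time."""
    if not isinstance(payload, dict):
        raise ValueError("payload must be a JSON object")
    sensor_id = payload.get("sensor_id")  # hypothetical field name
    if not isinstance(sensor_id, str) or not sensor_id:
        raise ValueError("sensor_id must be a non-empty string")
    return {
        "sensor_id": sensor_id,
        "motion": bool(payload.get("motion", False)),  # hypothetical field name
        "received_at": time.time(),
    }


class SensorIngest:
    """HTTP entry point; Serve calls __call__ with a Starlette Request."""

    def __init__(self) -> None:
        self.accepted = 0

    async def __call__(self, request) -> dict:
        try:
            # Awaiting the body lets this replica interleave other requests.
            event = parse_event(await request.json())
        except ValueError as exc:  # also covers malformed JSON
            return {"ok": False, "error": str(exc)}
        self.accepted += 1
        # Hand `event` to downstream processing here (for example a worker actor).
        return {"ok": True, "sensor_id": event["sensor_id"]}


def deploy() -> None:
    """Wrap the class as a Serve deployment and start it (requires Ray Serve)."""
    from ray import serve  # imported here so the class above is usable without Ray

    app = serve.deployment(SensorIngest).options(
        # Assumed limits; tune them from load-test results.
        autoscaling_config={"min_replicas": 1, "max_replicas": 4},
    ).bind()
    serve.run(app)  # Serve's HTTP proxy listens on port 8000 by default
```

This sketch reports validation errors in the response body with the default status code; a real service would more likely return a 4xx response for rejected events.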

Lessons learned:

For I/O-bound sensor streams, start with a single async actor and load test; scale to multiple actors if the event loop saturates.
Use actor pools to distribute events across N actors for load balancing.
Ray Serve is ideal if you want HTTP endpoints with autoscaling and built-in request routing.
Avoid using Ray’s distributed Queue for high-throughput ingestion, as it introduces more overhead than actor-based solutions.
For MQTT/WebSocket ingestion, use an async actor to subscribe and process messages, then fan out to worker actors if needed.
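The fan-out idea in the points above could be sketched roughly as follows: one async actor buffers incoming events and forwards them in small batches to worker actors, routing by sensor so each sensor's events stay on one worker. Everything named here (`shard_for`, `Worker`, `Ingestor`, `run_demo`, the `sensor_id` field, the batch size) is hypothetical, and the MQTT/WebSocket subscription itself is left out. `ray` is imported inside `run_demo` only so the plain classes can be tried without a cluster.

```python
import zlib


def shard_for(sensor_id: str, num_workers: int) -> int:
    """Stable routing: the same sensor always maps to the same worker."""
    return zlib.crc32(sensor_id.encode("utf-8")) % num_workers


class Worker:
    """Keeps per-sensor event counts; stands in for real processing."""

    def __init__(self) -> None:
        self.counts: dict[str, int] = {}

    def process(self, batch: list[dict]) -> int:
        for event in batch:
            sid = event["sensor_id"]
            self.counts[sid] = self.counts.get(sid, 0) + 1
        return len(batch)

    def snapshot(self) -> dict[str, int]:
        return dict(self.counts)


class Ingestor:
    """Async front end: buffers small events and forwards them in batches."""

    def __init__(self, workers: list, batch_size: int = 100) -> None:
        self.workers = workers  # Ray actor handles for Worker
        self.batch_size = batch_size
        self.buffers: list[list[dict]] = [[] for _ in workers]

    async def ingest(self, event: dict) -> None:
        idx = shard_for(event["sensor_id"], len(self.workers))
        buf = self.buffers[idx]
        buf.append(event)
        if len(buf) >= self.batch_size:
            self.buffers[idx] = []  # swap before awaiting so no event is sent twice
            await self.workers[idx].process.remote(buf)

    async def flush(self) -> None:
        """Forward whatever is still buffered, e.g. on a timer or at shutdown."""
        pending = []
        for idx, buf in enumerate(self.buffers):
            if buf:
                self.buffers[idx] = []
                pending.append(self.workers[idx].process.remote(buf))
        for ref in pending:
            await ref


def run_demo(num_workers: int = 2) -> dict[str, int]:
    """Wire the classes up as Ray actors and push a few fake events through."""
    import ray  # imported here so the classes above are usable without Ray

    ray.init(ignore_reinit_error=True)
    workers = [ray.remote(Worker).remote() for _ in range(num_workers)]
    # Because Ingestor defines `async def` methods, Ray runs it as an async actor.
    ingestor = ray.remote(Ingestor).remote(workers, batch_size=2)
    events = [{"sensor_id": f"esp32-{i % 3}", "motion": True} for i in range(10)]
    ray.get([ingestor.ingest.remote(e) for e in events])
    ray.get(ingestor.flush.remote())
    totals: dict[str, int] = {}
    for snap in ray.get([w.snapshot.remote() for w in workers]):
        totals.update(snap)
    return totals
```

Batching is a design choice here rather than a requirement: forwarding one remote call per tiny event tends to be dominated by per-call overhead, so grouping events per worker is a common way to trade a little latency for throughput.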

Would you like a code example or more detail on a specific ingestion pattern?

Hint: Mention @RunLLM in the post for followups.
