I have a question for ray.data in realtime streaming process scenario

kyoka_gong · September 11, 2023, 10:03am

Hi everyone
I have been following the development of Ray and using it to solve problems in my work.
About 2 years ago, ray cooperated with ant group, tried to develop ray streaming like flink for realtime streaming process scenario. At that time, I was hoping that one day I could use Ray for real-time computing.
However, about 1 year ago, this development (ray streaming) was separated from ray to become mobius. And this project has not been updated for one year. The development plan of that ray streaming (which was stored in the Google document) has also disappeared.
Recently, I m trying to use ray for realtime streaming process scenario and I wanted to upgrade ray.data to fit what I want.
I would like to ask if there are some insurmountable problems that Ray will encounter in realtime streaming process scenario, which leads to giving up this path. If so, I will also give up this path.
If possible, I hope someone can tell me the reasons why I have to give up.

Thanks

Jules_Damji · September 12, 2023, 6:34pm

@kyoka_gong We are a bit all heads down. for Ray Summit coming next week. We replies/responses will be delayed until after next week.

cc; @ericl @chengsu

kyoka_gong · September 12, 2023, 11:53pm

thx, jules.

I m also expecting ray submit

skc361 · December 5, 2023, 6:15am

Hi. Jules, is there any updates here?

ivw · August 6, 2024, 5:28pm

Well it’s been a year; can you answer your own question with your learnings? I for one am very interested in the comparison/choice between Ray and Flink.

sjl · August 22, 2024, 8:28pm

For what it’s worth, Ray Data now natively uses streaming execution for datasets. You can read more about it in our docs:

One major difference between Ray and Flink is that Ray does not currently support unbounded data streams. Ray Data is more suitable for mixed CPU+GPU workloads, as we can take advantage of heterogeneous clusters.

Topic		Replies	Views
Ray Data streaming not streaming smoothly Ray Data	8	762	May 30, 2023
Does ray currently support some form of streaming, similar to kafka-stream	1	645	May 2, 2022
What's the policy of auto-scaling in Ray streaming? Ray Data	3	390	August 1, 2022
Benchmarks for Ray Data? Ray Data	13	1042	October 5, 2023
Ray is not meant as general ETL tool Ray Core	10	4215	April 13, 2023

I have a question for ray.data in realtime streaming process scenario

Related topics