How to distribute a very huge FC layer?

ecolss · July 7, 2021, 10:38pm

Hi Ray community, I’m a beginner of Ray and excited to learn it.

I want to implement a simple logistic regression model, but the number of features is very big, e.g. 2^32, so that means the torch.Linear layer would be a huge FC layer, and the input data is actually sparse arrays with feature ids.

In pytorch, we can define such a huge FC layer, but that would not be efficient.
I read about RaySGD, but looks like it only supports data parallelism, however my issue is more about model parallelism I think.

And typically I think in industry, parameter servers would be used for this use case, so I wonder how to do this in Ray?
Thanks

sangcho · July 12, 2021, 6:00pm

cc @rliaw can you follow up with him? I think this is relevant to your team

Topic		Replies	Views
Ray pytorch model partition Ray Core	1	29	October 31, 2024
Model Parallelism in Ray Ray Train	9	2993	November 18, 2023
Accessing Large Static Datasets with Ray Clusters	3	566	May 27, 2023
Sharing big ML models using only Ray Core Ray Core	1	395	July 6, 2022
Parallel inference using CPUs Ray Core	2	830	July 7, 2023

How to distribute a very huge FC layer?

Related topics