Tune architecture related parameters

GammaRamma · November 12, 2021, 12:05am

Hi,

I’m new to Ray and have a question I can’t find an answer to. I want to tune architecture-related parameters, such as the number of layers or the number of feature maps. These parameters change the number and shapes of the weights in the network, so networks with different architectures can’t simply exchange weights. As I understand it, population based training (PBT) reuses weights from other networks in the population, which would be a problem here: changing an architecture parameter after such a swap would produce a model that no longer matches the copied state_dict.
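To make the mismatch concrete, here is a minimal sketch in plain Python (no Ray or PyTorch needed). The `layer_shapes` helper and the MLP layout are hypothetical, made up only to mimic how a state_dict maps parameter names to shapes; they are not taken from any real model of mine.

```python
def layer_shapes(num_layers, width, in_dim=8, out_dim=2):
    """Return a {name: shape} map for a hypothetical MLP.

    This only mimics the layout of a state_dict; no real weights are created.
    """
    dims = [in_dim] + [width] * num_layers + [out_dim]
    shapes = {}
    for i, (d_in, d_out) in enumerate(zip(dims[:-1], dims[1:])):
        shapes[f"layers.{i}.weight"] = (d_out, d_in)
        shapes[f"layers.{i}.bias"] = (d_out,)
    return shapes


def compatible(a, b):
    """Weights can be copied over only if every name and shape matches."""
    return a == b


small = layer_shapes(num_layers=2, width=16)
same = layer_shapes(num_layers=2, width=16)
large = layer_shapes(num_layers=3, width=32)

print(compatible(small, same))   # identical architecture
print(compatible(small, large))  # different depth and width
print(sorted(set(large) - set(small)))  # entries that exist only in the larger net
```

Changing either the depth or the width changes the names or the shapes, so a checkpoint from one configuration can't be loaded into the other.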

So I’m wondering whether Ray Tune has a search technique where every network is trained from the first epoch (as in basic ASHA), but new configurations are still bred from the best-performing ones, as in PBT. This would work because the networks wouldn’t need to share the same weight structure.
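Roughly, what I have in mind looks like the sketch below. It is plain Python and deliberately independent of Ray Tune, since I don't know which Tune component (if any) does this. The search space, the `train_from_scratch` stand-in, and its toy objective are all assumptions for illustration; a real version would train a network from the first epoch and return a validation metric.

```python
import random

# Hypothetical search space over architecture parameters.
SEARCH_SPACE = {
    "num_layers": [1, 2, 3, 4, 5, 6],
    "num_feature_maps": [8, 16, 32, 64, 128],
}


def train_from_scratch(config):
    """Stand-in for a full training run starting at epoch one.

    Toy objective (an assumption): higher is better, peak at 4 layers / 64 maps.
    """
    return -abs(config["num_layers"] - 4) - abs(config["num_feature_maps"] - 64) / 32


def sample(rng):
    """Draw a random configuration from the search space."""
    return {k: rng.choice(v) for k, v in SEARCH_SPACE.items()}


def crossbreed(a, b, rng, mutation_rate=0.2):
    """Mix two parent configs per key, then occasionally mutate a key.

    Only hyperparameters are combined; no weights are ever shared.
    """
    child = {k: rng.choice([a[k], b[k]]) for k in SEARCH_SPACE}
    for k, values in SEARCH_SPACE.items():
        if rng.random() < mutation_rate:
            child[k] = rng.choice(values)
    return child


def evolve(generations=5, population_size=8, num_parents=3, seed=0):
    rng = random.Random(seed)
    population = [sample(rng) for _ in range(population_size)]
    best_config, best_score = None, float("-inf")
    for _ in range(generations):
        # Every candidate is trained from scratch, so architectures may differ freely.
        scored = sorted(
            ((train_from_scratch(c), c) for c in population),
            key=lambda pair: pair[0],
            reverse=True,
        )
        if scored[0][0] > best_score:
            best_score, best_config = scored[0]
        # Keep the top configs and breed the rest of the next generation from them.
        parents = [c for _, c in scored[:num_parents]]
        children = [
            crossbreed(rng.choice(parents), rng.choice(parents), rng)
            for _ in range(population_size - num_parents)
        ]
        population = parents + children
    return best_config, best_score


best_config, best_score = evolve()
print(best_config, best_score)
```

The difference from PBT, as I understand it, is that only the configurations are inherited between generations, never the checkpoints.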

Any help would be appreciated!!! Thank you!
