How to tune hyperparameters such as layer count, inner layer sizes, and dropout probabilities with Hugging Face Transformers

The Hugging Face documentation mentions that I can tune parameters like layer count, inner layer sizes, and dropout probabilities of a Transformers model by passing a Ray Tune trial object to a user-defined `model_init` function.

How exactly would the implementation look for this? Perhaps using this example as a reference.

Hey @Luca_Guarro, you can make `model_init` take a `params` argument. For example, in the script above, you could do something like:

    def get_model(params):
        # fold the trial's sampled hyperparameters into the base config
        config.update(params)
        return AutoModelForSequenceClassification.from_pretrained(
            model_name, config=config
        )

    trainer = Trainer(model_init=get_model, ...)
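For context, `Trainer` decides how to call `model_init` by inspecting its signature: a zero-argument function is called as-is, while a one-argument function receives the current trial (which is `None` when no search is running). A simplified, stdlib-only sketch of that dispatch logic (not the actual `transformers` implementation):

    import inspect

    def call_model_init(model_init, trial=None):
        # Simplified sketch of how transformers' Trainer invokes model_init:
        # count the function's parameters and call it accordingly.
        n_params = len(inspect.signature(model_init).parameters)
        if n_params == 0:
            return model_init()   # model_init takes no arguments
        return model_init(trial)  # pass the trial (None outside a search)

    # A one-argument model_init receives whatever the trial provides:
    result = call_model_init(lambda params: params, trial={"dropout": 0.2})

This is why `get_model(params)` above works during `hyperparameter_search`: the trial's sampled values arrive as that single argument.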

Do I need to do anything in particular to make sure the parameters actually get passed?

I defined the function like so:

    def get_model(params):
        db_config = db_config_base
        db_config.update({'alpha': params['alpha_val'], 'dropout': params['dropout_val']})
        return DistilBERTForMultipleSequenceClassification.from_pretrained(
            db_config, num_labels1=2, num_labels2=8
        )

But then when I run it, I get the error:

TypeError: 'NoneType' object is not subscriptable

That is, `params` is `None`. So how do I actually ensure that `get_model` receives an instance of the Ray Tune trial object?

Can you provide the full script you're running? I'd like to take a look at this myself on my machine.

Hi Richard, we actually solved this issue on this forum. Let me know if that suffices.
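For anyone hitting the same `TypeError`: the likely cause is that `Trainer` also calls `model_init` once with `trial=None` when it is constructed, before any hyperparameters have been sampled, so the function must handle `params` being `None`. A minimal sketch of the guard, with a plain dict standing in for the real DistilBERT config and hypothetical default values:

    db_config_base = {"alpha": 0.5, "dropout": 0.1}  # hypothetical defaults

    def get_model_config(params):
        # Copy so repeated trials don't mutate the shared base config.
        db_config = dict(db_config_base)
        # Trainer calls model_init(None) at construction time; only update
        # the config once Ray Tune has actually sampled hyperparameters.
        if params is not None:
            db_config.update({"alpha": params["alpha_val"],
                              "dropout": params["dropout_val"]})
        return db_config

    default_config = get_model_config(None)  # initial call, no trial yet
    tuned_config = get_model_config({"alpha_val": 0.9, "dropout_val": 0.3})

The same guard applied inside the original `get_model`, before the `from_pretrained` call, avoids subscripting `params` when it is `None`.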