Not receiving the same results from a manual Grid Search

hgersten · November 13, 2023, 12:53am

I have a manual grid search, which within every loop, the code appends rmse, mae and mape to a file along with the combination of hyperparameters that the model ran with. It runs all the combinations of hyper-parameters, and then after the loop finishes all trials I have a function that combines 3 different losses (first it filters all runs by the top 10 least rmse, then filter to the 4 least mae and then take the least mape). After that, I get the combination of hyper-parameters, and those are the parameters it uses to run my model deployed.

Here is the code of the the manual.

lr_list = [.013,  .0147, .017, .019]
hs_list = [16, 32]
BATCH_SIZE = 128
OUTPUT_SIZE = 7
for quantile in quantile_list:
    QUANTILES = quantile
    LOSS = QuantileLoss(quantiles = QUANTILES) 
    for lr in lr_list:
        LEARNING_RATE = lr
        for hs in hs_list:
            HIDDEN_SIZE = hs
            run_model(data)

I have code to use Ray-tune (2.8.0) as a wrapper and it runs all the combinations of the manual loop on top, and it references the same run_model function:

import ray
from pytorch_forecasting import TimeSeriesDataSet
from ray.tune.error import TuneError
import lightning.pytorch as pl

BATCH_SIZE=128 #128                              
OUTPUT_SIZE = 7
   

pl.seed_everything(42)

def run_functions(config):
    try:
        trial_id = ray.train.get_context().get_trial_id()
        run_model_20(data_id)

        ray.train.report({'dummy_metric': 1})
    except Exception as e:
        print(f"An error occurred during the run: {e}")
        raise e

try:
    ray.init(address='auto', _node_ip_address=node_ip_address,log_to_driver=False)

except ConnectionError as e:
    print(f"Could not connect to Ray cluster: {e}")
    exit(1)

data_id = ray.put(data)

hyperparameter_space = {
    "learning_rate": tune.grid_search([.013,  .0147, .017, .019]),
    "hidden_size": tune.grid_search([16, 32]),
    "quantiles": tune.grid_search([[.1,.1,.3,.4,.6,.7,.8],  [.1,.5,.7,.8,.9,.09,.9], [.5,.6,.7,.8,.9,.9,.9], [.1,.2,.3,.4,.5,.6,.7]]),
}
# Resources per trial
num_cpus = ray.available_resources().get('CPU', 1)
num_gpus = ray.available_resources().get('GPU', 0)
resources_per_trial = {"cpu":1, "gpu": num_gpus}

# Setup result directory
ray_results_dir = os.path.abspath("./ray_results")
os.makedirs(ray_results_dir, exist_ok=True)
try:

    # Run the experiment
    analysis = tune.run(
        run_functions,
        config=hyperparameter_space,
        num_samples=1,
        resources_per_trial=resources_per_trial,
    # scheduler=scheduler,
        local_dir=ray_results_dir,
        verbose=1  # Increased verbosity
    )
    trial_id= ray.train.get_context().get_trial_id()
#best_config = analysis.get_best_config("dummy_metric", "max")
#print("Best config: ", best_config)
except TuneError as e:
    print(f"Trial did not complete: {e}")
finally:
    ray.shutdown()

I am using the same parameters and the same random seed and the same funciton. However, the results of the losses are not the same and I am getting different parameters, therefore my model results are not matching.

I would imagine logically it’s a grid search, and it’s using the same function it should give me the same results. Why am I receiving different best parameters.

matthewdeng · November 13, 2023, 5:18pm

Can you try moving this inside of run_functions?

hgersten · November 13, 2023, 5:27pm

Thank you for your response! Any help would be greatly appreciated as I’ve spent hours on this project trying to replicate. I only added the pl.seed_everything in the run_function as an after thought after it wasn’t matching. I have pl.seed_everything(42) within the run_model function, so it should really be exact to the manual grid search…
please please, if you can help!

hgersten · November 13, 2023, 6:40pm

If you need more information, please let me know and I’d be glad to provide.

matthewdeng · November 14, 2023, 4:11am

Grid search itself shouldn’t introduce any difference.

To verify this, maybe one thing you can test is something like:

Test 1: Run function directly

run_functions({"learning_rate": . 013, "hidden_size": 16, "quantiles": [.1,.1,.3,.4,.6,.7,.8]})

Test 2: Use Ray Tune with a single config

hyperparameter_space = {
    "learning_rate": tune.grid_search([.013]),
    "hidden_size": tune.grid_search([16]),
    "quantiles": tune.grid_search([[.1,.1,.3,.4,.6,.7,.8]]),
}

This should help narrow down whether or not the results are the same, and/or if the random seeding is properly taking place.

hgersten · November 15, 2023, 2:22pm

Good idea - let me try it and will let you know what happens.

hgersten · November 15, 2023, 4:20pm

When I run on just 1 set of parameters I get the same exact scores - It seems it just when it’s running the multiple ones.
What can be done to achieve the same results but using Ray

hgersten · November 21, 2023, 4:09pm

Any ideas would be appreciated

matthewdeng · November 21, 2023, 9:17pm

Maybe you can try sharing a minimal repro? It’s hard for me to say, since running multiple sets of parameters shouldn’t modify each individual run.

Topic		Replies	Views
Nested Cross Validation with Ray and Tune Ray Core	6	2897	March 19, 2021
Should not change your training model or data during the hyperparameters tuning! Ray Tune	2	367	January 26, 2022
Hyper parameter tuning performance Ray Tune	3	325	December 12, 2020
[tune] multiple runs with same hyperparameter, different random seed Ray Tune	4	2064	January 29, 2021
Model training is slower in Ray Tune Ray Tune	8	1125	June 30, 2023

Not receiving the same results from a manual Grid Search

Related topics