Issue to run the TuneGridSearchCV

ivanv6 · December 29, 2020, 8:32am

Hi, I’m trying to use the TuneGridSearchCV to optimize hyperparameterer tuning but, after installlation, I found this error:

    A worker died or was killed while executing task ffffffffffffffff3bf0c85601000000.
    A worker died or was killed while executing task ffffffffffffffffa7357af301000000.
    The actor or task with ID ffffffffffffffffe1083dbe01000000 cannot be scheduled right now. It requires {CPU: 1.000000} for placement, but this node only has remaining {memory: 4.052734 GiB}, {CPU: 4.000000}, {node:192.168.1.71: 1.000000}, {object_store_memory: 1.367188 GiB}. In total there are 0 pending tasks and 4 pending actors on this node. This is likely due to all cluster resources being claimed by actors. To resolve the issue, consider creating fewer actors or increase the resources available to this Ray cluster. You can ignore this message if this Ray cluster is expected to auto-scale.

My code is below:

    from sklearn.tree import DecisionTreeClassifier
    from ray.tune.sklearn import TuneGridSearchCV 
    clf = DecisionTreeClassifier()
    #hypertuning paramenters
    parameter_grid = {
    'criterion':['gini','entropy'],
    'splitter':['best','random'],
    'max_depth': [5, 8,10, 15, 25],
    'min_samples_split' : [2, 5, 10, 15,20,25],
    'min_samples_leaf' : [1, 2, 5, 10,15,20],
    'random_state' : [seed],
                        }
    print("TUNING ############################")
    startgrid=time.time()
    cv = RepeatedStratifiedKFold(n_splits=10, n_repeats=3, random_state=1)
    grid_searchdt = TuneGridSearchCV(clf, parameter_grid, cv =3, verbose = 0, n_jobs = -1,early_stopping=False,max_iters=10)
    bestDT = grid_searchdt.fit(x_train, y_train)
    print(bestDT.best_params_)
    best_grid_dt = bestDT.best_estimator_
    print(best_grid_dt)
    endgrid = time.time()
    print("Grid time: "+str(endgrid-startgrid))

Could you please help me with this error? Thanks

rliaw · December 29, 2020, 9:03am

This seems odd; can you post the zip of your /tmp/ray/session_latest/logs?

ivanv6 · December 29, 2020, 9:26am

Sure, here the zip file WeTransfer - Send Large Files & Share Photos Online - Up to 2GB Free

If you want, i’m also on slack

rliaw · December 29, 2020, 7:05pm

Cool, thanks!

Let’s stay on discourse for now as it is more visible. Can you provide the output from running verbose=3? It seems like things are failing after a while, but not immediately.

ivanv6 · January 4, 2021, 9:17am

Yes of course. Here the output with verbose = 3.

Topic		Replies	Views
Tune.run works but TuneGridSearchCV.fit does not work for me Ray Tune	15	1156	August 31, 2022
Issue running TuneGridSearchCV Ray Tune	0	333	October 7, 2021
Issue with TuneGridSearchCV Ray Tune	1	424	October 20, 2021
Warnings with TuneSearchCV Ray Tune	15	1062	May 7, 2021
A worker died or was killed while executing a task by an unexpected system error Ray Tune	6	4438	May 8, 2023

Issue to run the TuneGridSearchCV

Related topics