Hi,
I am using Ray Tune (ray[tune]) to find the best hyperparameter settings for a network implemented in PyTorch Lightning. Out of 100 trials, 3 raised errors due to numerical exceptions inside PyTorch's backward method. Is there any way to tell Ray to skip the failed trials and still return the best model from the trials that completed?
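To make the behaviour I am after concrete, here is a minimal plain-Python sketch. It does not use the Ray API at all: `run_trials`, `toy_train`, and the `lr` key are hypothetical names made up for illustration, and the toy training function only simulates a numerical failure.

```python
import math


def run_trials(configs, train_fn):
    """Run train_fn on every config, skipping trials that fail.

    A trial counts as failed if it raises an exception or returns a
    non-finite loss. Returns (best_config, best_loss, failed_trials).
    """
    results, failed = [], []
    for config in configs:
        try:
            loss = train_fn(config)
        except (RuntimeError, FloatingPointError, ValueError) as exc:
            # Record the failure and move on instead of aborting the search.
            failed.append((config, repr(exc)))
            continue
        if not math.isfinite(loss):
            failed.append((config, "non-finite loss"))
            continue
        results.append((config, loss))

    if not results:
        raise RuntimeError("all trials failed")

    # Pick the best trial among the ones that completed (lower loss is better).
    best_config, best_loss = min(results, key=lambda item: item[1])
    return best_config, best_loss, failed


def toy_train(config):
    # Hypothetical stand-in for a real training run.
    if config["lr"] > 1.0:
        raise RuntimeError("numerical error in backward")
    return (config["lr"] - 0.1) ** 2


configs = [{"lr": 0.01}, {"lr": 0.1}, {"lr": 5.0}]
best_config, best_loss, failed = run_trials(configs, toy_train)
print(best_config, best_loss, len(failed))
```

In other words, the failed trials should be recorded but ignored when choosing the best result. I would like to know whether Ray can do the equivalent of this for me.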
Thanks in advance