I am using step() to return my losses like this:
return {'train_epoch_loss': train_epoch_loss.detach().numpy(), 'validate_epoch_loss': validate_epoch_loss.detach().numpy()}
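For context, here is a minimal sketch of what these returned values look like, assuming both losses are zero-dimensional tensors (I have not confirmed this is related to the problem). Calling .detach().numpy() on such a tensor gives a zero-dimensional numpy.ndarray rather than a plain Python float. The array below is a made-up stand-in built directly with NumPy so the snippet runs without PyTorch, and to_scalar_metrics is a hypothetical helper, not part of Ray Tune:

```python
import numpy as np

# Stand-in for train_epoch_loss.detach().numpy() on a zero-dimensional tensor.
# The value 0.25 is made up; the real loss comes from the training loop.
train_epoch_loss = np.array(0.25, dtype=np.float32)

# The reported metric is an ndarray with an empty shape, not a Python number.
print(type(train_epoch_loss).__name__, train_epoch_loss.shape)

# Converting a zero-dimensional array to a built-in float.
as_float = float(train_epoch_loss)
print(type(as_float).__name__)


def to_scalar_metrics(metrics):
    """Hypothetical helper: convert every metric value to a built-in float."""
    return {name: float(value) for name, value in metrics.items()}


converted = to_scalar_metrics({"train_epoch_loss": train_epoch_loss})
```

Whether reporting plain floats instead changes what TensorBoard and get_best_config see is something I would still need to verify.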
While the experiment runs, both metrics appear in the CLIReporter together with the standard metrics such as the hyperparameters or time_this_iter_s.
Once the trials terminate, both metrics also appear in progress.csv and result.json. However, they do not appear under “Scalars” or “Hparams” in TensorBoard (only under “Distributions”), and calling analysis.get_best_config(metric='validate_epoch_loss', mode='min', scope='last') only returns:
WARNING experiment_analysis.py:557 -- Could not find best trial. Did you pass the correct metric parameter?
Any idea how to resolve this?
Ray version: 1.0.1.post1
PyTorch version: 1.7.0