Tune.run() vs Tuner.fit()

raytune_kuberay_user · August 1, 2022, 9:54pm

Recently saw the readme change from using tune.run() to use Tuner.fit()

Will tune.run() API be removed in the future?
How stable is this new Tuner.fit() API?
Which API do you recommend?

xwjiang2010 · August 1, 2022, 11:32pm

@raytune_kuberay_user
Thanks for asking this question.
Tuner is the recommended way of running hpo workload on Ray AIR. The migration is needed for various Ray components (Ray Tune/ Ray Train etc) in Ray AIR to have consistent feel and APIs.
If you are looking to expand your use case beyond just tuning, Tuner would be a better API to use. Tuner is currently at the beta stage (with the new ray 2.0.0 release). In the long run, tune.run will be deprecated. If you see any gap with the tune.run API, please file a bug and let Ray Team know. Thank you!!

raytune_kuberay_user · August 2, 2022, 6:00pm

@xwjiang2010 Thanks for the reply.

In terms of functionality, are there anything that tune.run() supports and Tuner doesn’t support? (Also tuner supports but tune.run() doesn’t support?)

Our use cases are to use ray.tune + tensorflow + horovod/tf.distributed strategy. In the future, we might want to try elastic horovod with ray

kai · August 3, 2022, 2:41pm

There are minor API differences - e.g. the export_models argument has not been carried over to Tuner() as it will be deprecated (instead you should just export the models within the checkpoints).

At the moment, Tuner() uses tune.run internally, but this may change in the future.

We believe that all use cases should be covered in the Tuner() API, so if any functionality is missing, please let us know and we’ll add it!

As for benefits, the Tuner() API supports e.g. better restoration, failure handling, and a neater output format (results grid instead of experiment analysis).

zhh210 · November 4, 2022, 5:28pm

where to pass over checkpoint_freq in Tuner()? it works find in tune.run(checkpoint_freq=1) but can’t find where it can be passed to Tuner. Neither run_config nor checkpoint_config accepted checkpoint_freq.

xwjiang2010 · November 4, 2022, 5:38pm

use run_config.checkpoint_config.

a28091 · September 3, 2023, 1:56pm

Hi! In tune.run(…) it is possible to use

tune.run(..., config={"evaluation_config": {..}}

to evaluate your agent after training.
What is the proper way to implement this within tune.Tuner() API?

kai · September 5, 2023, 8:10am

@a28091 this is an rllib-specific setting, but it should work the same in tune.run and Tuner.fit() - i.e. just pass it as part of your param_space:

Tuner(
    ...,
    param_space={"evaluation_config": ...}
)

Topic		Replies	Views
Where do I find documentation on the tune.run method	3	2099	June 12, 2023
Improper 'run' - not string nor trainable Ray Tune	10	1862	March 8, 2021
How to interactively stop running trials Ray Tune	2	474	May 30, 2023
Correct way of using tuner.restore() Ray Tune	6	2263	November 16, 2022
Tuner.fit() never terminates Ray Tune	4	383	January 23, 2025

Tune.run() vs Tuner.fit()

Related topics