Hi everyone, this should be a straight forward question but when I connect to an external cluster via
ray.init(address=f"ray://{head_node_ip_address}:10001")
and run a Tune job, I am not able to see the Jupyter notebook progress reporter.
My setup look something like this:
tuner = tune.Tuner(
MyTrainable,
tune_config=tune.TuneConfig(
# configs here
),
param_space=search_space,
run_config = air.RunConfig(
progress_reporter=JupyterNotebookReporter(overwrite=True),
checkpoint_config=air.CheckpointConfig(
checkpoint_at_end=False,
checkpoint_frequency=0,
num_to_keep=None,
),
sync_config=sync_config,
local_dir=log_dir,
log_to_file=False,
verbose=1,
)
)
When I run .fit()
I get this in the cells:
(TunerInternal pid=286391) <IPython.core.display.HTML object>
(TunerInternal pid=286391) <IPython.core.display.HTML object>
(TunerInternal pid=286391) <IPython.core.display.HTML object>
...
(TunerInternal pid=286391) <IPython.core.display.HTML object>
(TunerInternal pid=286391) <IPython.core.display.HTML object>
(TunerInternal pid=286391) <IPython.core.display.HTML object>
What I am expecting is this:
== Status ==
Current time: 2022-10-25 10:09:10 (running for 00:00:12.81)
...
Number of trials: 3/250 (1 PENDING, 2 RUNNING)
Trial Name | status | loc | x | y |
---|---|---|---|---|
TrainableFunc_f3139036 | RUNNING | 0.319237 | 0.319237 | |
TrainableFunc_f3139036 | RUNNING | 0.44374 | 1.319237 | |
TrainableFunc_f3139036 | RUNNING | 0.77797 | 2.319237 | |
… | … | … | … | … |
TrainableFunc_f3139036 | RUNNING | 0.77797 | 2.319237 |
What am I doing wrong here ?
Thanks!