Use Policy_Trainer with TensorBoard

Hi All,

I am using a policy client + server setup for training; however, I can't figure out how to get TensorBoard to display any information for the training runs. Is there a parameter I need to pass?

Moreover, can I get TensorBoard statistics on the policy_client side (even though this is a PPO model, so all training happens on the server) for custom metrics? On the client side I am mainly interested in specific reward sources, to see what contributes to the reward over time and find trends in the AI's decision-making process.

Thanks in advance!

You can log your custom metrics in a custom callbacks class. See: RLlib Training APIs — Ray v1.8.0
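
For example, a minimal sketch of such a callbacks class (assuming your env reports a per-episode reward breakdown in its info dict; the "reward_sources" key and metric names here are just placeholders for whatever your env actually returns):

from ray.rllib.agents.callbacks import DefaultCallbacks

class RewardSourceCallbacks(DefaultCallbacks):
    def on_episode_end(self, *, worker, base_env, policies, episode, env_index, **kwargs):
        # Assumes the env puts something like
        # info["reward_sources"] = {"goal": 3.0, "time_penalty": -0.5}
        # into its info dict.
        info = episode.last_info_for() or {}
        for source, value in info.get("reward_sources", {}).items():
            episode.custom_metrics[f"reward_{source}"] = value

Then pass it to the trainer with config["callbacks"] = RewardSourceCallbacks; the values should show up in TensorBoard under custom_metrics with _mean/_min/_max suffixes.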

I saw that. My question is: how do I actually hook up TensorBoard to display my values? What folder do I specify from the CLI? And are other values logged automatically (like average episode length, loss, etc.), or do I need to log them myself?

Normally, logs are stored in ~/ray_results. You can run the TensorBoard command from inside the ray_results folder.

Where’s the ~/ray_results folder located?

Hmm… Do you use Ray on Linux?

No, it's currently on Windows.

Probably in C:\Users\yourusername\ray_results?


Oh, I only know where the ray_results folder is on Linux, so I can't answer your question. Sorry about that.

Does this look correct?

Yes, it does :slight_smile:

If you now call tensorboard --logdir C:\Users\yourusername\ray_results\PPO_RandomEnv_2021-11-11_07-39-05qxgyfj5g you should be able to see the results.


Yup that works! Awesome, thanks a bunch everyone!


One last question: how would I go about giving a specific name to the folders so they are more legible and not a complete slew of characters?

I have never done it myself, as I find the default name sufficient (trainer used, date, time, and hash code). You can also just call

tensorboard --logdir C:\Users\yourusername\ray_results

to get all logs loaded into TensorBoard. You can then check the runs you want to see.

In case you still want to change names, this might help you.

You can provide a name to tune.run, which I think should affect the name of the log directory.

That would be very nice. Is this as simple as adding a few lines to call tune.run(policy_trainer)?

tune.run(policy_trainer, name="run_name")


# (loop header and import shown for context; i, trainer, and checkpoint_path
# are defined earlier in the script)
from ray.tune.logger import pretty_print

while True:
    print(pretty_print(trainer.train()))
    print(f"Finished train run #{i + 1}")
    i += 1
    if i % 2 == 0:  # save a checkpoint every 2 iterations
        checkpoint = trainer.save(checkpoint_path)
        print("Last checkpoint", checkpoint)

That's currently my loop. Do I change print(pretty_print(trainer.train())) to tune.run(trainer, name="run_name")?

You could do, for example:

tune.run("PPO", config=config, stop={#your stop criteria}, name="run_name")

Another option: the default run name includes the env name, so you could register the env under different names that include whatever extra info you want for each run.
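
For example, a rough sketch of both options (the config values, stop criterion, and names below are placeholders, and RandomEnv just stands in for your own env class):

from ray import tune
from ray.rllib.examples.env.random_env import RandomEnv  # stand-in for your own env
from ray.tune.registry import register_env

# Option 1: give the Tune run an explicit name.
config = {"env": RandomEnv, "num_workers": 1}
tune.run("PPO", config=config, stop={"training_iteration": 100}, name="ppo_reward_shaping_v1")

# Option 2: register the env under a descriptive name and point the config at it;
# the auto-generated folder name (PPO_<env name>_<date>_<hash>) then carries that info.
register_env("RandomEnv-baseline-reward", lambda env_config: RandomEnv(env_config))
config = {"env": "RandomEnv-baseline-reward", "num_workers": 1}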

I don't really have stop criteria though; I just want to save the model every x iterations (each iteration takes 1-4 minutes to train and about 35 minutes to collect enough samples)…