Looking for the tensorboard source code part

saeid93 · February 8, 2021, 10:39pm

I looked into the rllib libarary, but I had difficulty finding the file responsible for writing training results to the tensorboard. Also, where exactly min, max, and mean of the custom metrics I add via callbacks are being computed? Is it min, max and mean over the episodes or per episode? Furthermore, how the steps in the tensorboard are being computing? In other words, what is the x-axis of each graph means in the tensorboard console? Is it the batch size timesteps or is it the mean of the result of all episodes in the same timestep?

rliaw · February 9, 2021, 4:29am

Tensorboard is being created in ray/tune/logger.py.

@sven1977 can tell you more about min/max/mean.

saeid93 · February 9, 2021, 11:28am

Thank you for your answer!
for the min/max/mean, I mean the final results that are shown in the tensorboard graph e.g. for the custom metric I have named num_consolidated I have only added the num_consolidated to my callback not it’s min/max/mean but in the tensorboard result I see three values for num_consolidated_max, num_consolidated_mean and num_consolidated_min.

There is the same case for rllib built-in tensorboard stats e.g. reward that it shows a min/max/mean graph for each of the stats. how these min/max/mean is computed? Is it min/max/mean of all episodes in a training batch? And also where in the rllib source code they are being computed?

My other question was what does the x-axis in the tensorboard graphs represents? Is it the number of steps in a batch? or steps in a sample episode?
For example, in the graphs I sent you, what does step 16.8k in the x-axis means in RL terminology?

NicoleRichards1998 · May 4, 2022, 8:10am

Hi, sorry I’m a bit late to this discussion. I have the exact same questions that you had, did you ever manage to find answers to them?

saeid93 · May 4, 2022, 8:52am

I think I finally managed to find something but honestly, I don’t remember. I think I figured it out by looking at the code and as far as I remember the code responsible for the metrics (not sure) was part of the ray tune code rather than the rllib.

avnishn · May 4, 2022, 8:36pm

Min, Max, and Mean are computed on a per episode basis, and then averaged across episodes. The min and max are not global mins and maxes

Topic		Replies	Views
Custom Tensorboard Metric (episode.total_reward auto generates as mean, min, max) RLlib	5	269	June 24, 2024
How rllib train log the reward on tensorboard? RLlib	1	538	March 25, 2022
Add episode reward variance into matrix and tensorboard RLlib	4	541	February 15, 2022
Custom metrics only mean value RLlib	3	883	February 16, 2022
Mean reward per agent in MARL RLlib	11	1119	January 12, 2023

Looking for the tensorboard source code part

Related topics