I am using a policy client + server for training purposes, however, I can’t figure out how to have tensorboard display any information for the training runs? Is there a parameter I need to pass?
Moreover, can I have tensorboard statistics on the policy_client (even though this is a PPO model so all training happens on the server) for custom metrics (for clients I am mainly interested in specific reward sources to see what contributes to a reward over time to find trends in the AI’s decision making process).
I saw that, my question is, how do I actualy hook tensorboard to display my values? What folder do I specify from cli? And are other values logged automicaly (like avg episode length, loss etc) or I need to log them myself?
If you now call tensorboard --logdir C:\Users\yourusername\ray-results\PPO_RandomEnv_2021-11-11_07-39-05qxgyfj5g you should be able to see the results.
I don’t really have a stop criteria though - I just want to make a save of the model every x iterations (each iteration takes like 1-4 minutes to train and like 35 mins to collect enough samples)…