As per the docs, I implemented the Weights & Biases integration with Ray Tune as follows:
from ray.air.integrations.wandb import WandbLoggerCallback
wandb_callback = WandbLoggerCallback(project="Ray Tune Trial Run", log_config=True, save_checkpoints=True)
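For context, this is roughly how I wire the callback into the Tuner (a minimal sketch; the "PPO" trainable, environment, and search space are placeholders for my actual setup):

from ray import air, tune
from ray.air.integrations.wandb import WandbLoggerCallback

wandb_callback = WandbLoggerCallback(
    project="Ray Tune Trial Run",
    log_config=True,
    save_checkpoints=True,  # upload trial checkpoints to W&B as artifacts
)

tuner = tune.Tuner(
    "PPO",  # RLlib algorithm registered with Tune; placeholder for my trainable
    param_space={"env": "CartPole-v1", "lr": tune.grid_search([1e-3, 1e-4])},
    run_config=air.RunConfig(
        callbacks=[wandb_callback],
        checkpoint_config=air.CheckpointConfig(checkpoint_frequency=1),
    ),
)
results = tuner.fit()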
But when I check the artifacts that are saved because of save_checkpoints=True and download them, I am unable to load the RL agent. The checkpoints are not being stored. Am I missing something, or do I have to save the checkpoints manually?
Hi @Athe-kunal,
Are you using an S3 upload directory? If so, this is because trial artifacts (these wandb checkpoints) are not being uploaded to the cloud. We have recently added artifact syncing here: https://github.com/ray-project/ray/pull/32334.
You can try it out on the latest Ray nightly. Note that there are still a few limitations (see the PR description for more details). Let me know if any of these limitations block your usage.
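For reference, by "S3 upload directory" I mean a setup along these lines (a hypothetical sketch on the Ray ~2.3 API; the bucket name is a placeholder):

from ray import air, tune

run_config = air.RunConfig(
    callbacks=[wandb_callback],
    # sync trial results/checkpoints to cloud storage instead of keeping them local
    sync_config=tune.SyncConfig(upload_dir="s3://my-bucket/ray-results"),
)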
Hi @justinvyu
No, I am not using an S3 upload directory. I am training the model in a local directory and uploading to Weights & Biases. But I am only getting the policy checkpoints in Weights & Biases, and I cannot load the model from them.
Are you trying to load an RLlib policy? Does Policy.from_checkpoint work for you? See Saving and Loading your RL Algorithms and Policies — Ray 2.3.0.
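Something along these lines should work for loading (a minimal sketch; the checkpoint path and the observation are placeholders for whatever you downloaded from W&B):

from ray.rllib.policy.policy import Policy

# For an algorithm checkpoint this returns a dict mapping policy IDs
# (e.g. "default_policy") to Policy objects; for a single policy
# checkpoint it returns that Policy directly.
restored = Policy.from_checkpoint("/path/to/downloaded/checkpoint")
policy = restored["default_policy"] if isinstance(restored, dict) else restored

# Policy.compute_single_action returns (action, rnn_state_out, extra_info).
# my_observation is a placeholder for an observation from your environment.
action, _, _ = policy.compute_single_action(obs=my_observation)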
Thank you @justinvyu for clarifying. I will follow that GitHub PR closely to resolve this.