Hello, I am training a PPO agent with RLlib on a Windows machine, and then I copy the experiment folder to a different machine running Linux for testing purposes.
To get the analysis object I perform the following operation, hoping to be able to call
get_best_checkpoint() and then build the PPO algorithm from the checkpoint:
analysis_object = ExperimentAnalysis(Linux_experiment_path,
The path stored in analysis_object always refers to the original Windows path, which produces errors.
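To illustrate why a recorded Windows path cannot be used directly on Linux, here is a minimal sketch using only the standard library. The example paths and the helper name `rebase_windows_path` are hypothetical and not part of Ray; the sketch only shows how a stored Windows path could be mapped onto the copied folder.

```python
from pathlib import PurePosixPath, PureWindowsPath


def rebase_windows_path(stored: str, old_root: str, new_root: str) -> PurePosixPath:
    """Map a path recorded on Windows onto the copied folder on Linux.

    Hypothetical helper: `old_root` is the experiment folder on the Windows
    machine, `new_root` is where that folder was copied to on Linux.
    """
    # Parse with Windows rules so that backslashes act as separators.
    relative = PureWindowsPath(stored).relative_to(PureWindowsPath(old_root))
    # Rebuild the same relative location under the Linux root.
    return PurePosixPath(new_root).joinpath(*relative.parts)


# Made-up example paths, for illustration only.
stored = r"C:\Users\me\ray_results\PPO_exp\trial_0\checkpoint_000010"

# Under POSIX rules the whole string is one relative component, so it can
# never resolve to the copied folder.
print(PurePosixPath(stored).is_absolute())
print(len(PurePosixPath(stored).parts))

print(rebase_windows_path(
    stored,
    old_root=r"C:\Users\me\ray_results\PPO_exp",
    new_root="/home/me/ray_results/PPO_exp",
))
```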
What is the proper workflow in this case?
Please let me know if you find a solution.
This is a known issue that’s being tracked here: [Train/Tune] Restore an experiment from a different machine/path · Issue #40585 · ray-project/ray · GitHub
Targeting a fix for Ray 2.9, but will keep this thread updated if a nightly is available earlier for you to use. Thanks for raising this issue.
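Until a fix is available, one possible workaround is to skip the stored paths entirely and look for checkpoint directories in the copied folder. The sketch below is not an official Ray recipe. It assumes checkpoint directories are named `checkpoint_<number>`, which may differ between Ray versions, and it picks the highest-numbered checkpoint rather than the best one by metric. The helper names `find_latest_checkpoint` and `load_algorithm` are hypothetical; `Algorithm.from_checkpoint` is the RLlib call for rebuilding an algorithm from a checkpoint directory.

```python
import re
from pathlib import Path
from typing import Optional

# Assumption: checkpoint folders look like "checkpoint_000010".
CHECKPOINT_DIR = re.compile(r"^checkpoint_(\d+)$")


def find_latest_checkpoint(experiment_dir: str) -> Optional[Path]:
    """Return the highest-numbered checkpoint directory below experiment_dir."""
    best = None
    for path in Path(experiment_dir).rglob("checkpoint_*"):
        match = CHECKPOINT_DIR.match(path.name)
        if path.is_dir() and match:
            index = int(match.group(1))
            if best is None or index > best[0]:
                best = (index, path)
    return None if best is None else best[1]


def load_algorithm(checkpoint_dir: Path):
    """Rebuild the algorithm from a checkpoint directory (requires Ray RLlib)."""
    # Imported lazily so the directory search above works without Ray installed.
    from ray.rllib.algorithms.algorithm import Algorithm

    return Algorithm.from_checkpoint(str(checkpoint_dir))
```

With the folder copied to Linux, `load_algorithm(find_latest_checkpoint(Linux_experiment_path))` would then rebuild the agent from the most recent checkpoint without going through ExperimentAnalysis.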
Thank you very much @justinvyu
I am glad it is something that has already been identified.
I have been trying to solve this for days before moving on, since if I trained models on the remote machine I could not load the checkpoints on my local machine to test, analyse, etc.
Thank you very much. As soon as I try it I will report back here.