This is related to this:
However, I need to use Ray Tune for the Wandb integration, and those solutions use PPOTrainer directly.