Hi all, I’m getting back into reinforcement learning, so I thought I’d try some basic algorithms again. However, I’m getting an error that I have never seen before, and no one else seems to have run into it either. All I could find is a similar error that Ray actors throw on purpose, but it doesn’t make sense that an error thrown by design would not be handled properly. The error seems to occur after training is complete, and the training results look fine.
Here’s my code and a screenshot of the error:
import ray.rllib.agents.ppo  # makes rllib.agents.ppo resolvable below
from ray import rllib, tune

analysis = tune.run(
    rllib.agents.ppo.PPOTrainer,
    name='trainingwheels',
    config={'env': 'CartPole-v0'},
    stop={
        'episode_reward_mean': 195,
        # fallback in case the mean episode reward does not get high enough
        'training_iteration': 15,
    },
    local_dir='ray_results',
    verbose=1,
)
Does anyone know how I can fix this?