PER Buffer throws KeyError during training of SAC

Hi,

Yes, I shall first test it out with the latest version of Ray (If there has been an update since when I posted this) before raising an issue in github/ray-project/ray