[RLlib] Distinguishing hyperparameter tuning from a single execution of an RL algorithm

Thanks, @RickLan. I have one more related question, which I posted here: https://discuss.ray.io/t/rlllib-how-to-use-policy-learned-in-tune-run/2222