Hello, RLlib PPO does not seem to solve a very easy instance of a 1D positioning problem. The env simply tries to make the agent move to a specific point with a max velocity of 1. I tried some parameters, but it never seems to break -150 in reward, even though other simple PPO libraries converge very qu…

Ray RLlib PPO does not solve a very simple problem

Benedikt_Schesch November 8, 2023, 4:05pm 3

The solution was to use the change in reward between steps as the reward signal, rather than using the distance to the target directly.
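For anyone landing here later, this is a minimal sketch of one reading of that fix. The original env was not posted, so everything here is an assumption: the class name `Positioning1D`, the start range, the 0.05 tolerance, and the `delta_reward` flag are all hypothetical. The class only mirrors the Gymnasium `reset`/`step` return shape and does not import Gymnasium or RLlib.

```python
import random


class Positioning1D:
    """Hypothetical 1D positioning task: reach `target` with |velocity| <= 1."""

    def __init__(self, target=0.0, max_steps=200, delta_reward=True, seed=None):
        self.target = target
        self.max_steps = max_steps
        self.delta_reward = delta_reward  # True: reward progress; False: reward -distance
        self.rng = random.Random(seed)
        self.pos = 0.0
        self.steps = 0

    def reset(self):
        # Assumed start range; the thread does not say how the env was initialized.
        self.pos = self.rng.uniform(-10.0, 10.0)
        self.steps = 0
        return self.pos, {}

    def step(self, action):
        # Clip the action to the max velocity of 1 described in the question.
        velocity = max(-1.0, min(1.0, float(action)))
        prev_dist = abs(self.pos - self.target)
        self.pos += velocity
        dist = abs(self.pos - self.target)
        self.steps += 1

        if self.delta_reward:
            # Change in distance: positive when the agent moved closer.
            reward = prev_dist - dist
        else:
            # Raw negative distance, the variant that reportedly failed to train.
            reward = -dist

        terminated = dist < 0.05  # assumed tolerance
        truncated = self.steps >= self.max_steps
        return self.pos, reward, terminated, truncated, {}
```

With the delta variant, the per-step rewards telescope: the undiscounted return of an episode equals the initial distance minus the final distance, so it stays bounded by the start distance instead of growing with episode length the way a summed negative distance does. That may be part of why PPO's value function has an easier time here, though the thread does not confirm the exact cause.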
