SAC Training Performance Detirioration

Stale_neutrino · June 14, 2022, 7:01pm

Hey guys,

Disclaimer, first time working with an RL agent !

For my project we are using an SAC agent to drive a vehicle in a simulation. Long story short, I’ve had 0 success getting my agent to reach an optimal state. Furthermore, the agent seems to be performing great in the beginning and then its performance drops drastically later in training. This run used the default HPs for SAC.

Due to a memory leak with the sim that I;m using I can only run the agent for about about 20K steps before having to save the last checkpoint and restore.

Thanks

christy · June 16, 2022, 3:59am

Hi there!

Would you like to ask your question in RLlib Office Hours? Just add your question to this doc: RLlib Office Hours - Google Docs

Thanks! Hope to see you there!

christy · June 24, 2022, 5:26am

Hi Sebastian, I moved your question to July 5th Office Hours, hope that’s OK? Let me know if you need help getting unstuck sooner.

Thanks!
Christy

sven1977 · July 5, 2022, 11:48am

Hey @Stale_neutrino , thanks for posting this question.
If you have an unstable environment, you may want to try some of RLlib’s fault tolerance settings described here:

recreate_failed_workers=True
restart_failed_sub_environments=True
ignore_worker_failures=True

In order for these settings to work, though, you would need some remote workers setup.
For SAC, you usually wouldn’t use many remote workers (default is actually 0, meaning just use the “local” one), but maybe you can try it with at least a few remote workers: num_workers=2 or so.

On the collapsing performance: Not sure what this could be. Hard to tell from this distance.

Topic		Replies	Views
SAC Agent 'Forgets' During Training RLlib	5	297	September 13, 2022
SAC trainer slows down drastically RLlib	6	670	May 29, 2022
SAC terminates early in the training RLlib	2	273	October 27, 2021
Agent consistently stops improving at the same point, despite not appearing to be in a local maxima RLlib	2	44	June 18, 2025
Scaling Battle Experiments RLlib	1	373	January 26, 2022

SAC Training Performance Detirioration

Related topics