[Rllib] Proper number for PPO rollout workers

Here is a helpful rule of thumb: Training APIs — Ray 1.13.0

Here is a similar issue where I ask a question about what seems to be performance slow down wrt number of workers (unfortunately have not had time to explore this more): Num workers speedup?

I suggest you perform a few scaling studies to see what works well for your computer+algorithm+simulation.