Support for annealing gamma

I just saw this: [RLlib] updating batch_size or similar while training - #2 by sven1977