I am interested in learning how to customize policies/models by reading DQN’s code (because the official RLlib documentation is really hard to follow). However, I get pretty confused when reading the code.
Where should I start learning/reading?
Is there a clearer tutorial on customizing policies/models?
Do I need a strong TensorFlow or PyTorch background?