Search
How not to repeat yourself. We've already found a way, but maybe there's a better place somewhere. Aka exploration vs. exploitation.
After this practice session, the student
Effect of discount factor on policy. See pdf
Reinforcement learning is now a very active area, also supported by rapid progress in deep neural network learning. A few links for further inspiration: