09 Reinforcement Learning II

How not to keep repeating yourself: we have already found one way, but maybe there is a better one somewhere. In other words, exploration vs. exploitation.

Learning outcomes

After this practice session, the student

Program

Exercise / Solving together

Effect of the discount factor on the policy. See the pdf.
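The pdf itself is not reproduced here. As a minimal sketch of the idea, assume a hypothetical five-state corridor (not taken from the course materials): entering the left end pays +1, entering the right end pays +10, every other step pays 0, and moves are deterministic. The function name `greedy_policy` and all reward values are illustrative assumptions. Value iteration with two different discount factors shows how the greedy action in the state next to the small reward can flip.

```python
def greedy_policy(gamma, n_states=5, left_reward=1.0, right_reward=10.0, sweeps=100):
    """Value iteration on a deterministic corridor (hypothetical toy MDP).

    States 0 and n_states-1 are terminal. Returns the greedy action
    ("left" or "right") for every non-terminal state.
    """
    values = [0.0] * n_states  # terminal states keep value 0

    def q(state, step):
        # Action value of moving one cell left (step=-1) or right (step=+1).
        nxt = state + step
        if nxt == 0:
            reward = left_reward
        elif nxt == n_states - 1:
            reward = right_reward
        else:
            reward = 0.0
        return reward + gamma * values[nxt]

    for _ in range(sweeps):
        for s in range(1, n_states - 1):
            values[s] = max(q(s, -1), q(s, +1))

    return {s: ("left" if q(s, -1) > q(s, +1) else "right")
            for s in range(1, n_states - 1)}


for gamma in (0.2, 0.9):
    print(gamma, greedy_policy(gamma))
```

In this toy setting, state 1 prefers the nearby +1 when 1 > 10·γ², i.e. for γ below roughly 0.32, and heads for the distant +10 otherwise. A small discount factor makes the agent short-sighted, a large one makes it patient.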

Bonus quiz

Homework

Reinforcement learning plus

Reinforcement learning is currently a very active research area, driven in part by rapid progress in deep neural networks. A few links for further inspiration: