09 Reinforcement Learning II

How not to repeat yourself. We've already found a way, but maybe there's a better place somewhere.

Quiz

The traditional quiz. This time you will calculate Q-values from training episodes using the temporal-difference method.
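As a warm-up for this kind of calculation, here is a minimal sketch of a temporal-difference update replayed over recorded episodes. It assumes the Q-learning form of the update, Q(s,a) ← Q(s,a) + α·(r + γ·max Q(s',a') − Q(s,a)), with all Q-values initialized to zero. The function name `td_q_values`, the episode format, and the values of `alpha` and `gamma` are illustrative assumptions, not the quiz setup; the quiz may use a different variant or different constants.

```python
from collections import defaultdict


def td_q_values(episodes, actions, alpha=0.5, gamma=1.0):
    """Replay recorded episodes and apply a tabular Q-learning (TD) update.

    Each episode is a list of (state, action, reward, next_state) tuples;
    next_state is None when the transition ends the episode.
    This format is an assumption made for the sketch.
    """
    q = defaultdict(float)  # unseen (state, action) pairs start at 0
    for episode in episodes:
        for state, action, reward, next_state in episode:
            if next_state is None:
                best_next = 0.0  # no future value after a terminal step
            else:
                best_next = max(q[(next_state, a)] for a in actions)
            target = reward + gamma * best_next
            # Move the old estimate a fraction alpha toward the TD target
            q[(state, action)] += alpha * (target - q[(state, action)])
    return dict(q)


# The same two-step episode observed twice: A --right--> B --right--> end
episode = [("A", "right", 0, "B"), ("B", "right", 10, None)]
q = td_q_values([episode, episode], actions=["left", "right"])
print(q[("A", "right")], q[("B", "right")])
```

After the first pass only Q(B, right) changes, because the reward arrives at the last step. The second pass propagates part of that value back to Q(A, right), which is the typical pattern to watch for when doing the update by hand.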

Individual Work: next assignment

Work on the Reinforcement learning assignment.

Reinforcement learning plus

Reinforcement learning is now a very active research area, boosted by rapid progress in deep neural networks. A few links for further inspiration:

Table tennis robot player. Starts from imitation, then generalizes through RL.
Robotics@google. Well, they can afford many learning episodes and many iterations.
Pong game. Learning to play the very old computer game with the help of OpenAI Gym. YouTube video.
