09 Reinforcement Learning II

How not to repeat yourself: we have already found a way, but maybe there is a better one somewhere.

Quiz

Traditional quiz, this time on calculating Q-values from training episodes using the temporal difference method.
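For practice before the quiz, here is a minimal sketch of one common temporal difference rule, the Q-learning style update Q(s,a) ← Q(s,a) + α·(r + γ·max Q(s',a') − Q(s,a)). The function name `td_q_values`, the episode format (lists of state, action, reward, next state tuples, with `None` marking a terminal state), the two-state example, and the values α = 0.5, γ = 1.0 are all assumptions made for illustration; the quiz itself may use a different variant or different parameters.

```python
from collections import defaultdict


def td_q_values(episodes, alpha=0.5, gamma=1.0):
    """Estimate Q-values from recorded episodes with a Q-learning style TD update.

    Each episode is a list of (state, action, reward, next_state) tuples;
    next_state is None when the transition ends the episode (assumed format).
    """
    q = defaultdict(float)      # Q(s, a), initialised to 0
    actions = defaultdict(set)  # actions observed so far in each state

    for episode in episodes:
        for state, action, reward, next_state in episode:
            actions[state].add(action)
            if next_state is None:
                best_next = 0.0  # no future reward after a terminal transition
            else:
                # Greedy value of the next state over actions seen there so far
                best_next = max(
                    (q[(next_state, a)] for a in actions[next_state]),
                    default=0.0,
                )
            sample = reward + gamma * best_next
            # Move the old estimate a fraction alpha towards the new sample
            q[(state, action)] += alpha * (sample - q[(state, action)])
    return dict(q)


# Hypothetical training data: the same two-step episode observed twice
episodes = [
    [("A", "right", 0.0, "B"), ("B", "right", 10.0, None)],
    [("A", "right", 0.0, "B"), ("B", "right", 10.0, None)],
]
q = td_q_values(episodes)
print(q[("A", "right")], q[("B", "right")])  # 2.5 7.5
```

Working through it by hand: after the first episode only Q(B, right) is non-zero (5.0); in the second episode that value propagates back to Q(A, right), which becomes 2.5, while Q(B, right) moves to 7.5.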

Individual Work: next assignment

Reinforcement learning plus

Reinforcement learning is now a very active area, also supported by rapid progress in deep neural network learning. A few links for further inspiration:

courses/be5b33kui/labs/weekly/week_09.txt · Last modified: 2019/04/05 11:04 by gamafili