
08 Reinforcement Learning I

We don't know the model of the robot agent: it behaves somewhat unpredictably, the path to the goal is unknown, and there are traps along the way. What do we do?


Traditional quiz: calculate Q values from training episodes using direct evaluation.
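As a rough illustration of the quiz task, direct evaluation can be sketched as averaging the returns observed after each visit to a state-action pair. This is only a sketch under assumptions that are not part of the course materials: episodes are lists of (state, action, reward) triples, every visit is counted, and the function name `direct_evaluation_q`, the discount `gamma` and the sample episodes are all hypothetical.

```python
from collections import defaultdict


def direct_evaluation_q(episodes, gamma=1.0):
    """Estimate Q(s, a) as the mean discounted return seen after each visit to (s, a).

    `episodes` is assumed to be a list of episodes, each a list of
    (state, action, reward) triples in the order they were experienced.
    """
    totals = defaultdict(float)  # sum of returns observed for each (state, action)
    counts = defaultdict(int)    # number of visits to each (state, action)

    for episode in episodes:
        ret = 0.0
        # Walk the episode backwards so the return-to-go accumulates in one pass.
        for state, action, reward in reversed(episode):
            ret = reward + gamma * ret
            totals[(state, action)] += ret
            counts[(state, action)] += 1

    return {sa: totals[sa] / counts[sa] for sa in totals}


# Hypothetical training episodes (illustrative data only).
episodes = [
    [("A", "right", -1), ("B", "right", 10)],
    [("A", "right", -1), ("B", "right", -10)],
]

q = direct_evaluation_q(episodes)
for (state, action), value in sorted(q.items()):
    print(f"Q({state}, {action}) = {value}")
```

With these two made-up episodes and no discounting, the returns from (A, right) are 9 and -11, so the estimate is their mean; no transition or reward model is needed, which is the point of the method.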

Individual work

Finish the Markov decision process assignment: deadline at the end of the week.

Start working on the Reinforcement Learning assignment; see BRUTE for the deadline.


courses/be5b33kui/labs/weekly/week_08.txt · Last modified: 2019/04/08 10:14 by gamafili