
08 Reinforcement Learning I

We do not know the model of the robot agent: it behaves somewhat unpredictably, the path to the goal is unknown, and there are traps along the way. What do we do?

Quiz

Traditional quiz: calculate Q values from training episodes using direct evaluation.
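As a warm-up for the quiz, direct evaluation can be sketched in a few lines of Python. This is only an illustrative sketch under stated assumptions, not the course's reference solution: each episode is assumed to be a list of (state, action, reward) triples, returns are averaged over every visit of a state-action pair, and the function name `direct_evaluation_q`, the discount `gamma`, and the toy episodes are hypothetical.

```python
from collections import defaultdict


def direct_evaluation_q(episodes, gamma=1.0):
    """Estimate Q(s, a) as the average return observed after taking a in s.

    Assumption: every-visit averaging; each episode is a list of
    (state, action, reward) triples, where reward is received after the action.
    """
    totals = defaultdict(float)  # sum of returns observed for each (s, a)
    counts = defaultdict(int)    # number of visits of each (s, a)

    for episode in episodes:
        ret = 0.0
        # Walk the episode backwards so the discounted return
        # accumulates in a single pass.
        for state, action, reward in reversed(episode):
            ret = reward + gamma * ret
            totals[(state, action)] += ret
            counts[(state, action)] += 1

    return {sa: totals[sa] / counts[sa] for sa in totals}


# Hypothetical toy data: two short episodes starting in state "A".
episodes = [
    [("A", "right", -1), ("B", "right", 10)],
    [("A", "right", -1), ("B", "left", -10)],
]
q = direct_evaluation_q(episodes, gamma=1.0)
for (state, action), value in sorted(q.items()):
    print(f"Q({state}, {action}) = {value}")
```

With these toy episodes, the pair (A, right) is followed by returns of 9 and -11, so its estimate is their average, -1. Note that direct evaluation needs no transition model: it only averages the returns that were actually observed.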

Individual work

Finish the Markov decision process assignment: deadline at the end of the week.

Start working on the Reinforcement Learning assignment; the deadline is listed in BRUTE.

Other