08 Reinforcement Learning I

We don't know the model of the robot agent: it behaves somewhat unpredictably, the path to the goal is unknown, and there are traps along the way. What do we do?

Exercise for bonus points

  • Direct Q value evaluation
  • 0.5 points
  • submit your solution to BRUTE lab08quiz by April 17, midnight
  • format: text file, photo of your solution on paper, or PDF, whichever is convenient for you
  • the solution will be discussed in the next lab
  • Students whose family name starts with A to K (inclusive) must solve and upload subject A, while students whose family name starts with L to Z must solve and upload subject B.
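
As a reminder of what direct Q-value evaluation means in general, here is a minimal sketch that averages the returns observed after each (state, action) pair over a set of recorded episodes. The episode format (a list of `(state, action, reward)` tuples), the function name `direct_q_evaluation`, the discount `gamma`, and the toy data are all assumptions made for illustration; they are not taken from the quiz subjects.

```python
from collections import defaultdict


def direct_q_evaluation(episodes, gamma=1.0):
    """Estimate Q(s, a) as the average discounted return seen after each (s, a).

    Assumed (hypothetical) format: each episode is a list of
    (state, action, reward) tuples, where reward is received after the action.
    Every visit of a pair contributes one sample to its average.
    """
    totals = defaultdict(float)  # sum of returns per (state, action)
    counts = defaultdict(int)    # number of samples per (state, action)
    for episode in episodes:
        g = 0.0
        # Walk backwards so that g is the return from the current step onward.
        for state, action, reward in reversed(episode):
            g = reward + gamma * g
            totals[(state, action)] += g
            counts[(state, action)] += 1
    return {sa: totals[sa] / counts[sa] for sa in totals}


# Toy data, invented for this sketch: two episodes with the same actions
# but different outcomes in state "B".
episodes = [
    [("A", "right", -1), ("B", "right", 10)],
    [("A", "right", -1), ("B", "right", -10)],
]
q = direct_q_evaluation(episodes)
print(q[("A", "right")])  # average of returns 9 and -11
print(q[("B", "right")])  # average of returns 10 and -10
```

No model of the environment is used here: the estimate relies only on sampled returns, which is why it needs many episodes before the averages become reliable.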

Exercise II / Solving together during interactive lab

  • Policy estimation from training episodes (PDF)

Individual work

Start working on the Reinforcement Learning assignment; the deadline is listed in BRUTE.

courses/be5b33kui/labs/weekly/week_08.txt · Last modified: 2023/04/17 14:35 by gamafili