
10 Reinforcement Learning III

  • State values during a random walk
  • Approximation by minimizing the least-squares error (LSQ)
  • Approximate Q-learning
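
As an illustration of the first topic, here is a minimal sketch of computing state values under a random-walk policy by iterative policy evaluation. The environment is an assumption made for this example, not the quiz assignment: a chain of `n_states` non-terminal states with a terminal state at each end, a reward of 1 for stepping into the right terminal state, a reward of 0 otherwise, and a policy that moves left or right with probability 0.5. The function name `random_walk_values` is hypothetical.

```python
def random_walk_values(n_states=5, gamma=1.0, tol=1e-10):
    """Iterative policy evaluation for a uniform random walk on a chain.

    Assumed setup: indices 0 and n_states + 1 are terminal states.
    Stepping into the right terminal state gives reward 1,
    every other transition gives reward 0.
    """
    values = [0.0] * (n_states + 2)  # terminal values stay at 0
    while True:
        delta = 0.0
        for s in range(1, n_states + 1):
            # Reward is 1 only when the step to the right ends the episode.
            r_right = 1.0 if s + 1 == n_states + 1 else 0.0
            # Bellman expectation: average over the two equally likely moves.
            new_v = 0.5 * (0.0 + gamma * values[s - 1]) \
                  + 0.5 * (r_right + gamma * values[s + 1])
            delta = max(delta, abs(new_v - values[s]))
            values[s] = new_v
        if delta < tol:
            return values[1:-1]  # values of the non-terminal states only


if __name__ == "__main__":
    for i, v in enumerate(random_walk_values(), start=1):
        print(f"state {i}: {v:.4f}")
```

With the default settings the values approach 1/6, 2/6, ..., 5/6, which matches solving the Bellman equations for this chain by hand.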

Quiz for bonus points

  • Calculate state values under a random-walk policy
  • 0.5 points
  • Submit your solution to BRUTE lab10quiz by April 26, midnight
  • Format: text file, photo of your solution on paper, or PDF, whichever is convenient for you
  • The solution will be discussed in the next lab
  • Quiz assignment: students whose family name starts with A to L (inclusive) solve and upload subject A; students whose family name starts with M to Z solve and upload subject B

Quiz II / Solving together during interactive lab

  • Approximation by minimizing the least-squares error (LSQ)
  • Approximate Q-learning
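
For orientation, a minimal sketch of one approximate Q-learning step with a linear approximator, Q(s, a) = Σ w_i · f_i(s, a). The feature vectors, the function names `q_value` and `approx_q_update`, and the default `alpha` and `gamma` are assumptions chosen for this example; the lab itself may use a different parametrization.

```python
def q_value(weights, features):
    """Linear Q estimate: dot product of weights and features f(s, a)."""
    return sum(w * f for w, f in zip(weights, features))


def approx_q_update(weights, features, reward, next_features_per_action,
                    alpha=0.1, gamma=0.9):
    """Return the weights after one approximate Q-learning step.

    features: f(s, a) for the action actually taken.
    next_features_per_action: list of f(s', a') for every action a'
        available in s' (empty list if s' is terminal).
    """
    # Greedy estimate of the next state's value; 0 for a terminal state.
    best_next = max((q_value(weights, f) for f in next_features_per_action),
                    default=0.0)
    # Temporal-difference error between the target and the current estimate.
    td_error = reward + gamma * best_next - q_value(weights, features)
    # Each weight moves in proportion to its feature's activation.
    return [w + alpha * td_error * f for w, f in zip(weights, features)]


if __name__ == "__main__":
    w = approx_q_update([0.0, 0.0], [1.0, 2.0], reward=1.0,
                        next_features_per_action=[], alpha=0.5)
    print(w)
```

The update is a stochastic gradient step on the squared temporal-difference error with the target held fixed, which is where it connects to the least-squares view of approximation.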

Individual Work

courses/be5b33kui/labs/weekly/week_10.txt · Last modified: 2021/05/10 10:07 by gamafili