====== 10 Reinforcement Learning III ======

  * State values during a random walk
  * Approximation by minimizing the least-squares error (LSQ)
  * Approximate Q-learning
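
To make the first topic concrete, here is a minimal sketch of iterative policy evaluation for a uniform random walk. Everything specific in it is an assumption chosen for illustration and is not the quiz setup: a chain of five non-terminal states with a terminal state at each end, a reward of 1 only for stepping into the right terminal state, no discounting, and the hypothetical function name ''random_walk_values''.

```python
def random_walk_values(n_states=5, gamma=1.0, tol=1e-12, max_sweeps=100_000):
    """Evaluate a uniform random-walk policy on a chain (illustrative setup)."""
    # Indices 0 and n_states + 1 are terminal states; their value stays 0.
    values = [0.0] * (n_states + 2)
    for _ in range(max_sweeps):
        delta = 0.0
        for s in range(1, n_states + 1):
            # Assumed reward: 1 for entering the right terminal state, else 0.
            r_right = 1.0 if s + 1 == n_states + 1 else 0.0
            # Bellman expectation backup: left and right each with probability 0.5.
            new_v = 0.5 * (gamma * values[s - 1]) + 0.5 * (r_right + gamma * values[s + 1])
            delta = max(delta, abs(new_v - values[s]))
            values[s] = new_v
        if delta < tol:  # stop once a full sweep changes nothing noticeable
            break
    return values[1:-1]  # values of the non-terminal states only


if __name__ == "__main__":
    print([round(v, 3) for v in random_walk_values()])
```

Under these assumptions the values approach 1/6, 2/6, ..., 5/6 from left to right, which is the probability of leaving the chain on the right-hand side.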

===== Quiz for bonus points =====

  * Calculate the state values under a random-walk policy
  * 0.5 points
  * submit your solution to BRUTE **lab10quiz** by May 4, midnight
  * format: text file, photo of your handwritten solution, or PDF, whichever is convenient for you
  * the solution will be discussed in the next lab
  * quiz assignment: [will be accessible from Monday, May 4]

> {{page>courses:be5b33kui:internal:quizzes#state_values_for_a_random_walk}}


===== Quiz II / Solved together during the interactive lab =====

  * Approximation by minimizing the least-squares error (LSQ)
  * Approximate Q-learning
  * {{ :courses:be5b33kui:labs:weekly:learning_by_approximation.pdf |(see pdf)}}
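
The two topics above are connected: with a linear approximation Q(s,a) = w · f(s,a), the usual approximate Q-learning step can be read as one gradient step on the squared error between the current estimate and the one-step target. The sketch below assumes this linear form. The function names, the feature vectors, and the default values of ''alpha'' and ''gamma'' are hypothetical choices for illustration and are not taken from the PDF.

```python
def q_value(weights, features):
    """Linear Q estimate: dot product of weights and features f(s, a)."""
    return sum(w * f for w, f in zip(weights, features))


def approx_q_update(weights, features, reward, next_features_per_action,
                    alpha=0.1, gamma=0.9):
    """Return new weights after one approximate Q-learning step.

    next_features_per_action holds one feature vector per action available
    in the next state; pass an empty list if the next state is terminal.
    """
    # Greedy value of the next state (0 for a terminal state).
    best_next = max((q_value(weights, f) for f in next_features_per_action),
                    default=0.0)
    # Temporal-difference error: one-step target minus current estimate.
    td_error = reward + gamma * best_next - q_value(weights, features)
    # Move each weight along its feature, scaled by the error.
    return [w + alpha * td_error * f for w, f in zip(weights, features)]


if __name__ == "__main__":
    w = approx_q_update([1.0, 2.0], [1.0, 0.0], reward=0.0,
                        next_features_per_action=[[0.0, 1.0], [1.0, 1.0]])
    print(w)
```

Only weights whose feature is non-zero for the visited state-action pair change, which is how a single experience generalizes to other states that share those features.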

===== Individual Work =====

Work on the [[courses:be5b33kui:labs:rl:start|Reinforcement learning assignment]].