====== 10 Reinforcement Learning III ====== * State values during a random walk * Approximation minimizing least squares error (LSQ) * Approximative Q-learning ===== Quiz for bonus points ===== * Calculate state values during a random walk policy * 0.5 points * submit your solution to BRUTE **lab10quiz** by May 4, midnight * format: text file, photo of your solution on paper, pdf - what is convenient for you * solution will be discussed on the next lab * quiz assignment: [will be accessible from Monday, May 4] > {{page>courses:be5b33kui:internal:quizzes#state_values_for_a_random_walk}} ===== Quiz II / Solving together during interactive lab ===== * Approximation minimizing least squares error (LSQ) * Approximative Q-learning * {{ :courses:be5b33kui:labs:weekly:learning_by_approximation.pdf |(see pdf)}} ===== Individual Work ===== Work on the [[courses:be5b33kui:labs:rl:start|Reinforcement learning assignment]].