====== 08 Reinforcement Learning I ====== We don't know the model of the robot-agent; it's behaving somewhat strangely, the path to the goal is unknown, with some traps along the way: what do we do? ===== Quiz for bonus points ===== * Direct Q value evaluation * 0.5 points * submit your solution to BRUTE **lab08quiz** by April 20, midnight * format: text file, photo of your solution on paper, pdf - what is convenient for you * solution will be discussed on the next lab * quiz assignment: [will be accessible from Monday, April 20] ===== Quiz II / Solving together during interactive lab ===== * Policy estimation from training episodes {{:courses:be5b33kui:labs:weekly:policy_estimation_example.pdf |(pdf)}} > {{page>courses:be5b33kui:internal:quizzes#policy_estimation_from_training_episodes}} ===== Individual work ===== Start working on the [[courses:be5b33kui:labs:rl:start|Reinforcement Learning]] assignment, deadline on [[https://cw.felk.cvut.cz/upload/|BRUTE]].