====== 08 Reinforcement Learning I ======

We don't know the model of the robot-agent; it's behaving somewhat strangely, the path to the goal is unknown, with some traps along the way: what do we do?

===== Quiz for bonus points =====
  * Direct Q value evaluation
  * 0.5 points
  * submit your solution to [[https://cw.felk.cvut.cz/brute/|BRUTE]] **lab08quiz** by April 12, midnight
  * format: text file, photo of your solution on paper, pdf - what is convenient for you
  * solution will be discussed on the next lab
  * quiz assignment: [Students with their family name starting from A to L (included) have to solve and upload {{ :courses:be5b33kui:labs:weekly:DirectQEvaluation_A_2021.pdf |subject A}} , while students with family name from M to Z have to solve and upload {{ :courses:be5b33kui:labs:weekly:DirectQEvaluation_B_2021.pdf |subject B}}]
 
===== Quiz II / Solving together during interactive lab =====
  * Policy estimation from training episodes {{ :courses:be5b33kui:labs:weekly:Policy_estimation_example.pdf|pdf}}

> {{page>courses:be5b33kui:internal:quizzes#policy_estimation_from_training_episodes}}

===== Individual work =====

Start working on the [[courses:be5b33kui:labs:rl:start|Reinforcement Learning]] assignment, deadline on [[https://cw.felk.cvut.cz/upload/|BRUTE]].