Differences

This shows you the differences between two versions of the page.

--- courses:be5b33kui:labs:weekly:week_08 [2018/06/20 18:07]
+++ courses:be5b33kui:labs:weekly:week_08 [2024/04/18 15:21] (current)
xposik [Individual work]
@@ Line 1: / Line 1: @@
+====== 08 Reinforcement Learning I ======
+We don't know the model of the robot-agent; it's behaving somewhat strangely, the path to the goal is unknown, with some traps along the way: what do we do?
+===== Exercise for bonus points =====
+  * Direct Q value evaluation
+  * 0.5 points
+  * submit your solution to [[https://cw.felk.cvut.cz/brute/|BRUTE]] **lab08quiz** by April 17, midnight
+  * format: text file, photo of your solution on paper, pdf - what is convenient for you
+  * solution will be discussed on the next lab
+  * Students with their family name starting from A to K (included) have to solve and upload {{ :courses:be5b33kui:labs:weekly:DirectQEvaluation_a_2024.pdf|subject A}} , while students with family name from L to Z have to solve and upload {{ :courses:be5b33kui:labs:weekly:DirectQEvaluation_b_2024.pdf|subject B}}.
+===== Exercise II / Solving together during interactive lab =====
+  * Policy estimation from training episodes {{ :courses:be5b33kui:labs:weekly:policy_estimation_example.pdf |pdf}}
+> {{page>courses:be5b33kui:internal:quizzes#policy_estimation_from_training_episodes}}
+===== Individual work =====
+Start working on the [[courses:be5b33kui:semtasks:04_rl:start|Reinforcement Learning]] assignment, deadline on [[https://cw.felk.cvut.cz/upload/|BRUTE]].