Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Previous revision
courses:be5b33kui:labs:weekly:week_08 [2018/06/20 18:07]
courses:be5b33kui:labs:weekly:week_08 [2024/04/18 15:21] (current)
xposik [Individual work]
Line 1: Line 1:
 +====== 08 Reinforcement Learning I ======
 +
 +We don't know the model of the robot-agent;​ it's behaving somewhat strangely, the path to the goal is unknown, with some traps along the way: what do we do?
 +
 +===== Exercise for bonus points =====
 +  * Direct Q value evaluation
 +  * 0.5 points
 +  * submit your solution to [[https://​cw.felk.cvut.cz/​brute/​|BRUTE]] **lab08quiz** by April 17, midnight
 +  * format: text file, photo of your solution on paper, pdf - what is convenient for you
 +  * solution will be discussed on the next lab
 +  * Students with their family name starting from A to K (included) have to solve and upload {{ :​courses:​be5b33kui:​labs:​weekly:​DirectQEvaluation_a_2024.pdf|subject A}} , while students with family name from L to Z have to solve and upload {{ :​courses:​be5b33kui:​labs:​weekly:​DirectQEvaluation_b_2024.pdf|subject B}}.
 + 
 +===== Exercise II / Solving together during interactive lab =====
 +  * Policy estimation from training episodes {{ :​courses:​be5b33kui:​labs:​weekly:​policy_estimation_example.pdf |pdf}}
 +
 +> {{page>​courses:​be5b33kui:​internal:​quizzes#​policy_estimation_from_training_episodes}}
 +
 +===== Individual work =====
 +
 +Start working on the [[courses:​be5b33kui:​semtasks:​04_rl:​start|Reinforcement Learning]] assignment, deadline on [[https://​cw.felk.cvut.cz/​upload/​|BRUTE]].
 +