courses:be5b33kui:labs:weekly:week_08 [2018/05/21 12:20]
courses:be5b33kui:labs:weekly:week_08 [2024/04/15 10:53]
dantuswa [Exercise II / Solving together during interactive lab]
Line 1: Line 1:
 +====== 08 Reinforcement Learning I ======
 +
 +We don't know the model of the robot-agent; it behaves somewhat strangely, and the path to the goal is unknown, with some traps along the way. What do we do?
 +
 +===== Exercise for bonus points =====
 +  * Direct Q value evaluation
 +  * 0.5 points
 +  * submit your solution to [[https://cw.felk.cvut.cz/brute/|BRUTE]] **lab08quiz** by April 17, midnight
 +  * format: text file, photo of your solution on paper, or PDF, whichever is convenient for you
 +  * the solution will be discussed in the next lab
 +  * Students whose family name starts with A to K (inclusive) solve and upload {{ :courses:be5b33kui:labs:weekly:DirectQEvaluation_a_2024.pdf|subject A}}, while students whose family name starts with L to Z solve and upload {{ :courses:be5b33kui:labs:weekly:DirectQEvaluation_b_2024.pdf|subject B}}.
 + 
 +===== Exercise II / Solving together during interactive lab =====
 +  * Policy estimation from training episodes {{ :courses:be5b33kui:labs:weekly:policy_estimation_example.pdf |pdf}}
 +
 +> {{page>courses:be5b33kui:internal:quizzes#policy_estimation_from_training_episodes}}
 +
 +===== Individual work =====
 +
 +Start working on the [[courses:be5b33kui:labs:rl:start|Reinforcement Learning]] assignment; the deadline is listed on [[https://cw.felk.cvut.cz/upload/|BRUTE]].
 +
  
courses/be5b33kui/labs/weekly/week_08.txt · Last modified: 2024/04/18 15:21 by xposik