CourseWare Wiki
Switch Term
Summer 2023 / 2024
Summer 2022 / 2023
Summer 2021 / 2022
Summer 2020 / 2021
Summer 2019 / 2020
Summer 2018 / 2019
Summer 2017 / 2018
Older
Search
Log In
b232
courses
be5b33kui
labs
weekly
week_08
Differences
This shows you the differences between two versions of the page.
View differences:
Side by Side
Inline
Go
Link to this comparison view
Both sides previous revision
Previous revision
2024/04/18 15:21 xposik [Individual work]
2024/04/15 10:53 dantuswa [Exercise II / Solving together during interactive lab]
2024/02/13 14:37 gamafili [Exercise II / Solving together during interactive lab]
2024/02/13 14:37 gamafili [Exercise for bonus points]
2023/04/17 14:35 external edit
Go
Previous revision
2024/04/18 15:21 xposik [Individual work]
2024/04/15 10:53 dantuswa [Exercise II / Solving together during interactive lab]
2024/02/13 14:37 gamafili [Exercise II / Solving together during interactive lab]
2024/02/13 14:37 gamafili [Exercise for bonus points]
2023/04/17 14:35 external edit
Go
courses:be5b33kui:labs:weekly:week_08 [2018/06/20 18:07]
courses:be5b33kui:labs:weekly:week_08 [2024/04/18 15:21]
(current)
xposik
[Individual work]
Line 1:
Line 1:
+
====== 08 Reinforcement Learning I ======
+
+
We don't know the model of the robot-agent; it's behaving somewhat strangely, the path to the goal is unknown, with some traps along the way: what do we do?
+
+
===== Exercise for bonus points =====
+
* Direct Q value evaluation
+
* 0.5 points
+
* submit your solution to [[https://cw.felk.cvut.cz/brute/|BRUTE]] **lab08quiz** by April 17, midnight
+
* format: text file, photo of your solution on paper, pdf - what is convenient for you
+
* solution will be discussed on the next lab
+
* Students with their family name starting from A to K (included) have to solve and upload {{ :courses:be5b33kui:labs:weekly:DirectQEvaluation_a_2024.pdf|subject A}} , while students with family name from L to Z have to solve and upload {{ :courses:be5b33kui:labs:weekly:DirectQEvaluation_b_2024.pdf|subject B}}.
+
+
===== Exercise II / Solving together during interactive lab =====
+
* Policy estimation from training episodes {{ :courses:be5b33kui:labs:weekly:policy_estimation_example.pdf |pdf}}
+
+
> {{page>courses:be5b33kui:internal:quizzes#policy_estimation_from_training_episodes}}
+
+
===== Individual work =====
+
+
Start working on the [[courses:be5b33kui:semtasks:04_rl:start|Reinforcement Learning]] assignment, deadline on [[https://cw.felk.cvut.cz/upload/|BRUTE]].
+