CourseWare Wiki
Search
Log In
b232
courses
smu
lectures
Differences
This shows you the differences between two versions of the page.
View differences:
Side by Side
Inline
Go
Link to this comparison view
Both sides previous revision
Previous revision
2024/05/29 15:07 kuzelon2 [Lecture 3 - Reinforcement Learning 3]
2024/05/20 15:11 zelezny [Literature]
2024/05/20 14:36 tothjan2
2024/05/20 14:35 tothjan2
2024/04/29 15:15 souregus
2024/04/22 15:03 souregus
2024/04/15 14:57 souregus [Lecture 8 - Natural Language Processing 2]
2024/04/08 10:29 souregus [Lecture 7 - Natural Language Processing 1]
2024/03/26 09:52 tothjan2 [Literature]
2024/03/25 15:51 kuzelon2 [Lecture 6 - Reinforcement Learning 6]
2024/03/18 15:00 kuzelon2 [Lecture 5 - Reinforcement Learning 5]
2024/03/18 14:57 kuzelon2 [Lecture 4 - Reinforcement Learning 4]
2024/03/18 14:57 kuzelon2 [Lecture 4 - Reinforcement Learning 4]
2024/03/18 14:54 kuzelon2 [Lecture 3 - Reinforcement Learning 3]
2024/03/11 15:59 kuzelon2 [Lecture 4 - Reinforcement Learning 4]
2024/03/04 14:47 kuzelon2 [Lecture 3 - Reinforcement Learning 3]
2024/02/26 15:02 kuzelon2 [Lecture 1 - Reinforcement Learning 1]
2024/02/26 15:01 kuzelon2 [Lecture 1 - Reinforcement Learning 1]
2024/02/26 14:49 kuzelon2 [Lecture 1 - Reinforcement Learning 1]
2024/02/26 14:45 kuzelon2 [Lecture 2 - Reinforcement Learning 2]
2024/02/16 15:13 souregus
2024/02/16 15:12 souregus [Lecture 7 - Natural Language Processing 1]
2024/02/16 14:07 kuzelon2
2024/02/09 10:14 external edit
Go
2024/05/29 15:07 kuzelon2 [Lecture 3 - Reinforcement Learning 3]
2024/05/20 15:11 zelezny [Literature]
2024/05/20 14:36 tothjan2
2024/05/20 14:35 tothjan2
2024/04/29 15:15 souregus
2024/04/22 15:03 souregus
2024/04/15 14:57 souregus [Lecture 8 - Natural Language Processing 2]
2024/04/08 10:29 souregus [Lecture 7 - Natural Language Processing 1]
2024/03/26 09:52 tothjan2 [Literature]
2024/03/25 15:51 kuzelon2 [Lecture 6 - Reinforcement Learning 6]
2024/03/18 15:00 kuzelon2 [Lecture 5 - Reinforcement Learning 5]
2024/03/18 14:57 kuzelon2 [Lecture 4 - Reinforcement Learning 4]
2024/03/18 14:57 kuzelon2 [Lecture 4 - Reinforcement Learning 4]
2024/03/18 14:54 kuzelon2 [Lecture 3 - Reinforcement Learning 3]
2024/03/11 15:59 kuzelon2 [Lecture 4 - Reinforcement Learning 4]
2024/03/04 14:47 kuzelon2 [Lecture 3 - Reinforcement Learning 3]
2024/02/26 15:02 kuzelon2 [Lecture 1 - Reinforcement Learning 1]
2024/02/26 15:01 kuzelon2 [Lecture 1 - Reinforcement Learning 1]
2024/02/26 14:49 kuzelon2 [Lecture 1 - Reinforcement Learning 1]
2024/02/26 14:45 kuzelon2 [Lecture 2 - Reinforcement Learning 2]
2024/02/16 15:13 souregus
2024/02/16 15:12 souregus [Lecture 7 - Natural Language Processing 1]
2024/02/16 14:07 kuzelon2
2024/02/09 10:14 external edit
Go
courses:smu:lectures [2024/05/20 15:11]
zelezny
[Literature]
courses:smu:lectures [2024/05/29 15:07]
(current)
kuzelon2
[Lecture 3 - Reinforcement Learning 3]
Line 71:
Line 71:
**Note:** //This lecture is heavily based on a lecture by Prof Emma Brunskill (all potential errors are likely mine). There was a typo in the pseudocode of SARSA - the action sampled inside the loop should be sampled from \pi(s_{t+1}) instead of \pi(s_t). The error remains in the video. //
**Note:** //This lecture is heavily based on a lecture by Prof Emma Brunskill (all potential errors are likely mine). There was a typo in the pseudocode of SARSA - the action sampled inside the loop should be sampled from \pi(s_{t+1}) instead of \pi(s_t). The error remains in the video. //
+
+
**Erratum:** There is an error on slide 73, where we should be updating Q(b,right) - we are always updating the state with the "small" ladybug symbol, Thanks to M. Komínek for discovering this error.
**Relevant videos from Prof Brunskill's course:** [[https://youtu.be/j080VBVGkfQ|Lecture 4]]
**Relevant videos from Prof Brunskill's course:** [[https://youtu.be/j080VBVGkfQ|Lecture 4]]
courses/smu/lectures.txt
· Last modified: 2024/05/29 15:07 by
kuzelon2