Differences

This shows you the differences between two versions of the page.

--- courses:smu:lectures [2024/05/20 15:11]
zelezny [Literature]
+++ courses:smu:lectures [2024/05/29 15:07] (current)
kuzelon2 [Lecture 3 - Reinforcement Learning 3]
@@ Line 71: / Line 71: @@
 **Note:** //This lecture is heavily based on a lecture by Prof Emma Brunskill (any errors are likely mine). There was a typo in the SARSA pseudocode - the action sampled inside the loop should be sampled from \pi(s_{t+1}) instead of \pi(s_t). The error remains in the video.//
+**Erratum:** There is an error on slide 73, where we should be updating Q(b,right) - we are always updating the state with the "small" ladybug symbol. Thanks to M. Komínek for discovering this error.
 **Relevant videos from Prof Brunskill's course:** [[https://youtu.be/j080VBVGkfQ|Lecture 4]]