Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
courses:smu:lectures [2024/05/20 15:11]
zelezny [Literature]
courses:smu:lectures [2024/05/29 15:07] (current)
kuzelon2 [Lecture 3 - Reinforcement Learning 3]
Line 71: Line 71:
  
 **Note:** //This lecture is heavily based on a lecture by Prof Emma Brunskill (all potential errors are likely mine). There was a typo in the pseudocode of SARSA - the action sampled inside the loop should be sampled from \pi(s_{t+1}) instead of \pi(s_t). The error remains in the video. ​  // **Note:** //This lecture is heavily based on a lecture by Prof Emma Brunskill (all potential errors are likely mine). There was a typo in the pseudocode of SARSA - the action sampled inside the loop should be sampled from \pi(s_{t+1}) instead of \pi(s_t). The error remains in the video. ​  //
 +
 +**Erratum:​** There is an error on slide 73, where we should be updating Q(b,right) - we are always updating the state with the "​small"​ ladybug symbol, ​ Thanks to M. Komínek for discovering this error.
  
 **Relevant videos from Prof Brunskill'​s course:** [[https://​youtu.be/​j080VBVGkfQ|Lecture 4]] **Relevant videos from Prof Brunskill'​s course:** [[https://​youtu.be/​j080VBVGkfQ|Lecture 4]]
courses/smu/lectures.txt · Last modified: 2024/05/29 15:07 by kuzelon2