Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
courses:smu:lectures [2024/04/29 15:15]
souregus
courses:smu:lectures [2024/05/29 15:07] (current)
kuzelon2 [Lecture 3 - Reinforcement Learning 3]
Line 14: Line 14:
  
 RL & NLP are available online. ​ RL & NLP are available online. ​
- 
-You are strongly discouraged from using this course'​s materials from previous years as you would run into confusions. 
  
 The RL part of the course is heavily based on the RL course of prof Emma Brunskill. The relevant lectures from prof Brunskill'​s course are: [[https://​youtu.be/​FgzM3zpZ55o|Lecture 1]],​[[https://​youtu.be/​E3f2Camj0Is|Lecture 2]], [[https://​youtu.be/​dRIhrn8cc9w|Lecture 3]], [[https://​youtu.be/​j080VBVGkfQ|Lecture 4]], [[https://​youtu.be/​buptHUzDKcE|Lecture 5]], [[https://​youtu.be/​gOV8-bC1_KU|Lecture 6]], [[https://​youtu.be/​RN8qpSs8ozY|Lecture 11]]. The RL part of the course is heavily based on the RL course of prof Emma Brunskill. The relevant lectures from prof Brunskill'​s course are: [[https://​youtu.be/​FgzM3zpZ55o|Lecture 1]],​[[https://​youtu.be/​E3f2Camj0Is|Lecture 2]], [[https://​youtu.be/​dRIhrn8cc9w|Lecture 3]], [[https://​youtu.be/​j080VBVGkfQ|Lecture 4]], [[https://​youtu.be/​buptHUzDKcE|Lecture 5]], [[https://​youtu.be/​gOV8-bC1_KU|Lecture 6]], [[https://​youtu.be/​RN8qpSs8ozY|Lecture 11]].
Line 73: Line 71:
  
 **Note:** //This lecture is heavily based on a lecture by Prof Emma Brunskill (all potential errors are likely mine). There was a typo in the pseudocode of SARSA - the action sampled inside the loop should be sampled from \pi(s_{t+1}) instead of \pi(s_t). The error remains in the video. ​  // **Note:** //This lecture is heavily based on a lecture by Prof Emma Brunskill (all potential errors are likely mine). There was a typo in the pseudocode of SARSA - the action sampled inside the loop should be sampled from \pi(s_{t+1}) instead of \pi(s_t). The error remains in the video. ​  //
 +
 +**Erratum:​** There is an error on slide 73, where we should be updating Q(b,right) - we are always updating the state with the "​small"​ ladybug symbol, ​ Thanks to M. Komínek for discovering this error.
  
 **Relevant videos from Prof Brunskill'​s course:** [[https://​youtu.be/​j080VBVGkfQ|Lecture 4]] **Relevant videos from Prof Brunskill'​s course:** [[https://​youtu.be/​j080VBVGkfQ|Lecture 4]]
Line 138: Line 138:
 ---- ----
 ==== Lecture 11  - Computational Learning Theory 1 ==== ==== Lecture 11  - Computational Learning Theory 1 ====
-{{ :​courses:​smu:​colt-1.pdf |COLT - lecture 1}}+**Slides:​** ​{{ :​courses:​smu:​colt-1.pdf | COLT - lecture 1}} 
 + 
 +**Video:** [[https://​www.youtube.com/​watch?​v=e6XCe84AYEc&​list=PLQL6z4JeTTQlgt77fhOe2Jovjjz4THF_G&​index=2 | COLT - lecture 1]] 
 +----
  
---- 
 ==== Lecture 12  - Computational Learning Theory 2 ==== ==== Lecture 12  - Computational Learning Theory 2 ====
-{{ :​courses:​smu:​colt-2.pdf |COLT - lecture 2}}+**Slides:​** ​{{ :​courses:​smu:​colt-2.pdf |COLT - lecture 2}} 
 + 
 +**Video:** [[https://​www.youtube.com/​watch?​v=1oK1zcl7lpA&​list=PLQL6z4JeTTQlgt77fhOe2Jovjjz4THF_G&​index=3 | COLT - lecture 2]] 
 +----
  
---- 
 ==== Lecture 13  - Computational Learning Theory 3 ==== ==== Lecture 13  - Computational Learning Theory 3 ====
-{{ :​courses:​smu:​colt-3.pdf |COLT - lecture 3}}+**Slides:​** ​{{ :​courses:​smu:​colt-3.pdf |COLT - lecture 3}}
  
----+**Video:** [[https://​www.youtube.com/​watch?​v=6DIJdKXoggY&​list=PLQL6z4JeTTQlgt77fhOe2Jovjjz4THF_G&​index=4 | COLT lecture 3]]
courses/smu/lectures.txt · Last modified: 2024/05/29 15:07 by kuzelon2