====== 07 Sequential II ====== Sequential decisions and how do we calculate the proper policy? ====== Quiz ====== Policy evaluation on a small map. > {{page>courses:be5b33kui:internal:quizzes##Policy evaluation.}} MDP Let's have the following game. We roll the dice and pay 1kc for each roll. If we roll six two times in a row, we win 1000CZK and the game is over. \\ The game can be terminated at any time without payment. \\ 1) Formulate as MDP task (states, actions, T (s, a, s '), r (s, a, s')). \\ 2) Determine the optimal policy. ===== Individual task ===== Work with the [[courses:be5b33kui:labs:sequential_decisions:start|Markov decision process]] task. .