Search
What if we need to make a sequence of decisions under uncertainty, where each decision influences the ones that follow?
+ α,β exercise solution
Navigating through a gridworld and calculating the proper path.
+ other exercises [See pdf]
Markov decision process. Try running mdp_sandbox.py and ask if anything is unclear.
mdp_sandbox.py
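To get a feel for the gridworld task before opening the sandbox, here is a minimal value-iteration sketch. It is not the contents of mdp_sandbox.py. The 3x4 grid, the wall position, the terminal rewards, the 0.8/0.1/0.1 slip model, the step reward and the discount factor are all assumptions chosen for illustration, as are the function names.

```python
from typing import Dict, List, Tuple

State = Tuple[int, int]  # (row, column)

# Hypothetical 3x4 gridworld; every constant here is an illustrative assumption.
ROWS, COLS = 3, 4
WALL = {(1, 1)}
TERMINALS: Dict[State, float] = {(0, 3): 1.0, (1, 3): -1.0}
ACTIONS: Dict[str, Tuple[int, int]] = {
    "U": (-1, 0), "D": (1, 0), "L": (0, -1), "R": (0, 1),
}
# With probability 0.1 each, the agent slips perpendicular to the chosen action.
SLIPS = {"U": ("L", "R"), "D": ("L", "R"), "L": ("U", "D"), "R": ("U", "D")}
STEP_REWARD = -0.04  # assumed cost of every move
GAMMA = 0.9          # assumed discount factor


def states() -> List[State]:
    """All cells the agent can occupy (everything except walls)."""
    return [(r, c) for r in range(ROWS) for c in range(COLS) if (r, c) not in WALL]


def move(s: State, a: str) -> State:
    """Deterministic result of action a; bumping into a wall or edge keeps s."""
    dr, dc = ACTIONS[a]
    nxt = (s[0] + dr, s[1] + dc)
    if nxt in WALL or not (0 <= nxt[0] < ROWS and 0 <= nxt[1] < COLS):
        return s
    return nxt


def transitions(s: State, a: str) -> List[Tuple[float, State]]:
    """Stochastic outcomes of action a in state s as (probability, next state)."""
    left, right = SLIPS[a]
    return [(0.8, move(s, a)), (0.1, move(s, left)), (0.1, move(s, right))]


def q_value(V: Dict[State, float], s: State, a: str) -> float:
    """Expected discounted return of taking a in s, then following V."""
    return sum(p * (STEP_REWARD + GAMMA * V[s2]) for p, s2 in transitions(s, a))


def value_iteration(tol: float = 1e-6) -> Dict[State, float]:
    """Repeat Bellman optimality updates until the largest change is below tol."""
    # Terminal states keep their reward as a fixed value; others start at 0.
    V = {s: TERMINALS.get(s, 0.0) for s in states()}
    while True:
        delta = 0.0
        for s in states():
            if s in TERMINALS:
                continue
            best = max(q_value(V, s, a) for a in ACTIONS)
            delta = max(delta, abs(best - V[s]))
            V[s] = best
        if delta < tol:
            return V


def greedy_policy(V: Dict[State, float]) -> Dict[State, str]:
    """Pick the action with the highest expected value in each non-terminal state."""
    return {
        s: max(ACTIONS, key=lambda a: q_value(V, s, a))
        for s in states()
        if s not in TERMINALS
    }


if __name__ == "__main__":
    V = value_iteration()
    policy = greedy_policy(V)
    for r in range(ROWS):
        print(" ".join(policy.get((r, c), "#" if (r, c) in WALL else "T")
                       for c in range(COLS)))
```

Because moves can slip, the result is a policy (an action for every cell) rather than a single fixed path: the agent needs to know what to do wherever it ends up. Changing STEP_REWARD or the slip probabilities is a quick way to see how the preferred route changes.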