- DATA.ML.310
- 6. Markov Decision Process and Reinforcement Learning
- 6.4 Quiz: Three state MDP
Quiz: Three state MDP¶
Consider the three state MDP for a robot trying to walk. The three states of the robot are ‘Fallen’, ‘Standing’, and ‘Moving’. In each of the states the robot can take either a slow or a fast action, denoted by black and green arrows, respectively. The slow action is more likely to keep the robot upright, but the fast action enables the robot to gain more distance.

Each edge in the image is labeled with two numbers: the transition probability and the reward.
Notice that these numbers are different for each question.
Posting submission...