This course has already ended.

Markov Decision Process and Reinforcement Learning


Markov Decision Process (MDP)

Bellman equations

MDP Algorithms: Value & Policy Iteration

Reinforcement Learning



Read chapters 17 and 21 from the textbook and/or watch the lectures.

Answer the quizzes.

Solve the programming exercise, submit your code, and complete the peer grading.

Assistance for the tasks:

Q&A sessions:

Tuesday, 15:00-17:00.

Join Zoom Meeting:

Passcode: 276151
