Markov Decision Processes | Free Reinforcement Learning Course Module 2

Welcome to module 2 of our free course in reinforcement learning.

Today we’re going to discover what a Markov decision process (MDP) is and how it relates to reinforcement learning. In a nutshell, a process has the Markov property when its next state depends only on its current state and action, not on the full history that led there. Processes with this property are amenable to a mathematical framework that makes calculating expected future rewards straightforward.

We’ll also return to the Bellman equation and see how it relates to the Markov property.
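
As a rough illustration of why the Markov property helps, here is a minimal sketch of a Bellman-style calculation on a toy two-state process. The transition probabilities, rewards, discount factor, and the `evaluate` helper are all made up for this example and are not taken from the course material. Because the next state depends only on the current one, the expected discounted reward of each state can be written in terms of the values of its successor states and refined by repeated sweeps.

```python
# Toy two-state Markov reward process. All numbers are hypothetical.
# P[s][t] is the probability of moving from state s to state t.
P = [
    [0.9, 0.1],
    [0.5, 0.5],
]
# R[s] is the reward collected on each step spent in state s.
R = [1.0, 0.0]
# Discount factor applied to future rewards (an assumption for this sketch).
GAMMA = 0.9


def evaluate(P, R, gamma, sweeps=1000):
    """Estimate the expected discounted future reward of every state.

    Repeatedly applies the recursion
        v[s] = R[s] + gamma * sum_t P[s][t] * v[t]
    starting from all zeros.
    """
    n = len(R)
    v = [0.0] * n
    for _ in range(sweeps):
        v = [
            R[s] + gamma * sum(P[s][t] * v[t] for t in range(n))
            for s in range(n)
        ]
    return v


values = evaluate(P, R, GAMMA)
for state, value in enumerate(values):
    print(f"state {state}: expected discounted reward ~ {value:.4f}")
```

Note that the recursion only ever looks at the current state and its immediate successors. Without the Markov property, each value would have to depend on the whole history of states visited so far.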

In tomorrow’s module, we’ll handle the explore-exploit dilemma, so that we’re set up to start solving the Bellman equation using dynamic programming.

