Markov Decision Processes | Free Reinforcement Learning Course Module 2

#reinforcementlearning #artificialintelligence

Welcome to module 2 of our free course in reinforcement learning.

Today we’re going to discover what a Markov decision process (MDP) is, and how it relates to reinforcement learning. In a nutshell, a process has the Markov property when its next state and reward depend only on the current state and action, not on the full history that led there. Processes with this property fit a mathematical framework that makes calculating expected future rewards straightforward.
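To make the "expected future rewards" idea concrete, here is a minimal sketch in Python. The two-state MDP, its action names, rewards, and the discount factor are all invented for illustration and are not taken from the video. The sketch assumes a fixed policy and repeatedly averages over next states until the value estimates stop changing.

```python
# Hypothetical two-state MDP, invented purely for illustration.
# TRANSITIONS[state][action] is a list of (probability, next_state, reward).
TRANSITIONS = {
    "A": {
        "stay": [(1.0, "A", 0.0)],
        "go": [(0.8, "B", 1.0), (0.2, "A", 0.0)],
    },
    "B": {
        "stay": [(1.0, "B", 2.0)],
        "go": [(1.0, "A", 0.0)],
    },
}
GAMMA = 0.9  # assumed discount factor


def evaluate_policy(policy, transitions=TRANSITIONS, gamma=GAMMA, tol=1e-10):
    """Estimate the expected discounted return of each state under `policy`.

    Because of the Markov property, the value of a state depends only on
    the immediate reward and the values of the possible next states.
    """
    values = {state: 0.0 for state in transitions}
    while True:
        delta = 0.0
        for state in transitions:
            outcomes = transitions[state][policy[state]]
            new_value = sum(
                prob * (reward + gamma * values[next_state])
                for prob, next_state, reward in outcomes
            )
            delta = max(delta, abs(new_value - values[state]))
            values[state] = new_value
        if delta < tol:  # stop once no estimate changed noticeably
            return values


policy = {"A": "go", "B": "stay"}  # a fixed, hand-picked policy
values = evaluate_policy(policy)
print(values)
```

Note that the update for each state never looks at how the agent arrived there. That is exactly what the Markov property buys us.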

We’ll get back to the Bellman equation, and see how it relates to the Markov property.
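For reference, one common way to write the two ideas side by side is sketched below. The notation (states $s$, actions $a$, rewards $r$, policy $\pi$, dynamics $p$, discount $\gamma$) follows a widely used textbook convention and is an assumption here; the video may use different symbols.

```latex
% Markov property: the next state and reward depend only on the
% current state and action, not on the earlier history.
\Pr(S_{t+1}, R_{t+1} \mid S_t, A_t)
  = \Pr(S_{t+1}, R_{t+1} \mid S_0, A_0, \dots, S_t, A_t)

% Bellman expectation equation for the state-value function under a policy \pi.
v_\pi(s) = \sum_{a} \pi(a \mid s) \sum_{s', r} p(s', r \mid s, a)
           \bigl[ r + \gamma \, v_\pi(s') \bigr]
```

The second equation can be written with only $s$ on the left because the first one holds: if the dynamics depended on the full history, the value would have to be a function of that history too.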

In tomorrow’s module, we’ll handle the explore-exploit dilemma, so that we’re set up to start solving the Bellman equation using dynamic programming.

Learn how to turn deep reinforcement learning papers into code:

Deep Q Learning:

Actor Critic Methods:

Curiosity Driven Deep Reinforcement Learning

Natural Language Processing from First Principles: Learning Fundamentals

Here are some books / courses I recommend (affiliate links):
Grokking Deep Learning in Motion:
Grokking Deep Learning:
Grokking Deep Reinforcement Learning:

Come hang out on Discord here:

