Reinforcement Learning in Continuous Action Spaces | DDPG Tutorial (Tensorflow)

Let’s use deep deterministic policy gradients to deal with the bipedal walker environment. Featuring a continuous action space and 24 elements in the observation vector, this is a perfect environment for a non trivial example of DDPG.

#DDPG #BipedalWalker #DeepDeterministicPolicyGradients

Learn how to turn deep reinforcement learning papers into code:

Deep Q Learning:

Actor Critic Methods:

Curiosity Driven Deep Reinforcement Learning

Natural Language Processing from First Principles: Learning Fundamentals

Here are some books / courses I recommend (affiliate links):
Grokking Deep Learning in Motion:
Grokking Deep Learning:
Grokking Deep Reinforcement Learning:

Come hang out on Discord here:


Source of this AI Video

AI video(s) you might be interested in …