Get in Touch

Course Outline

Introduction

  • Learning through positive reinforcement

Core Components of Reinforcement Learning

Key Terminology (Actions, States, Rewards, Policy, Value, Q-Value, etc.)

Overview of Tabular Solution Methods

Developing a Software Agent

Understanding Value-based, Policy-based, and Model-based Approaches

Working with the Markov Decision Process (MDP)

How Policies Define an Agent's Behavior

Utilizing Monte Carlo Methods

Temporal-Difference Learning

n-step Bootstrapping

Approximate Solution Methods

On-policy Prediction with Approximation

On-policy Control with Approximation

Off-policy Methods with Approximation

Understanding Eligibility Traces

Utilizing Policy Gradient Methods

Summary and Conclusion

Requirements

  • Experience with machine learning
  • Programming experience

Audience

  • Data scientists
 21 Hours

Number of participants


Price per participant

Upcoming Courses

Related Categories