Overview
In this course, we focus on building a Q-learning agent step by step. We start with the Bellman equation and the Q-table update, then implement a basic Q-learning function. Next, we incorporate an exploration policy (ε-greedy), and finally we demonstrate how to use the learned Q-table for decision making.
Syllabus
Taught by
Tags