Ce que vous devez savoir avant
Vous commencez

Débute 4 June 2026 03:18

Se termine 4 June 2026

00 Jours
00 Heures
00 Minutes
00 Secondes
course image

Sample-based Learning Methods

Plongez dans le monde des méthodes d'apprentissage basées sur des exemples avec un cours complet offert par l'Université de l'Alberta sur Coursera. Ce cours explore en profondeur les algorithmes qui maîtrisent des politiques quasi-optimales à travers des interactions d'essai et d'erreur avec leur environnement, mettant en évidence le pouvoir d'appr.
University of Alberta via Coursera

University of Alberta

6 Cours


L'Université de l'Alberta est une université de recherche de premier plan située à Edmonton, au Canada. Elle est reconnue pour son excellence en enseignement, en recherche, en innovation et pour son dévouement à l'engagement communautaire.

Non spécifié

Amélioration optionnelle disponible

Tous niveaux

Progressez à votre rythme

Free

Amélioration optionnelle disponible

Aperçu

Delve into the world of Sample-based Learning Methods with a comprehensive course offered by the University of Alberta on Coursera. This course dives deep into algorithms that master near-optimal policies through trial and error interactions with their environment, showcasing the power of learning directly from an agent's personal experience.

Uncover the essentials of intuitively simple yet potent Monte Carlo methods and the intricacies of temporal difference learning methods, including the renowned Q-learning.

Embark on a journey to understand how to merge model-based planning with temporal difference updates to significantly boost the learning process. By the course's completion, participants will have gained the ability to:

  • Comprehend the nuances of Temporal-Difference learning and Monte Carlo methods for estimating value functions based on sampled experiences.
  • Recognize the critical role of exploration in leveraging sampled experience over dynamic programming sweeps.
  • Draw connections between Monte Carlo, Dynamic Programming, and TD methods.
  • Develop the skills to implement and utilize the TD algorithm for accurate value function estimation.
  • Apply Expected Sarsa and Q-learning techniques for control purposes.
  • Distinguish between on-policy and off-policy control mechanisms.
  • Explore planning strategies that use simulated experience.
  • Implement a model-based approach to Reinforcement Learning (RL) through Dyna, enhancing sample efficiency with simulated experiences.

This course is categorized under Artificial Intelligence Courses, Reinforcement Learning Courses, and specifically Q-learning Courses, making it an ideal fit for anyone keen to excel in these areas.


Enseigné par

Martha White and Adam White


Matières