Ce que vous devez savoir avant
Vous commencez

Débute 7 June 2026 13:16

Se termine 7 June 2026

00 Jours
00 Heures
00 Minutes
00 Secondes
course image

Optimiser l'IA générative sur les processeurs Arm

Découvrez comment optimiser les modèles d'intelligence artificielle générative pour les processeurs Arm dans les environnements mobiles, edge et cloud en utilisant les techniques SIMD, de quantification et de bibliothèques KleidiAI.
Arm Education via Coursera

Arm Education

2889 Cours


10 hours 10 minutes

Amélioration optionnelle disponible

Not Specified

Progressez à votre rythme

Paid Course

Amélioration optionnelle disponible

Aperçu

AI models are becoming increasingly powerful—but also increasingly demanding. As Generative AI moves from cloud data centers to mobile phones, autonomous systems and embedded IoT devices, the need to optimize performance across diverse hardware environments has never been more critical.

Arm-based processors power more than 300 billion devices globally, from smartphones to hyperscale cloud servers, making them a key foundation for efficient AI deployment across the compute landscape. To meet this growing demand, learners need the skills to translate machine learning models into real-time, hardware-aware implementations across Arm-based platforms.

Optimizing Generative AI on Arm Processors:

from Edge to Cloud is designed for intermediate machine learning practitioners who want to bridge the gap between model design and deployment efficiency. Rather than revisiting ML fundamentals, this course dives straight into performance engineering for Generative AI on Arm-based platforms, including mobile, edge and cloud environments.   You’ll explore real-world constraints, Arm architecture features, and software techniques used to accelerate AI inference—including SIMD (SVE, Neon), low-bit quantization, and the KleidiAI library.

Each concept is taught using concise, interactive notebooks and narrated examples, enabling you to measure, tweak, and iterate on actual hardware like the Raspberry Pi 5 or AWS Graviton3 cloud instances.

Programme

  • Module 1 : Défis de l'inférence GenAI dans le Cloud et en périphérie
  • Module 2 : Modèles d'intelligence artificielle générative
  • Module 3 : Cadres ML et bibliothèques optimisées
  • Module 4 : Optimisation pour l'inférence sur CPU

Enseigné par

Arm Education


Matières

Computer Science