What You Need to Know Before
You Start

Starts 7 June 2026 13:16

Ends 7 June 2026

00 Days
00 Hours
00 Minutes
00 Seconds
course image

Optimizing Generative AI on Arm Processors

Discover how to optimize Generative AI models for Arm processors across mobile, edge, and cloud environments using SIMD, quantization, and KleidiAI library techniques.
Arm Education via Coursera

Arm Education

2889 Courses


10 hours 10 minutes

Optional upgrade avallable

Not Specified

Progress at your own speed

Paid Course

Optional upgrade avallable

Overview

AI models are becoming increasingly powerful—but also increasingly demanding. As Generative AI moves from cloud data centers to mobile phones, autonomous systems and embedded IoT devices, the need to optimize performance across diverse hardware environments has never been more critical.

Arm-based processors power more than 300 billion devices globally, from smartphones to hyperscale cloud servers, making them a key foundation for efficient AI deployment across the compute landscape. To meet this growing demand, learners need the skills to translate machine learning models into real-time, hardware-aware implementations across Arm-based platforms.

Optimizing Generative AI on Arm Processors:

from Edge to Cloud is designed for intermediate machine learning practitioners who want to bridge the gap between model design and deployment efficiency. Rather than revisiting ML fundamentals, this course dives straight into performance engineering for Generative AI on Arm-based platforms, including mobile, edge and cloud environments.   You’ll explore real-world constraints, Arm architecture features, and software techniques used to accelerate AI inference—including SIMD (SVE, Neon), low-bit quantization, and the KleidiAI library.

Each concept is taught using concise, interactive notebooks and narrated examples, enabling you to measure, tweak, and iterate on actual hardware like the Raspberry Pi 5 or AWS Graviton3 cloud instances.

Syllabus

  • Module 1: Challenges Facing Cloud and Edge GenAI Inference
  • Module 2: Generative AI Models
  • Module 3: ML Frameworks and Optimized Libraries
  • Module 4: Optimization for CPU Inference

Taught by

Arm Education


Subjects

Computer Science