Wat je moet weten voordat je
begint

Start 17 July 2026 09:29

Einde 17 July 2026

00 Dagen

00 Uren

00 Minuten

00 Seconden

Registreren

Inference techniques for local and cloud LLM deployment

Master LLM inference techniques for deploying Llama models locally and in the cloud, including quantization, prompt pipelines, and scalable real-world applications.

Meta via Coursera

Not Specified

Optionele upgrade beschikbaar

Not Specified

Ga in je eigen tempo vooruit

Paid Course

Optionele upgrade beschikbaar

Overzicht

Expand your AI development skills by learning to run Llama models on local machines and at scale in the cloud. Master the principles of LLM inference, apply quantization for efficiency, and design prompt pipelines for real-world tasks.

Through hands-on projects, you’ll deploy an LLM-enabled tool that demonstrates scalability, performance, and practical impact.

Lesprogramma

Introduction to LLM (Large Language Models)

Overview of Llama models

Key concepts in LLMs: Parameters, training, and inference

LLM Inference Principles

Understanding inference in AI models

Differences between training and inference

Challenges in LLM inference

Local LLM Deployment

Setting up a local environment for LLM

Running Llama models on local machines

Optimizing local performance

Cloud-Based LLM Deployment

Introduction to cloud platforms for AI

Deploying Llama models in the cloud

Managing cloud resources for scalability

Efficiency in LLM Inference

Techniques for model optimization

Understanding and applying quantization

Balancing accuracy and efficiency

Designing Prompt Pipelines

Introduction to prompt engineering

Building and testing prompt pipelines

Adapting prompts for various tasks

Hands-on Projects

Project 1: Deploy a Llama model locally

Project 2: Scale an LLM on a cloud platform

Project 3: Design a prompt pipeline for a specific application

Evaluation and Performance Monitoring

Metrics for evaluating model performance

Tools for monitoring inference efficiency

Iterative improvements based on feedback

Real-World Applications and Impact

Case studies of LLM deployment in industry

Scalability and practical implications

Ethical considerations in LLM deployment

Course Recap and Future Trends

Review key learnings

Discussion of emerging technologies and trends in LLMs

Guidance on further learning and development in the field of AI

Gegeven door

Taught by Meta Staff

Vakgebieden

Computer Science

Wat je moet weten voordat je begint

Inference techniques for local and cloud LLM deployment

Not Specified

Not Specified

Paid Course

Overzicht

Lesprogramma

Gegeven door

Vakgebieden

AI for FP&A Automation & Modeling

FP&A with AI: Capstone Project

Interpretability of LLMs - Generating SAE Feature Descriptions - Spring 2026

CodeCloak: A DRL-Based Method for Mitigating Code Leakage by LLM Code Assistants

Generative AI for NLP with PyTorch

Machine Learning Engineer: ML and Deep Learning Models

Wat je moet weten voordat je
begint