What You Need to Know Before
You Start

Starts 14 July 2026 08:00

Ends 14 July 2026

00 Days

00 Hours

00 Minutes

00 Seconds

The Computer Vision Bootcamp

Master Computer Vision by exploring Vision Transformers, Meta's SAM, and deploying scalable pipelines on AWS using Python and production-ready AI infrastructure.

via Zero To Mastery

6 hours

Optional upgrade avallable

Advanced

Progress at your own speed

Paid Course

Optional upgrade avallable

Overview

Learn how Computer Vision models work, including Vision Transformers and Meta’s SAM, and how they power real-world image systems. Then put your knowledge into practice by deploying a scalable computer vision pipeline on AWS using production-ready tools and infrastructure.Understand how Vision Transformers process imagesBreak down attention math without the hand-wavingUse Meta’s SAM for prompt-based segmentationVisualize and evaluate segmentation outputsConnect detection models with segmentation pipelinesBuild scalable computer vision workflows in PythonDeploy vision systems on AWS infrastructureDesign production-ready AI pipelines for real products

Syllabus

Introduction

Introduction

What We're Building

Exercise: Meet Your Classmates and Instructor

Course Resources

ZTM Plugin + Understanding Your Video Player

Set Your Learning Streak Goal

Mathematics behind Vision Transformers

Vision Transformers vs Convolutional Neural Networks

Quadratic Operations

Introduction to ViTs and Joint Training with Embeddings

Understanding Attention Mechanisms, Brief Summary

Understanding the Full ViT Pipeline

Let's Have Some Fun (+ More Resources)

Mathematics Behind Meta's SAM (Segment Anything Model)

Introduction to Prompt Encoders for SAM

SAM AutoPrompt Mode

SAM Manual Click Mode

ViT Embeddings inside SAM

Calculating Attention Score for Vision Transformers in SAM

How SAM is Trained

Calculating Prompt Self Attention for SAM

Prompt Image Cross Attention

Image to Prompt Cross Attention

(Optional) Finishing SAM Example Part 1

(Optional) Finishing SAM Example Part 2

Finishing

Unlimited Updates

Setting up Our AWS Environment

Creating our SagemakerAI Domain

Starting Domain and Understanding Pricing

Installing Libraries

Stopping Instances and Servers

Course Check-In

Setting up Open Source Models Like Meta's SAM

Downloading the SAM Model from Meta

Updating IAM Permissions

Importing Libraries

Understanding how we use Rekognition with SAM

Defining Helper Functions

Clarification Regarding Helper Functions

Rekognition Detection and Filtering

Initialise SAM Model from S3

Main Processing Function Part 1

Main Processing Function Part 2

Running the Main Processing Cell

Implement a New Life System

Visualizing our Outputs

Visualizing Rekognition Detections

Visualize All SAM Masks

Visualizing Match Quality IOU Scores Part 1

Visualizing Match Quality IOU Scores Part 2

Visualizing Image Segmentations with Bounding Boxes

Visualizing Masks and Labels Without Bounding Boxes

Visualizing Segementations in Black and White Masks

Exercise: Imposter Syndrome

Saving Results to S3

Saving Metadata to S3

Save Images to S3

Saving Individual Masks to S3

Testing + Setup

Adding a GPU Server to our Notebook and AWS Quotas

Testing Our Full Pipeline

Minor Corrections

Productionizing + Cleanup

Where To Go From Here?

Thank You!

Review This Course!

Become An Alumni

Learning Guideline

ZTM Events Every Month

LinkedIn Endorsements

Taught by

Patrik Szepesi

Subjects

Artificial Intelligence

What You Need to Know Before You Start

The Computer Vision Bootcamp

6 hours

Advanced

Paid Course

Overview

Syllabus

Taught by

Subjects

CodeCloak: A DRL-Based Method for Mitigating Code Leakage by LLM Code Assistants

Generative AI for NLP with PyTorch

Machine Learning Engineer: ML and Deep Learning Models

Data Preparation & Applied Machine Learning

Building an AI Cooking Helper with Django

Feature Engineering and Feature Stores for AI and ML

What You Need to Know Before
You Start