What You Need to Know Before
You Start
Starts 8 June 2025 14:02
Ends 8 June 2025
00
days
00
hours
00
minutes
00
seconds
3 hours 18 minutes
Optional upgrade avallable
Not Specified
Progress at your own speed
Paid Course
Optional upgrade avallable
Overview
In this course, we will be creating a data lake using AWS Lake Formation and bring data warehouse capabilites to the data lake to form the lakehouse architecture using Amazon Redshift. Using Lake Formation, we also collect and catalog data from different data sources, move the data into our S3 data lake, and then clean and classify them.
Syllabus
- Introduction to Data Lakes and Lakehouse Architecture
- Setting up the AWS Environment
- Introduction to AWS Lake Formation
- Creating a Data Lake in AWS Lake Formation
- Data Ingestion and Cataloging
- Data Cleaning and Classification
- Building Lakehouse with Amazon Redshift
- Security and Access Management
- Advanced Topics in Lake Formation
- Case Study and Hands-on Project
- Review and Next Steps
Overview of Data Lakes
Understanding Lakehouse architecture
Key concepts in AWS Lake Formation and Amazon Redshift
Introduction to AWS services needed
Configuring AWS Identity and Access Management (IAM)
Setting up AWS Simple Storage Service (S3) buckets
Overview of AWS Lake Formation features
Understanding Lake Formation permissions
Registering S3 locations
Creating databases and tables in the data catalog
Using Blueprints for data import
Collecting data from various sources
Automating data movement into S3 using Lake Formation
Cataloging data with AWS Glue
Data cleaning best practices
Using transforms and classifiers in Lake Formation
Ensuring data quality and consistency
Setting up Redshift clusters
Integrating Redshift with Lake Formation
Querying data across the data lake using Redshift Spectrum
Implementing granular data access controls
Configuring data encryption and privacy settings
Auditing and monitoring data activity
Automating workflows with AWS Glue and Step Functions
Implementing machine learning models in the data lake
Real-world example of building a secure data lake
Group project: Design and implement a mini-lakehouse architecture
Best practices for maintaining a data lake
Upcoming AWS features and updates in data lake technology
Resources for further learning
Taught by
Yomi Owoyemi
Subjects
Business