What You Need to Know Before
You Start

Starts 10 June 2025 03:05

Ends 10 June 2025

00 days
00 hours
00 minutes
00 seconds
course image

Building Modern Data Lakes for Analytics Using Object Storage

Discover how to architect high-performance data lakes using object storage, focusing on MPP query engines and advanced features for scalable analytics across multi-cloud environments.
Presto Foundation via YouTube

Presto Foundation

2565 Courses


12 minutes

Optional upgrade avallable

Not Specified

Progress at your own speed

Free Video

Optional upgrade avallable

Overview

Discover how to architect high-performance data lakes using object storage, focusing on MPP query engines and advanced features for scalable analytics across multi-cloud environments.

Syllabus

  • Introduction to Data Lakes
  • Definition and components of a data lake
    Benefits of data lakes over traditional data warehouses
  • Overview of Object Storage
  • Key characteristics of object storage
    Comparison with other types of storage (block and file storage)
    Use cases in data lake architecture
  • Architecting Data Lakes
  • Core design principles
    Data ingestion strategies
    Ensuring data quality and consistency
    Data governance and security considerations
  • Multi-Cloud Data Lake Architectures
  • Advantages and challenges of multi-cloud environments
    Best practices for deploying data lakes across multiple clouds
    Data migration strategies and interoperability
  • MPP (Massively Parallel Processing) Query Engines
  • Functionality and benefits of MPP engines in data lakes
    Popular MPP query engines (e.g., Apache Presto, Amazon Redshift, Google BigQuery)
    Integration with object storage
  • Scalable Analytics Strategies
  • Techniques for optimizing performance and scaling analytics workloads
    Use of data partitioning, indexing, and caching
    Implementing workload management and optimization
  • Advanced Features for Analytics
  • Machine learning integration with data lakes
    Real-time analytics and stream processing
    Leveraging AI for enhanced data insights
  • Security and Compliance in Data Lakes
  • Data encryption and access control in object storage
    Regulatory compliance considerations
    Monitoring and auditing data access
  • Future Trends and Innovations
  • Emerging technologies in data lake ecosystems
    Trends in data analytics and storage solutions
    Preparing for future advancements in data processing and management
  • Case Studies and Practical Applications
  • Real-world examples of successful data lake implementations
    Lessons learned and best practices
  • Conclusion
  • Recap of key learnings
    Next steps and resources for continued learning

Subjects

Business