What You Need to Know Before
You Start
Starts 10 June 2025 03:05
Ends 10 June 2025
00
days
00
hours
00
minutes
00
seconds
Building Modern Data Lakes for Analytics Using Object Storage
Discover how to architect high-performance data lakes using object storage, focusing on MPP query engines and advanced features for scalable analytics across multi-cloud environments.
Presto Foundation
via YouTube
Presto Foundation
2565 Courses
12 minutes
Optional upgrade avallable
Not Specified
Progress at your own speed
Free Video
Optional upgrade avallable
Overview
Discover how to architect high-performance data lakes using object storage, focusing on MPP query engines and advanced features for scalable analytics across multi-cloud environments.
Syllabus
- Introduction to Data Lakes
- Overview of Object Storage
- Architecting Data Lakes
- Multi-Cloud Data Lake Architectures
- MPP (Massively Parallel Processing) Query Engines
- Scalable Analytics Strategies
- Advanced Features for Analytics
- Security and Compliance in Data Lakes
- Future Trends and Innovations
- Case Studies and Practical Applications
- Conclusion
Definition and components of a data lake
Benefits of data lakes over traditional data warehouses
Key characteristics of object storage
Comparison with other types of storage (block and file storage)
Use cases in data lake architecture
Core design principles
Data ingestion strategies
Ensuring data quality and consistency
Data governance and security considerations
Advantages and challenges of multi-cloud environments
Best practices for deploying data lakes across multiple clouds
Data migration strategies and interoperability
Functionality and benefits of MPP engines in data lakes
Popular MPP query engines (e.g., Apache Presto, Amazon Redshift, Google BigQuery)
Integration with object storage
Techniques for optimizing performance and scaling analytics workloads
Use of data partitioning, indexing, and caching
Implementing workload management and optimization
Machine learning integration with data lakes
Real-time analytics and stream processing
Leveraging AI for enhanced data insights
Data encryption and access control in object storage
Regulatory compliance considerations
Monitoring and auditing data access
Emerging technologies in data lake ecosystems
Trends in data analytics and storage solutions
Preparing for future advancements in data processing and management
Real-world examples of successful data lake implementations
Lessons learned and best practices
Recap of key learnings
Next steps and resources for continued learning
Subjects
Business