Wat je moet weten voordat je
begint

Start 6 June 2026 07:31

Einde 6 June 2026

00 Dagen
00 Uren
00 Minuten
00 Seconden
course image

Open Source and the Data Lakehouse - Understanding Components and Technologies

Join us on a journey to understand the transformative power of data lakehouses, where cutting-edge technology meets economic efficiency. Delve into the world of Apache Arrow, Iceberg, and Project Nessie, and discover how they serve as revolutionary alternatives to traditional data warehouses. This exploration offers insights into how these o.
OSACon via YouTube

OSACon

6076 Cursussen


26 minutes

Optionele upgrade beschikbaar

Not Specified

Ga in je eigen tempo vooruit

Free Video

Optionele upgrade beschikbaar

Overzicht

Join us on a journey to understand the transformative power of data lakehouses, where cutting-edge technology meets economic efficiency. Delve into the world of Apache Arrow, Iceberg, and Project Nessie, and discover how they serve as revolutionary alternatives to traditional data warehouses.

This exploration offers insights into how these open-source components maximize both performance and affordability, paving the way for advancements in data handling and storage.

Lesprogramma

  • Introduction to Data Lakehouses
  • Definition and key characteristics
    Comparison with data warehouses and data lakes
    Benefits and limitations of data lakehouses
  • Core Components of Data Lakehouses
  • Storage and compute separation
    Metadata management
    Query engines and optimization
  • Apache Arrow
  • Overview of Apache Arrow
    In-memory columnar format
    Performance benefits for data lakehouses
    Integration with other data technologies
  • Apache Iceberg
  • Introduction to Apache Iceberg
    Architecture and features
    Advantages over traditional table formats
    Use cases and implementation examples
  • Project Nessie
  • Overview of Project Nessie
    Version control for data lakehouses
    Branching, merging, and reproducibility
    Ecosystem and integration
  • Comparing Open Source Data Lakehouse Technologies
  • Use cases and performance comparisons
    Cost and affordability analysis
    Case studies of successful implementations
  • Practical Considerations and Best Practices
  • Data governance and security
    Performance optimization strategies
    Choosing the right components for specific needs
  • Future Trends and Developments in Data Lakehouses
  • Emerging technologies and innovations
    Industry adoption and evolution
    Speculations on future directions in data management
  • Course Review and Final Thoughts
  • Recap of key concepts and technologies
    Discussion on the impact of data lakehouses in the industry
    Q&A and interactive discussions

Vakgebieden

Business