Was Sie vorher wissen sollten
bevor Sie beginnen

Beginnt 4 June 2026 14:20

Endet 4 June 2026

00 Tage
00 Stunden
00 Minuten
00 Sekunden
course image

Data Munging to Wrangling - 7 Steps to Mastering Data Preparation for Data Science

Discover essential techniques for data preparation in AI/ML projects. Learn to source, clean, and transform data effectively, enhancing the quality and predictive power of machine learning models.
PASS Data Community Summit via YouTube

PASS Data Community Summit

6076 Kurse


1 hour 16 minutes

Optionales Upgrade verfügbar

Not Specified

Lernen Sie in Ihrem eigenen Tempo

Conference Talk

Optionales Upgrade verfügbar

Übersicht

Discover essential techniques for data preparation in AI/ML projects. Learn to source, clean, and transform data effectively, enhancing the quality and predictive power of machine learning models.

Lehrplan

  • Introduction to Data Preparation
  • Importance of data preparation in AI/ML
    Overview of the 7-step process
  • Step 1: Data Sourcing
  • Identifying data needs
    Exploring various data sources
    Data collection techniques
  • Step 2: Data Understanding
  • Exploring data structure and content
    Statistical data exploration
    Identifying data outliers and anomalies
  • Step 3: Data Cleaning
  • Handling missing data
    Techniques for dealing with noise and errors
    Data deduplication methods
  • Step 4: Data Transformation
  • Data normalization and standardization
    Feature scaling and selection
    Encoding categorical variables
  • Step 5: Data Enrichment
  • Data integration from multiple sources
    Augmentation techniques
    Use of external datasets for enrichment
  • Step 6: Data Reduction
  • Dimensionality reduction techniques
    Feature extraction and selection
    Data summarization
  • Step 7: Data Validation and Testing
  • Ensuring data quality and integrity
    Data validation techniques
    Creating and using validation datasets
  • Conclusion and Best Practices
  • Recap of key techniques and tools
    Tips for efficient data preparation
    Common pitfalls and how to avoid them
  • Practical Project
  • Apply the 7-step process on a real-world dataset
    Present findings and insights

Fachgebiete

Conference Talks