מה צריך לדעת לפני
שתתחיל

מתחיל 6 June 2026 07:54

נגמר 6 June 2026

00 ימים
00 שעות
00 דקות
00 שניות
course image

Training Overparametrized Neural Networks: Early Alignment Phenomenon and Simplicity Bias

Investigate the early alignment phenomenon in the training dynamics of overparametrized neural networks. Understand the process where neurons orient themselves towards crucial directions, forming sparsity that enhances generalization capabilities, despite the presence of potential local minima.
Institut des Hautes Etudes Scientifiques (IHES) via YouTube

Institut des Hautes Etudes Scientifiques (IHES)

6076 קורסים


57 minutes

שדרוג אופציונלי זמין

Not Specified

התקדמות בקצב שלך

Free Video

שדרוג אופציונלי זמין

סקירה כללית

Investigate the early alignment phenomenon in the training dynamics of overparametrized neural networks. Understand the process where neurons orient themselves towards crucial directions, forming sparsity that enhances generalization capabilities, despite the presence of potential local minima.

סילבוס

  • Introduction to Neural Networks
  • Overview of neural network architectures
    Understanding parameters and overparameterization
  • Early Alignment Phase
  • Definition and importance
    Mathematical foundations
    Empirical evidence from recent studies
  • Dynamics of Neuron Alignment
  • Key directions and feature selection
    Role of gradients in early alignment
    Case studies of effective neuron alignment
  • Sparsity in Neural Networks
  • Definition and measurement of sparsity
    Connection between early alignment and sparsity
    Benefits of sparsity for neural network generalization
  • Simplicity Bias in Training
  • Understanding complexity versus simplicity
    How early alignment induces simplicity
    Simplicity bias effects on generalization
  • Training Dynamics and Local Minima
  • Exploration of local versus global minima
    Early alignment as a mechanism to avoid poor minima
    Practical techniques for navigating training landscapes
  • Practical Implications
  • Designing neural networks with alignment and sparsity in mind
    Real-world applications and case studies
    Tools and libraries for implementing the concepts
  • Advanced Topics and Current Research
  • Recent advancements in understanding early alignment
    Open research questions and future directions
  • Conclusion and Further Reading
  • Recap of key concepts
    Suggested literature for deeper exploration

נושאים

Computer Science