מה צריך לדעת לפני
שתתחיל
מתחיל 4 June 2026 10:08
נגמר 4 June 2026
Monitoring GPUs at Scale for AI - ML and HPC Clusters
CNCF [Cloud Native Computing Foundation]
6076 קורסים
36 minutes
שדרוג אופציונלי זמין
Not Specified
התקדמות בקצב שלך
Conference Talk
שדרוג אופציונלי זמין
סקירה כללית
Learn how NVIDIA monitors GPU clusters for AI/ML workloads using open-source tools, addressing deployment, maintenance, security, and scale challenges for various user personas.
סילבוס
- Introduction to GPU Monitoring
- Understanding GPU Architectures and Performance Metrics
- Tools for Monitoring NVIDIA GPUs
- Deployment of Monitoring Solutions at Scale
- Maintenance and Updates
- Security Considerations in GPU Monitoring
- Scaling GPU Monitoring Solutions
- Addressing User Personas in GPU Monitoring
- Case Studies and Real-world Examples
- Practical Exercises and Lab Sessions
- Conclusion and Future Trends
- Q&A and Course Wrap-up
נושאים
Conference Talks