What You Need to Know Before
You Start
Starts 7 June 2025 18:21
Ends 7 June 2025
00
days
00
hours
00
minutes
00
seconds
Unlocking New Pose in HPC - Containerization, Cloud, and GPU-based Workloads
Explore containerization, cloud, and GPU-based workloads in HPC. Learn about Kubernetes for resource management, GPU virtualization, custom scheduling, and monitoring for efficient AI development and deployment.
CNCF [Cloud Native Computing Foundation]
via YouTube
CNCF [Cloud Native Computing Foundation]
2544 Courses
41 minutes
Optional upgrade avallable
Not Specified
Progress at your own speed
Conference Talk
Optional upgrade avallable
Overview
Explore containerization, cloud, and GPU-based workloads in HPC. Learn about Kubernetes for resource management, GPU virtualization, custom scheduling, and monitoring for efficient AI development and deployment.
Syllabus
- Introduction to HPC and AI Workloads
- Fundamentals of Containerization
- Deep Dive into Kubernetes
- GPU Virtualization and Utilization
- Custom Scheduling in HPC
- Monitoring and Performance Tuning
- Cloud Integration with HPC
- Case Studies and Real-World Applications
- Future Trends and Innovations in HPC
- Course Conclusion
Overview of HPC and its relevance to AI
Introduction to GPU-based workloads
The role of cloud computing in HPC
Basics of containers and their benefits in HPC
Container orchestration tools overview
Kubernetes architecture and components
Setting up a Kubernetes cluster for HPC
Deploying and managing applications with Kubernetes
Resource management techniques in Kubernetes
Understanding GPU architecture and capabilities
Tools and techniques for GPU virtualization
Best practices for maximizing GPU utilization in HPC
Introduction to custom schedulers in Kubernetes
Designing custom scheduling algorithms for optimized performance
Hands-on exercises with Kubernetes scheduling
Monitoring tools and techniques for Kubernetes
Analyzing performance bottlenecks in GPU-based workloads
Techniques for effective performance tuning
Exploring cloud service providers and their offerings for HPC
Design patterns for hybrid cloud HPC environments
Case studies of successful cloud-enabled HPC deployments
Analyzing real-world AI applications using HPC
Lessons learned from deploying AI workloads in HPC clusters
Emerging technologies and their potential impact
Research directions in AI and HPC integration
Recap of key concepts learned
Final project: designing a scalable AI workload deployment strategy using Kubernetes and cloud resources.
Subjects
Conference Talks