What you should know before you begin
Starts 6 June 2026 08:14
Ends 6 June 2026
Scale to 0 LLM Inference: Cost Efficient Open Model Deployment on Serverless GPUs
Devoxx
6076 courses
17 minutes
Optional upgrade available
Not Specified
Learn at your own pace
Free Video
Overview
Learn how to deploy open LLMs on serverless GPUs that scale to zero when idle. This session walks through running Ollama on serverless GPU infrastructure for cost-effective open LLM deployment.
Keep full control over both your models and your private data while optimizing performance and cost.
Syllabus
- **Introduction to Serverless GPU Computing**
- **Overview of Ollama and LLM Deployment**
- **Setting Up a Serverless Environment**
- **Deploying LLMs on Serverless GPUs**
- **Cost Optimization Strategies**
- **Maintaining Model and Data Privacy**
- **Performance Optimization**
- **Troubleshooting and Support**
- **Capstone Project**
- **Course Conclusion and Future Directions**
Subjects
Computer Science