מה צריך לדעת לפני
שתתחיל
מתחיל 6 June 2026 10:24
נגמר 6 June 2026
Scale to 0 LLM Inference: Cost Efficient Open Model Deployment on Serverless GPUs
Devoxx
6076 קורסים
17 minutes
שדרוג אופציונלי זמין
Not Specified
התקדמות בקצב שלך
Free Video
שדרוג אופציונלי זמין
סקירה כללית
Discover the innovative approach to deploying LLM models on serverless GPUs that scale efficiently to zero during inactivity. This session will guide you through the process of running Ollama on these advanced infrastructures, allowing for cost-effective open LLM deployment.
Gain complete control over both models and private data, optimizing performance and expenditure.
סילבוס
- **Introduction to Serverless GPU Computing**
- **Overview of Ollama and LLM Deployment**
- **Setting Up a Serverless Environment**
- **Deploying LLMs on Serverless GPUs**
- **Cost Optimization Strategies**
- **Maintaining Model and Data Privacy**
- **Performance Optimization**
- **Troubleshooting and Support**
- **Capstone Project**
- **Course Conclusion and Future Directions**
נושאים
Computer Science