What you need to know before you start
Start 6 June 2026 10:24
End 6 June 2026
Scale to 0 LLM Inference: Cost Efficient Open Model Deployment on Serverless GPUs
Devoxx
6076 Courses
17 minutes
Optional upgrade available
Not Specified
Progress at your own pace
Free Video
Optional upgrade available
Overview
Learn how to deploy open LLMs on serverless GPUs that scale to zero when idle. This session walks through running Ollama on serverless GPU infrastructure for cost-effective open LLM deployment.
Keep full control over both your models and your private data while optimizing performance and cost.
Syllabus
- **Introduction to Serverless GPU Computing**
- **Overview of Ollama and LLM Deployment**
- **Setting Up a Serverless Environment**
- **Deploying LLMs on Serverless GPUs**
- **Cost Optimization Strategies**
- **Maintaining Model and Data Privacy**
- **Performance Optimization**
- **Troubleshooting and Support**
- **Capstone Project**
- **Course Conclusion and Future Directions**
Subjects
Computer Science