What you need to know before you start
Starts 6 June 2026 13:48
Ends 6 June 2026
Sesame AI and RVQs - The Network Architecture Behind Viral Speech Models
Neural Breakdown with AVB
6076 courses
19 minutes
Optional upgrade available
Not Specified
Progress at your own pace
Free Video
Overview
Explore the inner workings of the Sesame Conversational Speech Model. Learn how the Mimi encoder uses split residual vector quantization (RVQ) tokenization to represent speech as semantic and acoustic codes efficiently.
See how the autoregressive Transformer backbone enables seamless, natural speech interactions. The video is hosted on YouTube and is aimed at enthusiasts in artificial intelligence and computer science.
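For readers new to the term, residual vector quantization can be sketched in a few lines. This is a minimal illustration of plain RVQ only, not the Mimi encoder or Sesame's implementation: the codebooks are tiny hand-written lists rather than learned ones, and the function names `nearest`, `rvq_encode`, and `rvq_decode` are hypothetical. Each stage picks the codebook entry closest to what the previous stages left unexplained, so a vector becomes one integer code per stage.

```python
def nearest(codebook, vec):
    """Index of the codebook entry closest to vec (squared Euclidean distance)."""
    best_idx, best_dist = 0, float("inf")
    for i, entry in enumerate(codebook):
        dist = sum((a - b) ** 2 for a, b in zip(entry, vec))
        if dist < best_dist:
            best_idx, best_dist = i, dist
    return best_idx


def rvq_encode(vec, codebooks):
    """Quantize vec stage by stage; each stage encodes the previous residual."""
    residual = list(vec)
    codes = []
    for codebook in codebooks:
        idx = nearest(codebook, residual)
        codes.append(idx)
        # Subtract the chosen entry so the next stage sees only the leftover error.
        residual = [r - c for r, c in zip(residual, codebook[idx])]
    return codes


def rvq_decode(codes, codebooks):
    """Reconstruct the vector by summing the selected entry from every stage."""
    out = [0.0] * len(codebooks[0][0])
    for idx, codebook in zip(codes, codebooks):
        out = [o + c for o, c in zip(out, codebook[idx])]
    return out


# Toy codebooks (assumed for illustration): a coarse stage, then a finer one.
toy_codebooks = [
    [[0.0, 0.0], [1.0, 0.0], [0.0, 1.0]],
    [[0.0, 0.0], [0.5, 0.0], [0.0, 0.5]],
]

codes = rvq_encode([1.0, 0.5], toy_codebooks)
print(codes)                             # [1, 2]
print(rvq_decode(codes, toy_codebooks))  # [1.0, 0.5]
```

In this toy setup the first code captures the coarse shape of the vector and the second refines it. How the video's "split" variant divides the codes into semantic and acoustic groups is covered in the session itself and is not reproduced here.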
Syllabus
- Introduction to Conversational Speech Models
- Sesame Conversational Speech Model Architecture
- Mimi Encoder and Tokenization
- Split Residual Vector Quantization (RVQ)
- Semantic and Acoustic Codes
- Autoregressive Transformer Backbone
- Applications of Sesame AI
- Practical Implementation and Case Studies
Topics
Computer Science