What you need to know before you start
Starts 5 June 2026 06:17
Ends 5 June 2026
Probabilistic Safety Guarantees Using Model Internals
Simons Institute
6076 courses
46 minutes
Optional upgrade available
Not Specified
Progress at your own pace
Free Video
Overview
Join us for an insightful exploration of probabilistic safety guarantees for language models. Led by Jacob Hilton from the Alignment Research Center, this session focuses on the critical analysis of model internals.
Suited to enthusiasts and professionals in artificial intelligence and computer science, this YouTube session offers insight into improving model safety and reliability.
Syllabus
- Introduction to Probabilistic Safety
- Fundamentals of Model Internals
- Analyzing Model Internals
- Probabilistic Methods in AI Safety
- Developing Safety Guarantees
- Case Studies and Practical Examples
- Implementing Safety Frameworks
- Evaluating Safety in Language Models
- Tools and Resources
- Guest Lecture by Jacob Hilton
- Conclusion and Future Directions
- Final Project
Topics
Computer Science