What You Need to Know Before
You Start
Starts 8 June 2025 00:39
Ends 8 June 2025
00
days
00
hours
00
minutes
00
seconds
1 hour 11 minutes
Optional upgrade avallable
Not Specified
Progress at your own speed
Free Video
Optional upgrade avallable
Overview
Explore the concept of simulating counterfactual training in the context of safety-guaranteed LLMs with Roger Grosse from the University of Toronto.
Syllabus
- Introduction to Counterfactual Training
- Theoretical Foundations of Counterfactuals
- Counterfactual Training in Large Language Models
- Safeguarding AI with Counterfactuals
- Techniques for Simulating Counterfactuals
- Case Studies: Real-world Applications
- Future Directions and Research Opportunities
- Practical Workshop: Implementing Counterfactual Training
- Course Review and Conclusion
Definition and importance in AI safety
Historical context and development
Overview of learning large language models (LLMs)
Counterfactual reasoning in AI
Causality and its relation to counterfactuals
Key mathematical formulations
Understanding language model architectures
Application of counterfactuals within LLM training
Case studies and examples of counterfactual training
Introducing AI safety concepts
Role of counterfactuals in enhancing model reliability
Ethical considerations and challenges
Simulation methodologies
Tools and software for counterfactual simulation
Best practices and common pitfalls
Analysis of successful counterfactual training implementations
Evaluative metrics and impact assessment
Emerging trends in counterfactual AI research
Potential for innovation in safety mechanisms
Discussion of open research questions
Hands-on session with expert guidance
Development of a simple counterfactual simulation
Collaborative problem-solving exercises
Summation of key concepts
Participant feedback and discussion
Future learning paths and resources
Subjects
Computer Science