What You Need to Know Before
You Start
Starts 8 June 2025 01:13
Ends 8 June 2025
00
days
00
hours
00
minutes
00
seconds
Superintelligent Agents Pose Catastrophic Risks - Safety-Guaranteed LLMs
Explore the catastrophic risks of superintelligent AI agents with Yoshua Bengio, who proposes safer alternatives like "Scientist AI" designed to explain rather than act, potentially accelerating scientific progress while avoiding dangerous agency-driven…
Simons Institute
via YouTube
Simons Institute
2544 Courses
1 hour 12 minutes
Optional upgrade avallable
Not Specified
Progress at your own speed
Free Video
Optional upgrade avallable
Overview
Explore the catastrophic risks of superintelligent AI agents with Yoshua Bengio, who proposes safer alternatives like "Scientist AI" designed to explain rather than act, potentially accelerating scientific progress while avoiding dangerous agency-driven…
Syllabus
- Introduction to Superintelligent AI
- Understanding Catastrophic Risks
- Safety in AI Design
- Alternatives to Superintelligent Agents
- Case Study: Yoshua Bengio's Proposals
- Agency and Action in AI
- Techniques for Safety-Guaranteed LLMs
- Accelerating Scientific Progress with AI
- Future Directions in AI Safety Research
- Conclusion
Definition and characteristics of superintelligent agents
Historical context and evolution of AI capabilities
The potential dangers posed by superintelligent agents
Case studies on AI risks and ethical dilemmas
Principles of safe AI systems
Approaches to mitigating risks
Overview of "Scientist AI"
Differences between action-driven AI and explanation-driven AI
Analysis of Bengio's frameworks for AI safety
Evaluating "Scientist AI" in various domains
The risks of agency in AI systems
Benefits and limitations of minimizing agency
Design principles for safer language models
Balance between autonomy and control
Potential roles of AI in research and discovery
Ethical considerations and impact on society
Emerging trends and technologies
Opportunities for innovation and collaboration
Recap of key concepts
Guiding principles for future AI developments
Subjects
Computer Science