What You Need to Know Before You Start

Starts 8 June 2025 01:13

Ends 8 June 2025


Superintelligent Agents Pose Catastrophic Risks - Safety-Guaranteed LLMs

Simons Institute via YouTube



1 hour 12 minutes

Optional upgrade available

Not Specified

Progress at your own speed

Free Video

Optional upgrade available

Overview

Explore the catastrophic risks of superintelligent AI agents with Yoshua Bengio, who proposes safer alternatives like "Scientist AI" designed to explain rather than act, potentially accelerating scientific progress while avoiding dangerous agency-driven…

Syllabus

  • Introduction to Superintelligent AI
      • Definition and characteristics of superintelligent agents
      • Historical context and evolution of AI capabilities
  • Understanding Catastrophic Risks
      • The potential dangers posed by superintelligent agents
      • Case studies on AI risks and ethical dilemmas
  • Safety in AI Design
      • Principles of safe AI systems
      • Approaches to mitigating risks
  • Alternatives to Superintelligent Agents
      • Overview of "Scientist AI"
      • Differences between action-driven AI and explanation-driven AI
  • Case Study: Yoshua Bengio's Proposals
      • Analysis of Bengio's frameworks for AI safety
      • Evaluating "Scientist AI" in various domains
  • Agency and Action in AI
      • The risks of agency in AI systems
      • Benefits and limitations of minimizing agency
  • Techniques for Safety-Guaranteed LLMs
      • Design principles for safer language models
      • Balance between autonomy and control
  • Accelerating Scientific Progress with AI
      • Potential roles of AI in research and discovery
      • Ethical considerations and impact on society
  • Future Directions in AI Safety Research
      • Emerging trends and technologies
      • Opportunities for innovation and collaboration
  • Conclusion
      • Recap of key concepts
      • Guiding principles for future AI developments

Subjects

Computer Science