What You Need to Know Before
You Start

Starts 8 June 2025 06:56

Ends 8 June 2025

00 days
00 hours
00 minutes
00 seconds
course image

Superintelligent Agents Pose Catastrophic Risks - Safety-Guaranteed LLMs

Explore the catastrophic risks of superintelligent AI agents and Yoshua Bengio's proposed safer alternative: a non-agentic "Scientist AI" designed to explain rather than act, potentially accelerating scientific progress while avoiding dangerous agency.
Simons Institute via YouTube

Simons Institute

2544 Courses


1 hour 14 minutes

Optional upgrade avallable

Not Specified

Progress at your own speed

Free Video

Optional upgrade avallable

Overview

Explore the catastrophic risks of superintelligent AI agents and Yoshua Bengio's proposed safer alternative:

a non-agentic "Scientist AI" designed to explain rather than act, potentially accelerating scientific progress while avoiding dangerous agency.

Syllabus

  • Introduction to Superintelligent AI
  • Definition and Characteristics of Superintelligent Agents
    Historical Context and Developments
    Current and Potential Capabilities
  • Catastrophic Risks of Superintelligent Agents
  • Alignment Problem and Control Challenges
    Scenarios of Unaligned AI
    Case Studies and Theoretical Models
  • Overview of AI Safety Research
  • Key Concepts in AI Safety
    Existing Approaches and Their Limitations
    Ethical and Societal Implications
  • Yoshua Bengio's "Scientist AI" Concept
  • Overview of Non-Agentic AI Models
    Comparison with Agentic AIs
    Potential Benefits for Scientific Advancements
  • Designing Safety-Guaranteed LLMs
  • Principles for Safe LLM Design
    Ensuring Explainability and Transparency
    Mechanisms to Avoid Unwanted Agency
  • Practical Implementations and Case Studies
  • Successful Use Cases of Non-Agentic AIs
    Lessons Learned from Past Implementations
    Pathways to Wider Adoption
  • Critiques and Limitations
  • Potential Drawbacks of the "Scientist AI"
    Addressing Critics and Continuing Debate
  • Future Directions and Research Opportunities
  • Emerging Trends in AI Safety
    Potential Collaborations and Community Efforts
    Developing Guidelines for Safe AI Research and Deployment
  • Conclusion
  • Summary of Key Insights
    Reflecting on the Path Forward for Safe AI Innovators

Subjects

Computer Science