What You Need to Know Before
You Start

Starts 4 July 2025 21:06

Ends 4 July 2025

00 Days

00 Hours

00 Minutes

00 Seconds

Measurements for Capabilities and Hazards

Join us to explore thorough frameworks for measuring AI's vast capabilities and potential risks, with a special emphasis on safety evaluation methods tailored for large language models. Discover critical insights within artificial intelligence and computer science landscapes through this expert-led course available on YouTube.

Simons Institute via YouTube

59 minutes

Optional upgrade avallable

Not Specified

Progress at your own speed

Free Video

Optional upgrade avallable

Overview

Syllabus

Introduction to AI Capabilities and Hazards

Overview of AI systems and their applications

Importance of evaluating AI capabilities and hazards

Key terminology and concepts

Measurement Frameworks for AI Capabilities

Definitions of AI capabilities

Methods for assessing AI performance

Comparisons between human and AI capabilities

Metrics for Evaluating AI Models

Quantitative and qualitative metrics

Benchmarking AI models

Real-world examples of AI performance measurement

Assessing Safety in AI Systems

Understanding AI safety and risk assessment

Key principles of AI safety evaluation

Case studies on AI safety incidents

Evaluation Methodologies for Large Language Models (LLMs)

Overview of LLMs and their unique characteristics

Common safety challenges with LLMs

Tools and techniques for evaluating LLM safety

Potential Hazards Associated with Large Language Models

Identifying ethical and safety concerns

Analysis of bias, misinformation, and malicious use

Strategies for mitigating risks

Safety and Reliability Testing Protocols

Testing frameworks for AI systems

Scenario-based testing and simulation

Continuous monitoring and feedback loops

Current Research and Future Directions

Emerging trends in AI capability measurement

Advances in hazard evaluation methodologies

Open challenges and research opportunities in AI safety

Capstone Project

Practical application of measurement frameworks

Designing a safety evaluation plan for a given AI system

Presentations and peer feedback

Course Conclusion and Further Resources

Summary of key learnings

Recommended readings and resources for continued study

Subjects

Computer Science

What You Need to Know Before You Start

Measurements for Capabilities and Hazards

59 minutes

Not Specified

Free Video

Overview

Syllabus

Subjects

Intellectual Property Law in Digital Age

Add Useful AI to Your Web App - Not Just Chatbots

AI and Industrial Innovation

AI and Sustainable Convergence

AI Medical and Health Applications: Innovative Practices and Inclusive Design

AI and Ethical Governance

What You Need to Know Before
You Start