What You Need to Know Before
You Start

Starts 7 June 2025 14:42

Ends 7 June 2025

00 days
00 hours
00 minutes
00 seconds
course image

Building Toward Computer Use with Anthropic

Explore Anthropic's Computer Use capability, enabling AI models to navigate interfaces through image reasoning and tool use. Learn to build applications with Claude API, multimodal prompting, and create assistants that interact with computers like humans.
DeepLearning.AI via Coursera

DeepLearning.AI

2019 Courses


1 hour 2 minutes

Optional upgrade avallable

Not Specified

Progress at your own speed

Paid Course

Optional upgrade avallable

Overview

Building Towards Computer Use with Anthropic introduces an innovative capability from Anthropic that enables models to interact with and navigate computer interfaces. Taught by Colt Steele, Anthropic’s Head of Curriculum, this course covers Anthropic’s family of models and the building blocks that lead to the amazing new application – Computer Use.

Computer Use utilizes the capabilities of the latest models including image reasoning and tool use to enable an LLM-based agent to use a computer. Like a human user, the model processes an image of the screen, analyzes it to understand what’s going on, and navigates the computer by issuing mouse clicks and generating keyboard strokes to get things done.

In this course, you’ll learn the features that lead up to computer use from working with the Anthropic’s API, to multimodal prompting, prompt caching, and tool use, ending in a demo that combines these features to build an AI assistant that uses a computer. In detail, you’ll:

1.

Learn Anthropic’s approach to AI research, principles of AI safety, alignment, and interpretability while understanding the key differences between its models. 2. Make API requests to Claude, format messages for better responses, and control API parameters like system prompts, temperature, and max tokens for optimal responses. 3.

Write multi-modal prompts that combine text and image content blocks and build with streaming responses. 4. Learn effective prompting techniques such as using prompt templates, structuring prompts in XML, and providing examples to get consistent high-quality responses. 5.

Learn to implement prompt caching and see how it can reduce costs and latency. 6. Understand tool-use workflows and build a chatbot that can call different tools in response to users’ queries. 7.

See all these concepts come together in a demo that uses Anthropic Computer Use to achieve a task on a computer. Start utilizing Anthropic’s family of models to build towards Computer Use applications.

Syllabus

  • Building Toward Computer Use with Anthropic
  • Building Towards Computer Use with Anthropic introduces an innovative capability from Anthropic that enables models to interact with and navigate computer interfaces.Taught by Colt Steele, Anthropic’s Head of Curriculum, this course covers Anthropic’s family of models and the building blocks that lead to the amazing new application – Computer Use. Computer Use utilizes the capabilities of the latest models including image reasoning and tool use to enable an LLM-based agent to use a computer. Like a human user, the model processes an image of the screen, analyzes it to understand what’s going on, and navigates the computer by issuing mouse clicks and generating keyboard strokes to get things done.In this course, you’ll learn the features that lead up to computer use from working with the Anthropic’s API, to multimodal prompting, prompt caching, and tool use, ending in a demo that combines these features to build an AI assistant that uses a computer.In detail, you’ll: 1. Learn Anthropic’s approach to AI research, principles of AI safety, alignment, and interpretability while understanding the key differences between its models. 2. Make API requests to Claude, format messages for better responses, and control API parameters like system prompts, temperature, and max tokens for optimal responses. 3. Write multi-modal prompts that combine text and image content blocks and build with streaming responses. 4. Learn effective prompting techniques such as using prompt templates, structuring prompts in XML, and providing examples to get consistent high-quality responses. 5. Learn to implement prompt caching and see how it can reduce costs and latency. 6. Understand tool-use workflows and build a chatbot that can call different tools in response to users’ queries. 7. See all these concepts come together in a demo that uses Anthropic Computer Use to achieve a task on a computer.Start utilizing Anthropic’s family of models to build towards Computer Use applications.

Taught by

Colt Steele


Subjects

Computer Science