Anthropic Releases Claude 3.5 Sonnet and Haiku, Expands AI with 'Computer Use' Public Beta -- Campus Technology

Artificial Intelligence

Anthropic Releases Claude 3.5 Sonnet and Haiku, Expands AI with 'Computer Use' Public Beta

By John K. Waters
11/07/24

Anthropic has unveiled two advanced AI models, Claude 3.5 Sonnet and Claude 3.5 Haiku, with significant improvements in functionality and performance, especially in coding. The upgraded Claude 3.5 Sonnet outperforms its predecessor in several benchmarks, the company said, while Claude 3.5 Haiku matches the intelligence of Anthropic's previous largest model, Claude 3 Opus, at a similar cost and speed.

Claude 3.5 Sonnet brings notable enhancements in coding. It improves its SWE-bench Verified score to 49.0%, outperforming other publicly available models and specialized systems. The model also advances in TAU-bench, an agentic tool use task, showing gains in both retail and airline domains.

GitLab, a web-based platform that helps teams collaborate on software development, tested the model for DevSecOps tasks and found it delivered stronger reasoning (up to 10% across use cases) with no added latency, making it an ideal choice to power multi-step software development processes.

The release announcement included support statements from key clients, including Cognition and The Browser Company, which highlighted the model's improvements in coding, planning, and automation. "Claude 3.5 Sonnet has outperformed every model we've tested in automating web-based workflows," a Browser Company spokesperson said.

Claude 3.5 Haiku, Anthropic's next-generation fast model, offers broad performance improvements at the same cost and speed as the previous Claude 3 Haiku. Scoring 40.6% on SWE-bench Verified, it surpasses older models, making it suitable for user-facing products and tasks requiring personalization, such as analyzing purchase history or managing inventory records. Claude 3.5 Haiku will launch later this month on Anthropic's API, Amazon Bedrock, and Google Cloud's Vertex AI.

Anthropic also introduced a new "computer use" capability in public beta. Claude 3.5 Sonnet is the first model to offer this feature, which enables the AI to simulate human interactions with computer interfaces — moving cursors, clicking, and typing. Initially experimental, this function aims to automate complex, multi-step tasks for developers. Replit, for example, is using Claude's UI navigation abilities for in-progress evaluations in their Replit Agent product.

Claude's OSWorld evaluation score — 14.9% in the screenshot-only category and 22.0% with additional steps — highlights its potential to mimic human-like computer operations. (OSWorld provides benchmarking multimodal agents for open-ended tasks in real computer environments.) However, Anthropic advises developers to start with low-risk applications, as the technology may occasionally struggle with tasks like scrolling and zooming. "This capability is imperfect but rapidly evolving, and we are proactive in its safe deployment," Anthropic stated, underscoring new classifiers developed to detect misuse and mitigate risks.

In collaboration with the US and UK AI Safety Institutes, Anthropic conducted pre-deployment testing to ensure Claude 3.5 Sonnet adheres to ASL-2 safety standards under its Responsible Scaling Policy. The company emphasizes a commitment to safe AI evolution, recognizing both the potential and implications of more capable systems.

"We're eager for developers to explore these advancements and provide feedback. This is just the beginning of a new chapter in working with Claude," Anthropic said.

About the Author

John K. Waters is the editor in chief of a number of Converge360.com sites, with a focus on high-end development, AI and future tech. He's been writing about cutting-edge technologies and culture of Silicon Valley for more than two decades, and he's written more than a dozen books. He also co-scripted the documentary film Silicon Valley: A 100 Year Renaissance, which aired on PBS. He can be reached at [email protected].

E-Mail this page

Printable Format

Featured

The 2025 NFL Draft and Ed Tech Selection: A Strategic Parallel

In the fast-evolving landscape of collegiate football, the NFL, and higher education, one might not immediately draw connections between the 2025 NFL Draft and the selection of proper educational technology for a college campus. However, upon closer examination, both processes share striking similarities: a rigorous assessment of needs, long-term strategic impact, talent or tool evaluation, financial considerations, and adaptability to a dynamic future.
Anthropic Develops AI 'Microscope' to Reveal the Hidden Mechanics of LLM Thought

Anthropic has unveiled new research tools designed to provide a rare glimpse into the hidden reasoning processes of advanced language models — like a "microscope" for AI.
Google, Microsoft Expand AI-Driven Search Capabilities

Recent announcements from Google and Microsoft highlight a slough of AI capabilities for their search tools.
Network to Code Launches AI Assistant for Enterprise Network Teams

Network automation firm Network to Code has launched NautobotGPT, an AI-powered assistant aimed at helping enterprise network engineers create, test, and troubleshoot automation tasks more efficiently.

CAMPUS TECHNOLOGY NEWS

Email Address*Country*Select primary job title/function*

Please type the letters/numbers you see above.

Anthropic Releases Claude 3.5 Sonnet and Haiku, Expands AI with 'Computer Use' Public Beta

Featured

The 2025 NFL Draft and Ed Tech Selection: A Strategic Parallel

Anthropic Develops AI 'Microscope' to Reveal the Hidden Mechanics of LLM Thought

Google, Microsoft Expand AI-Driven Search Capabilities

Network to Code Launches AI Assistant for Enterprise Network Teams

Portals

Artificial Intelligence

Cybersecurity

Data & Analytics

Learning Tools

Student Services

WEBCASTS

How Colleges and Universities Can Take the First Step with a Managed Services Provider

Avoiding Shiny Object Syndrome: How to Lay a Solid Foundation for Emerging Tech

How an AI-Powered Admissions Modernization effort Kicked Off a Data Transformation Journey at Illinois Tech

Whitepapers

The Pros and Cons of In-House IT

How Managed IT Services Solve Cybersecurity and Staffing Challenges and Save Colleges up to 30%

Transforming Student Engagement and Support with AI-Powered Communication

What Is Your Campus Security Posture in the Age of AI?

SPONSORED CONTENT