Anthropic Announces Claude 3.5 Sonnet, First of Three 3.5 Releases

OpenAI and Google competitor Anthropic has announced a new member of its Claude large language model (LLM) family: Claude 3.5 Sonnet. This is the first release in the forthcoming Claude 3.5 product line, and the company says it surpasses its predecessors and competitors in intelligence, speed, and cost-efficiency, offering advanced capabilities at the mid-tier model price.

Operating at twice the speed of its predecessor, Claude 3 Opus, Claude 3.5 Sonnet sets benchmarks for graduate-level reasoning, undergraduate-level knowledge, and coding proficiency, the company says, and demonstrates significant improvements in understanding nuanced and complex instructions, humor, and producing high-quality content with a relatable tone. Claude 3.5 Sonnet also excels in visual reasoning tasks, outperforming Claude 3 Opus in standard vision benchmarks. Its capabilities include accurate transcription of text from imperfect images.

The company is billing the new LLM as ideal for complex tasks such as context-sensitive customer support and multi-step workflow orchestration. An internal evaluation showed Claude 3.5 Sonnet solving 64% of problems in agentic coding tests.

Anthropic launched its Claude 3 family in March. The company says it plans to release Claude 3.5 Haiku and Claude 3.5 Opus (the other members of the Claude LLM family) later this year, along with new modalities and enterprise application integrations. The company is also exploring features like Memory to enhance personalization and efficiency.

Claude 3.5 Sonnet is available now for free on the company's website and the Claude iOS app, with enhanced rate limits for Claude Pro and Team plan subscribers. It's also accessible via the Anthropic API, Amazon Bedrock, and Google Cloud’s Vertex AI. The model is priced at $3 per million input tokens and $15 per million output tokens, featuring a 200K token context window.

The company also introduced a new feature called "Artifacts," which is a kind of dynamic workspace designed to allow enterprise users to generate and interact with content, such as code snippets and text documents in real time.

About the Author

John K. Waters is the editor in chief of a number of Converge360.com sites, with a focus on high-end development, AI and future tech. He's been writing about cutting-edge technologies and culture of Silicon Valley for more than two decades, and he's written more than a dozen books. He also co-scripted the documentary film Silicon Valley: A 100 Year Renaissance, which aired on PBS.  He can be reached at [email protected].

Featured

  • Silhouettes of business professionals stand against a blurred futuristic city skyline at night, with a glowing digital network data connection

    It's Time for Higher Ed to Get Serious About AI Strategy

    Without a coordinated strategy that involves multiple academic and administrative units across the entire campus, colleges risk wasting resources, duplicating efforts, and ultimately failing to deliver on the promise of deploying technology to improve learning and operations.

  • Educational path and career development growth with neon icons for study, idea, graduation, and success

    How to Embrace Lifelong Learning as a Non-negotiable for Career Growth

    In a world shaped by rapid technological change and shifting economic forces, staying curious and committed to learning is the most powerful way to stay prepared.

  • large group of college students sitting on an academic quad

    Student Readiness: Learning to Learn

    Melissa Loble, Instructure's chief academic officer, recommends a focus on 'readiness' as a broader concept as we try to understand how to build meaningful education experiences that can form a bridge from the university to the workplace. Here, we ask Loble what readiness is and how to offer students the ability to 'learn to learn'.

  • AI word on microchip and colorful light spread

    Microsoft Unveils Maia 200 Inference Chip to Cut AI Serving Costs

    Microsoft recently introduced Maia 200, a custom-built accelerator aimed at lowering the cost of running artificial intelligence workloads at cloud scale, as major providers look to curb soaring inference expenses and lessen dependence on Nvidia graphics processors.