OpenAI Launches Slimmer, Cheaper GPT-4o Mini -- Campus Technology

Artificial Intelligence

OpenAI Launches Slimmer, Cheaper GPT-4o Mini

By John K. Waters
07/18/24

OpenAI has announced GPT-4o Mini, a slimmed down, more affordable version of its flagship multimodal GPT-4o model. The "mini" version, which replaces the GPT-3.5 model, is designed to "significantly expand the range of applications built with AI by making intelligence much more affordable," the company said in a statement.

Free and paid users of ChatGPT, including those on the Teams plan, will have access to GPT-4o mini today, and the company plans to roll it out to enterprise customers next week.

This new model was designed to enable a broad range of tasks with its low cost and latency, the company said, including applications that require multiple model calls, large volumes of context, and real-time customer interactions. The company is supporting is "faster, cheaper" claim with some impressive benchmark test scores. The GPT-4o Mini scores 82% on the MMLU (Massive Multitask Language Understanding) benchmark for evaluating the capabilities of language models, and outperforms GPT-4 on chat preferences in the LMSYS Chatbot Arena leaderboard.

Key benchmark results for GPT-4o Mini include:

Reasoning Tasks: 82.0% on MMLU, compared to 77.9% for Gemini Flash and 73.8% for Claude Haiku.
Math and Coding Proficiency: 87.0% on MGSM (math reasoning) and 87.2% on HumanEval (coding performance), outperforming Gemini Flash and Claude Haiku.
Multimodal Reasoning: 59.4% on MMMU, leading Gemini Flash and Claude Haiku.

Currently, GPT-4o Mini supports text and vision inputs through the API, and the company plans to include text, image, video, and audio inputs and outputs in the future. The model features a 128K token context window and up-to-date knowledge to October 2023. The improved tokenizer shared with GPT-4o is meant to enhance cost-effectiveness for handling non-English text.

OpenAI has built comprehensive safety measures into GPT-4o Mini that align with its Preparedness Framework and voluntary commitments. More than 70 external experts evaluated GPT-4o to identify potential risks, and their insights have improved the safety of both GPT-4o and GPT-4o Mini, the company says.

GPT-4o Mini is the first model to apply OpenAI's instruction hierarchy method, which enhances its ability to resist jailbreaks, prompt injections, and system prompt extractions. This innovation makes the model's responses more reliable and safer for large-scale applications, the company says.

OpenAI isn't the first vendor to offer a lightweight versions of its main product offering. Google has Gemini Nano and Anthropic has Claude Haiku.

GPT-4o Mini is available now to ChatGPT Free, Plus, and Team users starting today, and will be accessible to enterprise users next week. Fine-tuning options for GPT-4o Mini will be available in the coming days.

About the Author

John K. Waters is the editor in chief of a number of Converge360.com sites, with a focus on high-end development, AI and future tech. He's been writing about cutting-edge technologies and culture of Silicon Valley for more than two decades, and he's written more than a dozen books. He also co-scripted the documentary film Silicon Valley: A 100 Year Renaissance, which aired on PBS. He can be reached at [email protected].

E-Mail this page

Printable Format

Featured

New Copilot Studio Feature to Introduce AI Agent Building Tools

Microsoft has announced plans to roll out a public preview of a new feature within Copilot Studio, allowing users to create autonomous AI "agents" designed to handle routine tasks.
California AI Watermarking Bill Garners OpenAI Support

ChatGPT creator OpenAI is backing a California bill that would require tech companies to label AI-generated content in the form of a digital "watermark." The proposed legislation, known as the "California Digital Content Provenance Standards" (AB 3211), aims to ensure transparency in digital media by identifying content created through artificial intelligence. This requirement would apply to a broad range of AI-generated material, from harmless memes to deepfakes that could be used to spread misinformation about political candidates.
Are Organizations Moving from Cloud to On-Premises? AWS Says Yes; Gartner Says It's Not Widespread

Is there a widespread backlash to cloud computing that sees organizations moving their IT operations back to on-premises data centers? The longstanding debate over that very question was rekindled by recent comments from AWS about cloud repatriation among its customer base.
5 Strategies for Democratizing Data to Enhance Student Outcomes

Data's role in enhancing educational outcomes is monumental, and it's time we harness this potential fully.

CAMPUS TECHNOLOGY NEWS

Email Address*Country*Select primary job title/function*

Please type the letters/numbers you see above.

OpenAI Launches Slimmer, Cheaper GPT-4o Mini

Featured

New Copilot Studio Feature to Introduce AI Agent Building Tools

California AI Watermarking Bill Garners OpenAI Support

Are Organizations Moving from Cloud to On-Premises? AWS Says Yes; Gartner Says It's Not Widespread

5 Strategies for Democratizing Data to Enhance Student Outcomes

Portals

Artificial Intelligence

Cybersecurity

Data & Analytics

Learning Tools

Student Services

WEBCASTS

From Applicant to Alumni: Integrating the Student Lifecycle into Identity Management

Getting Identity Right: Why Flexibility Is Key to a Modern IAM Solution

An AI-Native Network Puts Location Services at the Core of the Student Experience

Upskill Your Students with the Right Mix of Training in Evolving Tech

Whitepapers

The Faculty Guide to Getting Started with Gen AI

From Complexity to Clarity: Securing Cloud Environments in Higher Education

4 Causes of Student Disengagement (& How to Overcome Them)

How to Overcome 4 Common Work Execution Challenges in Higher Education With Smartsheet