AI

OpenAI unveils GPT-4o mini, a small AI model powering ChatGPT

Comment

OpenAI logo with spiraling pastel colors (Image Credits: Bryce Durbin / TechCrunch)
Image Credits: Bryce Durbin / TechCrunch

OpenAI introduced GPT-4o mini on Thursday, its latest small AI model. The company says GPT-4o mini, which is cheaper and faster than OpenAI’s current cutting edge AI models, is being released for developers, as well as through the ChatGPT web and mobile app for consumers starting today. Enterprise users will gain access next week.

The company says GPT-4o mini outperforms industry leading small AI models on reasoning tasks involving text and vision. As small AI models improve, they are becoming more popular for developers due to their speed and cost efficiencies compared to larger models, such as GPT-4 Omni or Claude 3.5 Sonnet. They’re a useful option for high volume, simple tasks that developers might repeatedly call on an AI model to perform.

GPT-4o mini will replace GPT-3.5 Turbo as the smallest model OpenAI offers. The company claims its newest AI model scores 82% on MMLU, a benchmark to measure reasoning, compared to 79% for Gemini 1.5 Flash and 75% for Claude 3 Haiku, according to data from Artificial Analysis.

Further, OpenAI says GPT-4o mini is significantly more affordable to run than its previous frontier models, and more than 60% cheaper than GPT-3.5 Turbo. Today, GPT-4o mini supports text and vision in the API, and OpenAI says the model will support video and audio capabilities in the future.

“For every corner of the world to be empowered by AI, we need to make the models much more affordable,” said OpenAI’s Head of Product API, Olivier Godemont, in an interview with TechCrunch. “I think GPT-4o mini is a really big step forward in that direction.”

For developers building on OpenAI’s API, GPT4o mini is priced at 15 cents per million input tokens and 60 cents per million output tokens. The model has a context window of 128,000 tokens, roughly the length of a book, and a knowledge cutoff of October 2023.

OpenAI would not disclose exactly how large GPT-4o mini is, but said it’s roughly in the same tier as other small AI models, such as Llama 3 8b, Claude Haiku, and Gemini 1.5 Flash. However, the company claims it to be faster, more cost efficient, and smarter than these industry leading small models. Before launching, OpenAI tested GPT-4o mini on the LMSYS.org chatbot arena to gauge its competitiveness.

More TechCrunch

TikTok is partnering with the music distribution service DistroKit to fast-track the creation of artist accounts for members. The ByteDance-owned short video platform introduced an Artist Account feature last year…

TikTok fast-tracks artist account creation for DistroKid members

Ford is still pushing forward on electrification, notably by increasing hybrid options.

Ford’s EV plans are in flux once again as it invests $3B into its biggest trucks

OpenAI introduced GPT-4o mini on Thursday, its latest small AI model. The company says GPT-4o mini, which is cheaper and faster than OpenAI’s current cutting edge AI models, is being…

OpenAI unveils GPT-4o mini, a small AI model powering ChatGPT
Image Credits: Bryce Durbin / TechCrunch

Featured Article

USPS shared customer postal addresses with Meta, LinkedIn and Snap

The U.S. Postal Service confirmed it took action to “remediate” the data sharing following a TechCrunch investigation.

USPS shared customer postal addresses with Meta, LinkedIn and Snap

The automotive industry is in the midst of dramatic technological change as companies seek out new ways to make money beyond building and selling gas-powered cars. And GM CEO and…

GM CEO Mary Barra is coming to TechCrunch Disrupt 2024

Amazon’s Prime Day event clocked record sales this year, as U.S. consumers spent $14.2 billion across July 16 and 17, according to Adobe Analytics. This marks an 11% jump from…

Amazon Prime Day 2024 sales hit record $14.2 billion

The now-scrapped moon mission would have been the U.S. space agency’s first resource-mapping mission off planet Earth.

NASA cancels $450M Viper moon mission, dashing ice prospecting dreams

Marathon Fusion emerges from stealth with a plan to help fusion power plants extract tritium, a necessary fuel.

Without this company’s technology, future fusion power plants might never light up

A security researcher found that some traffic lights controllers are exposed on the internet and could be manipulated.

Hackers could create traffic jams thanks to flaw in traffic light controller, researcher says

Strong magnets are essential for the type of fusion power being pursued by Commonwealth Fusion Systems and a number of other startups.

WHAM! Nuclear fusion experiment hits new record for magnet strength

Coast, a startup that describes itself as “a financial services platform for the future of transportation,” has raised $40 million in Series B funding — just four months after announcing…

Fintech startup Coast lands $40M just 4 months after its last $25M raise

Italy’s competition watchdog is investigating how Google gets user consent in order to link their activity across different services for ad profiling.

Google accused of misleading consumers to grab more data for ads

Featured Article

Proton launches ‘privacy-first’ AI writing assistant for email that runs on-device

Proton has launched a new AI-enabled writing assistant that helps users compose emails based on simple prompts.

Proton launches ‘privacy-first’ AI writing assistant for email that runs on-device

The Mumbai-based firm said one of its multisig wallets had suffered a security breach, and it was temporarily pausing all withdrawals from the platform.

India’s WazirX confirms security breach following a $230M ‘suspicious transfer’

The Korean tech giant is acquiring Oxford Semantic Technologies, a U.K.-based knowledge graph startup that has built an AI reasoning engine.

Samsung to acquire UK-based knowledge graph startup Oxford Semantic Technologies

A team of former Google DeepMind researchers are betting that their unique behavior engine for NPCs will make games more fun and dynamic.

Artificial Agency raises $16M to use AI to make NPCs feel more realistic in video games

When the war between Israel and Hamas broke out last October, we examined its potential impact on the tech ecosystems in Israel and Palestine. Nine months later, the prevailing sentiment…

Israel’s startup scene shows reslience despite nine months of war

Both of Hofy’s co-founders, Sami Bouremoum and Michael Ginzo, along with the rest of the startup’s leadership team and 120 employees, are joining Deel to build Deel IT.

Deel acquires Hofy to build its own IT device management service

Given the years and money invested in the Zone’s creation, there’s logic in Dyson’s decision to turn its R&D work into a more standard set of headphones with the OnTrac.

Dyson’s new OnTrac headphones don’t purify air

Bill Weber is out as chief executive at Firefly Aerospace, following a nearly two-year stint in the role, the maker of launch vehicles, lunar landers and orbital vehicles announced late Wednesday.…

Bill Weber out as CEO of Firefly Aerospace

NASA was looking for proposals that maximized the use of flight heritage because reliability will be key.

SpaceX’s vehicle to deorbit the International Space Station is a Dragon on steroids

Slope’s founders both have a background in AI, so large language models power the company’s underwriting infrastructure. 

How a B2B payments startup won Max, Jack and Sam Altman, JP Morgan as investors

TTT models, a new architecture, could effectively replace transformers if they scale up as their creators suggest they will.

TTT models might be the next frontier in generative AI

Vitalik Buterin, the co-founder of Ethereum, issued a warning on Wednesday against choosing a candidate purely based on whether they claim to be “pro-crypto.” In a blog post, Buterin said…

Ethereum co-founder’s warning against ‘pro-crypto’ candidates: ‘Are they in it for the right reasons?’

Adtech startup InMobi is eyeing a valuation of about $10 billion in an initial public offering it is planning for next year.

InMobi eyes $10 billion valuation in 2025 India IPO

Featured Article

A comprehensive list of 2024 tech layoffs

The tech layoff wave is still going strong in 2024. Following significant workforce reductions in 2022 and 2023, this year has already seen 60,000 job cuts across 254 companies, according to independent layoffs tracker Layoffs.fyi. Companies like Tesla, Amazon, Google, TikTok, Snap and Microsoft have conducted sizable layoffs in the…

A comprehensive list of 2024 tech layoffs

The Designer app features “prompt templates” that are designed to help jumpstart the creative process.

Microsoft’s AI-powered, Canva-like Designer app lands on iOS and Android

When Jaclyn Rice Nelson and Noah Gale launched AI talent and services company Tribe AI in 2019, they had to convince companies that having an AI strategy mattered. After the…

Tribe AI raised venture capital to keep up with demand after six years of bootstrapping

Spotify’s AI DJ, which first launched last year, is meant to serve as a smart audio guide that introduces music using a convincingly realistic voice.

Spotify adds a Spanish-speaking AI DJ, ‘Livi’

Adobe’s report noted that consumers spent $7.2 billion on the first day as compared to $6.4 billion on the first day in 2023.

US buyers spent $7.2B on the first day of Amazon’s Prime Day sales event