Date: July 19, 2024
OpenAI has launched a smaller and cheaper version of its flagship GPT-4o for web and mobile app consumers, including developers.
OpenAI has launched GPT-4o mini, a smaller and cheaper version of GPT-4o. The small AI model outperforms existing cutting-edge AI models. The mini version was released yesterday for developers, web, and mobile application users of ChatGPT. The AI model will be released for Enterprise users by next week.
For tasks that involve text and vision reasoning, GPT-4o mini outperforms Gemini 1.5 Flash, Llama 3 (70B), NeMo, GPT-3.5 Turbo, Reka Edge, and many more competitive smaller models. An independent Artificial Analysis shows a glaring difference in the performance results between GPT-4o and other available small AI models. GPT-40 mini scored 82% on MMLU, a benchmark to measure reasoning, compared to 79% for Gemini 1.5 Flash and 75% for Claude 3 Haiku.
“For every corner of the world to be empowered by AI, we need to make the models much more affordable. I think GPT-4o mini is a really big step forward in that direction”
- OpenAI’s head of Product API
GPT-4o mini will also replace GPT-3.5 Turbo as the smallest AI model offered by the company. It is still unclear if the existing users of Turbo will be shifted to GPT-4o mini or not. The company claims that its offering is much more affordable and consumes less power. Mini comes with text and vision capabilities, but the company says that video and audio capabilities will be added soon.
“Relative to comparable models, GPT-4o mini is very fast, with a median output speed of 202 tokens per second. This is more than 2X faster than GPT-4o and GPT-3.5 Turbo and represents a compelling offering for speed-dependent use-cases including many consumer applications and agentic approaches to using LLMs,” said George Cameron, Co-Founder at Artificial Analysis, in an email to a tech media house.
For developers, GPT-4o mini is priced at 15 cents per million input tokens and 60 cents per million output tokens. The model comes with a context window of 128,000 tokens that transforms roughly to the length of a book. OpenAI has not revealed how big the GPT-4o mini actually is but claims that it is similar to other small AI models.
By Arpit Dubey
Arpit is a dreamer, wanderer, and tech nerd who loves to jot down tech musings and updates. With a knack for crafting compelling narratives, Arpit has a sharp specialization in everything: from Predictive Analytics to Game Development, along with artificial intelligence (AI), Cloud Computing, IoT, and let’s not forget SaaS, healthcare, and more. Arpit crafts content that’s as strategic as it is compelling. With a Logician's mind, he is always chasing sunrises and tech advancements while secretly preparing for the robot uprising.
OpenAI Is Building an Audio-First AI Model And It Wants to Put It in Your Pocket
New real-time audio model targeted for Q1 2026 alongside consumer device ambitions.
Nvidia in Advanced Talks to Acquire Israel's AI21 Labs for Up to $3 Billion
Deal would mark chipmaker's fourth major Israeli acquisition and signal shifting dynamics in enterprise AI.
Nvidia Finalizes $5 Billion Stake in Intel after FTC approval
The deal marks a significant lifeline for Intel and signals a new era of collaboration between two of America's most powerful chipmakers.
Manus Changed How AI Agents Work. Now It's Coming to 3 Billion Meta Users
The social media giant's purchase of the Singapore-based firm marks its third-largest acquisition ever, as the race for AI dominance intensifies.