OpenAI introduced its latest AI model, GPT-4o Mini, on Thursday. The new model is designed to be cheaper and faster than OpenAI’s existing cutting-edge AI models. It is available starting today for developers and, for consumers, through the ChatGPT web and mobile apps; enterprise users will gain access next week.
Enhanced Performance and Affordability
The GPT-4o Mini is designed to outperform industry-leading small AI models on reasoning tasks involving text and vision. As smaller AI models improve, their popularity among developers increases due to their speed and cost efficiencies compared to larger models like GPT-4 Omni or Claude 3.5 Sonnet.
Replacing GPT-3.5 Turbo as OpenAI’s smallest offering, GPT-4o Mini posts strong benchmark results. According to Artificial Analysis, the model scores 82% on MMLU, a benchmark for measuring reasoning, compared to 79% for Gemini 1.5 Flash and 75% for Claude 3 Haiku. On MGSM, a math reasoning benchmark, it scores 87%, ahead of Flash at 78% and Haiku at 72%.
Cost-Efficiency and Future Capabilities
OpenAI’s GPT-4o Mini is more than 60% cheaper than GPT-3.5 Turbo, making it more affordable than the company’s previous models. It currently supports text and vision in the API, with plans to support video and audio capabilities in the future.
OpenAI’s Head of Product API, Olivier Godement, emphasized the need to make AI models more affordable for every corner of the world, stating that GPT-4o Mini is a significant step forward in this direction.
OpenAI’s GPT-4o Mini is priced at 15 cents per million input tokens and 60 cents per million output tokens, with a context window of 128,000 tokens and an October 2023 knowledge cutoff.
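To make the pricing concrete, here is a minimal Python sketch that turns the per-million-token prices quoted above into a per-request cost estimate. The function name and the example token counts are illustrative assumptions, not part of any OpenAI library, and actual billing may differ from this simple arithmetic.

```python
# Prices as quoted in the article, in US dollars per one million tokens.
INPUT_USD_PER_MILLION = 0.15
OUTPUT_USD_PER_MILLION = 0.60


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request from its token counts.

    Hypothetical helper for illustration only; it applies the quoted
    prices linearly and ignores any discounts or billing rules.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        input_tokens * INPUT_USD_PER_MILLION
        + output_tokens * OUTPUT_USD_PER_MILLION
    )
    return total / 1_000_000


if __name__ == "__main__":
    # Example: a 100,000-token prompt with a 2,000-token reply (made-up sizes).
    cost = estimate_cost_usd(100_000, 2_000)
    print(f"Estimated cost: ${cost:.4f}")
```

Under these assumptions, a request with 100,000 input tokens and 2,000 output tokens comes to under two cents, with the input side accounting for most of the total.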
OpenAI’s GPT-4o Mini, compared to other small AI models like Llama 3 8b, Claude Haiku, and Gemini 1.5 Flash, has been found to be faster, more cost-efficient, and smarter.
Artificial Analysis co-founder George Cameron highlighted GPT-4o Mini’s speed: the model has a median output speed of 202 tokens per second, more than 2x faster than GPT-4o and GPT-3.5 Turbo, making it a promising option for speed-dependent use cases, including consumer applications and agentic LLMs.
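As a rough illustration of what that output speed means in practice, the following Python sketch estimates how long a response of a given length would take to stream at the quoted median rate. The helper name and the 1,000-token example are assumptions for illustration; the estimate covers generation time only and ignores network latency and the time before the first token arrives.

```python
# Median output speed quoted in the article, in tokens per second.
MEDIAN_OUTPUT_TOKENS_PER_SECOND = 202.0


def estimate_generation_seconds(
    output_tokens: int,
    tokens_per_second: float = MEDIAN_OUTPUT_TOKENS_PER_SECOND,
) -> float:
    """Estimate seconds needed to generate `output_tokens` at a steady rate.

    Hypothetical helper for illustration only; real throughput varies
    from request to request around the median.
    """
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return output_tokens / tokens_per_second


if __name__ == "__main__":
    # Example: a 1,000-token reply (made-up size).
    seconds = estimate_generation_seconds(1_000)
    print(f"About {seconds:.1f} seconds of generation time")
```

At the quoted rate, a 1,000-token reply would take roughly five seconds to generate.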
New Tools for Enterprise Customers
In addition to the GPT-4o Mini announcement, OpenAI has introduced the Enterprise Compliance API, a suite of tools designed to assist businesses in regulated industries like finance, healthcare, legal services, and government in complying with logging and audit requirements.
These tools will allow admins to audit and manage ChatGPT Enterprise data, including conversations, uploaded files, and workspace GPTs, while also providing more control over workspace GPTs.
OpenAI’s latest GPT-4o Mini and enterprise tools aim to make AI more accessible, affordable, and efficient for various users and applications.