OpenAI Upgrades Its Smartest AI Model With Improved Reasoning Skills

Date:

Share:


OpenAI today announced an improved version of its most capable artificial intelligence model to date—one that takes even more time to deliberate over questions—just a day after Google announced its first model of this type.

OpenAI’s new model, called o3, replaces o1, which the company introduced in September. Like o1, the new model spends time ruminating over a problem in order to deliver better answers to questions that require step-by-step logical reasoning. (OpenAI chose to skip the “o2” moniker because it’s already the name of a mobile carrier in the UK.)

“We view this as the beginning of the next phase of AI,” said OpenAI CEO Sam Altman on a livestream Friday. “Where you can use these models to do increasingly complex tasks that require a lot of reasoning.”

The o3 model scores much higher on several measures than its predecessor, OpenAI says, including ones that measure complex coding-related skills and advanced math and science competency. It is three times better than o1 at answering questions posed by ARC-AGI, a benchmark designed to test an AI models’ ability to reason over extremely difficult mathematical and logic problems they’re encountering for the first time.

Google is pursuing a similar line of research. Noam Shazeer, a Google researcher, yesterday revealed in a post on X that the company has developed its own reasoning model, called Gemini 2.0 Flash Thinking. Google’s CEO, Sundar Pichai, called it “our most thoughtful model yet” in his own post.

The two dueling models show competition between OpenAI and Google to be fiercer than ever. It is crucial for OpenAI to demonstrate that it can keep making advances as it seeks to attract more investment and build a profitable business. Google is meanwhile desperate to show that it remains at the forefront of AI research.

The new models also show how AI companies are increasingly looking beyond simply scaling up AI models in order to wring greater intelligence out of them.

OpenAI says there are two versions of the new model, o3 and o3-mini. The company is not making the models publicly available yet but says it will invite outsiders to apply to perform testing of them. OpenAI today also revealed more details of techniques used to align o1. This involves having the model reason about the nature of the request it is given to interrogate whether it may contravene its guardrails.



Source link

━ more like this

Here’s how Apple may make the next Vision headset more affordable

A new report suggests that Apple may be lining up its plans for the launch of its more budget-friendly Vision headset. As spotted...

How to Set Up a Virtual Call Center the Right (& Easy) Way

Setting up a virtual call center is easier and faster than you might think. A lot of people waste too much time trying to...

3 great free movies to stream this weekend (December 20-22)

Table of Contents Table of Contents Air Force One (1997) Wonka (2023) Tommy Boy (1995) This weekend, more than ever, is the time to take your family to...

James Bond (the movie franchise, not the spy) may be in deep jeopardy

Movie icon and super spy James Bond seemed to be on another rise to the top of the box office just a few...

Google’s Gemini Deep Research tool is now available globally

A little more than a week after announcing Gemini Deep Research, Google is making the tool available to more people. As of today,...
spot_img