What is AGI? Nobody agrees, and it’s tearing Microsoft and OpenAI apart.

Date:

Share:


The reported $100 billion profit threshold we mentioned earlier conflates commercial success with cognitive capability, as if a system’s ability to generate revenue says anything meaningful about whether it can “think,” “reason,” or “understand” the world like a human.

Sam Altman speaks onstage during The New York Times Dealbook Summit 2024 at Jazz at Lincoln Center on December 4, 2024, in New York City.


Credit:

Eugene Gologursky via Getty Images


Depending on your definition, we may already have AGI, or it may be physically impossible to achieve. If you define AGI as “AI that performs better than most humans at most tasks,” then current language models potentially meet that bar for certain types of work (which tasks, which humans, what is “better”?), but agreement on whether that is true is far from universal. This says nothing of the even murkier concept of “superintelligence”—another nebulous term for a hypothetical, god-like intellect so far beyond human cognition that, like AGI, defies any solid definition or benchmark.

Given this definitional chaos, researchers have tried to create objective benchmarks to measure progress toward AGI, but these attempts have revealed their own set of problems.

Why benchmarks keep failing us

The search for better AGI benchmarks has produced some interesting alternatives to the Turing Test. The Abstraction and Reasoning Corpus (ARC-AGI), introduced in 2019 by François Chollet, tests whether AI systems can solve novel visual puzzles that require deep and novel analytical reasoning.

“Almost all current AI benchmarks can be solved purely via memorization,” Chollet told Freethink in August 2024. A major problem with AI benchmarks currently stems from data contamination—when test questions end up in training data, models can appear to perform well without truly “understanding” the underlying concepts. Large language models serve as master imitators, mimicking patterns found in training data, but not always originating novel solutions to problems.

But even sophisticated benchmarks like ARC-AGI face a fundamental problem: They’re still trying to reduce intelligence to a score. And while improved benchmarks are essential for measuring empirical progress in a scientific framework, intelligence isn’t a single thing you can measure like height or weight—it’s a complex constellation of abilities that manifest differently in different contexts. Indeed, we don’t even have a complete functional definition of human intelligence, so defining artificial intelligence by any single benchmark score is likely to capture only a small part of the complete picture.



Source link

━ more like this

Labour MP Says UK Must ‘Double Down’ for Energy Security – London Business News | Londonlovesbusiness.com

Labour has announced its commitment to strengthening its Net Zero strategy, stating that expanding low-carbon energy is essential for the country to safeguard...

Three retro Mario titles are coming to Nintendo Switch Online on Mario Day

As if you needed reminding, next week is March 10, or MAR10 Day, as the marketing wizards at Nintendo have been calling it...

Labour MP’s partner ‘arrested on suspicion of spying for China’ – London Business News | Londonlovesbusiness.com

Three men have been taken into custody on suspicion of espionage for China. Officers arrested a 39-year-old man in London and a 43-year-old...

Iran will ‘set oil tankers on fire’ threatening ships delivering fuel supplies to the UK – London Business News | Londonlovesbusiness.com

Disruption in the Strait of Hormuz could significantly threaten global energy shipments, including fuel supplies to the United Kingdom. This strait is one of...

Unconfirmed reports suggest a submarine has ‘sunk’ an Iranian ship off the coast of Sri Lanka – London Business News | Londonlovesbusiness.com

At least 140 individuals are currently unaccounted for, and 32 others have sustained injuries following a reported submarine attack on an Iranian vessel,...
spot_img