Close Menu
Alpha Leaders
  • Home
  • News
  • Leadership
  • Entrepreneurs
  • Business
  • Living
  • Innovation
  • More
    • Money & Finance
    • Web Stories
    • Global
    • Press Release
What's On
2026 America Innovates | Responsible For All Our Digital Maps, Jack Dangermond Loves The Word ‘Where’

2026 America Innovates | Responsible For All Our Digital Maps, Jack Dangermond Loves The Word ‘Where’

22 May 2026
Indeed chief economist says we’re entering an era of ‘great mismatch’

Indeed chief economist says we’re entering an era of ‘great mismatch’

22 May 2026
The Post-‘The Boys’ Finale ‘Vought Rising’ Trailer Is Here, And Quite Good

The Post-‘The Boys’ Finale ‘Vought Rising’ Trailer Is Here, And Quite Good

22 May 2026
Facebook X (Twitter) Instagram
Facebook X (Twitter) Instagram
Alpha Leaders
newsletter
  • Home
  • News
  • Leadership
  • Entrepreneurs
  • Business
  • Living
  • Innovation
  • More
    • Money & Finance
    • Web Stories
    • Global
    • Press Release
Alpha Leaders
Home » Will A.I. Soon Outsmart Humans? Play This Puzzle to Find Out.
Business

Will A.I. Soon Outsmart Humans? Play This Puzzle to Find Out.

Press RoomBy Press Room26 March 20257 Mins Read
Facebook Twitter Copy Link Pinterest LinkedIn Tumblr Email WhatsApp
Will A.I. Soon Outsmart Humans? Play This Puzzle to Find Out.

In 2019, an A.I. researcher, François Chollet, designed a puzzle game that was meant to be easy for humans but hard for machines.

The game, called ARC, became an important way for experts to track the progress of artificial intelligence and push back against the narrative that scientists are on the brink of building A.I. technology that will outsmart humanity.

Mr. Chollet’s colorful puzzles test the ability to quickly identify visual patterns based on just a few examples. To play the game, you look closely at the examples and try to find the pattern.

Each example uses the pattern to transform a grid of colored squares into a new grid of colored squares:

The pattern is the same for every example.

Now, fill in the new grid by applying the pattern you learned in the examples above.

For years, these puzzles proved to be nearly impossible for artificial intelligence, including chatbots like ChatGPT.

A.I. systems typically learned their skills by analyzing huge amounts of data culled from across the internet. That meant they could generate sentences by repeating concepts they had seen a thousand times before. But they couldn’t necessarily solve new logic puzzles after seeing only a few examples.

That is, until recently. In December, OpenAI said that its latest A.I. system, called OpenAI o3, had surpassed human performance on Mr. Chollet’s test. Unlike the original version of ChatGPT, o3 was able to spend time considering different possibilities before responding.

Some saw it as proof that A.I. systems were approaching artificial general intelligence, or A.G.I., which describes a machine that’s as smart as a human. Mr. Chollet had created his puzzles as a way of showing that machines were still a long way from this ambitious goal.

But the news also exposed the weaknesses in benchmark tests like ARC, short for Abstraction and Reasoning Corpus. For decades, researchers have set up milestones to track A.I.’s progress. But once these milestones were reached, they were exposed as insufficient measures of true intelligence.

Arvind Narayanan, a Princeton computer science professor and co-author of the book “AI Snake Oil,” said that any claim that the ARC test measured progress toward A.G.I. was “very much iffy.”

Still, Mr. Narayanan acknowledged that OpenAI’s technology demonstrated impressive skills in passing the ARC test. Some of the puzzles are not as easy as the one you just tried.

The one below is little harder, and it, too, was correctly solved by OpenAI’s new A.I. system:

A puzzle like this shows that OpenAI’s technology is getting better at working through logic problems. But the average person can solve puzzles like this one in seconds. OpenAI’s technology consumed significant computing resources to pass the test.

Last June, Mr. Chollet teamed up with Mike Knoop, co-founder of the software company Zapier, to create what they called the ARC Prize. The pair financed a contest that promised $1 million to anyone who built an A.I. system that exceeded human performance on the benchmark, which they renamed “ARC-AGI.”

Companies and researchers submitted over 1,400 A.I. systems, but no one won the prize. All scored below 85 percent, which marked the performance of a “smart” human.

OpenAI’s o3 system correctly answered 87.5 percent of the puzzles. But the company ran afoul of competition rules because it spent nearly $1.5 million in electricity and computing costs to complete the test, according to pricing estimates.

OpenAI was also ineligible for the ARC Prize because it was not willing to publicly share the technology behind its A.I. system through a practice called open sourcing. Separately, OpenAI ran a “high-efficiency” variant of o3 that scored 75.7 percent on the test and cost less than $10,000.

“Intelligence is efficiency. And with these models, they are very far from human-level efficiency,” Mr. Chollet said.

(The New York Times sued OpenAI and its partner, Microsoft, in December for copyright infringement of news content related to A.I. systems.)

On Monday, the ARC Prize introduced a new benchmark, ARC-AGI-2, with hundreds of additional tasks. The puzzles are in the same colorful, grid-like game format as the original benchmark, but are more difficult.

“It’s going to be harder for humans, still very doable,” said Mr. Chollet. “It will be much, much harder for A.I. — o3 is not going to be solving ARC-AGI-2.”

Here is a puzzle from the new ARC-AGI-2 benchmark that OpenAI’s system tried and failed to solve. Remember, the same pattern applies to all the examples.

Now try to fill in the grid below according to the pattern you found in the examples:

This shows that although A.I. systems are better at dealing with problems they have never seen before, they still struggle.

Here are a few additional puzzles from ARC-AGI-2, which focuses on problems that require multiple steps of reasoning:

As OpenAI and other companies continue to improve their technology, they may pass the new version of ARC. But that does not mean that A.G.I. will be achieved.

Judging intelligence is subjective. There are countless intangible indicators of intelligence, from composing works of art to navigating moral dilemmas to intuiting emotions.

Companies like OpenAI have built chatbots that can answer questions, write poetry and even solve logic puzzles. In some ways, they have already exceeded the powers of the brain. OpenAI’s technology has outperformed its chief scientist, Jakub Pachocki, on a competitive programming test.

But these systems still make mistakes that the average person would never make. And they struggle to do simple things that humans can handle.

“You’re loading the dishwasher, and your dog comes over and starts licking the dishes. What do you do?” said Melanie Mitchell, a professor in A.I. at the Santa Fe Institute. “We sort of know how to do that, because we know all about dogs and dishes and all that. But would a dishwashing robot know how to do that?”

To Mr. Chollet, the ability to efficiently acquire new skills is something that comes naturally to humans but is still lacking in A.I. technology. And it’s what he has been targeting with the ARC-AGI benchmarks.

In January, the ARC Prize became a nonprofit foundation that serves as a “north star for A.G.I.” The ARC Prize team expects ARC-AGI-2 to last for about two years before it is solved by A.I. technology — though they would not be surprised if it happened sooner.

They have already started work on ARC-AGI-3, which they hope to debut in 2026. An early mock-up hints at a puzzle that involves interacting with a dynamic, grid-based game.

A.I. researcher François Chollet designed a puzzle game meant to be easy for humans but hard for machines.

Kelsey McClellan for The New York Times

Early mock-up for ARC-AGI-3, a benchmark that could involve interacting with a dynamic, grid-based game.

ARC Prize Foundation

This is a step closer to what people deal with in the real world — a place filled with movement. It does not stand still like the puzzles you tried above.

Even this, however, will go only part of the way toward showing when machines have surpassed the brain. Humans navigate the physical world — not just the digital. The goal posts will continue to shift as A.I. advances.

“If it’s no longer possible for people like me to produce benchmarks that measure things that are easy for humans but impossible for A.I.,” Mr. Chollet said, “then you have A.G.I.”

Artificial General Intelligence Artificial Intelligence Melanie (1958- ) Mitchell OpenAI Labs research Tests and Examinations vis-design
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email Copy Link

Related Articles

Here’s How Much More You’re Spending on Gas Because of the Iran War

Here’s How Much More You’re Spending on Gas Because of the Iran War

22 May 2026
McKinsey partner says up to 50% of work hours could be transformed within the next 5 years

McKinsey partner says up to 50% of work hours could be transformed within the next 5 years

21 May 2026
Video: Jury Rejects Elon Musk’s Lawsuit Against OpenAI and Microsoft

Video: Jury Rejects Elon Musk’s Lawsuit Against OpenAI and Microsoft

19 May 2026
5 Benefits And Risks Of Using AI For Cybersecurity

5 Benefits And Risks Of Using AI For Cybersecurity

18 May 2026
Targeting Undruggable Proteins With A Molecular Glue

Targeting Undruggable Proteins With A Molecular Glue

16 May 2026
xAI Cofounder Igor Babuschkin In Talks To Raise Up To  Billion For A New AI Startup

xAI Cofounder Igor Babuschkin In Talks To Raise Up To $1 Billion For A New AI Startup

15 May 2026
Don't Miss
Unwrap Christmas Sustainably: How To Handle Gifts You Don’t Want

Unwrap Christmas Sustainably: How To Handle Gifts You Don’t Want

By Press Room27 December 2024

Every year, millions of people unwrap Christmas gifts that they do not love, need, or…

Exclusive: DeFi platform Azura launches after raising .9 million from Initialized

Exclusive: DeFi platform Azura launches after raising $6.9 million from Initialized

22 October 2024
Walmart dominated, while Target spiraled: the winners and losers of retail in 2024

Walmart dominated, while Target spiraled: the winners and losers of retail in 2024

30 December 2024
Stay In Touch
  • Facebook
  • Twitter
  • Pinterest
  • Instagram
  • YouTube
  • Vimeo
Latest Articles
Current price of oil as of May 22, 2026

Current price of oil as of May 22, 2026

22 May 20261 Views
6 Teachable Moments From An Atlanta Rush Hour Downpour

6 Teachable Moments From An Atlanta Rush Hour Downpour

22 May 20261 Views
I’ve spent 25 years in venture capital. Here’s how it quietly shut ordinary Americans out of the AI wealth boom—and what could fix it

I’ve spent 25 years in venture capital. Here’s how it quietly shut ordinary Americans out of the AI wealth boom—and what could fix it

22 May 20261 Views
The Importance Of Red Teaming For Scaling Enterprise AI Agents

The Importance Of Red Teaming For Scaling Enterprise AI Agents

22 May 20262 Views

Recent Posts

  • 2026 America Innovates | Responsible For All Our Digital Maps, Jack Dangermond Loves The Word ‘Where’
  • Indeed chief economist says we’re entering an era of ‘great mismatch’
  • The Post-‘The Boys’ Finale ‘Vought Rising’ Trailer Is Here, And Quite Good
  • ‘You kind of ruined it with your trans obsession’: House points fingers as Smithsonian Women’s museum funding fails
  • Current price of oil as of May 22, 2026

Recent Comments

No comments to show.
About Us
About Us

Alpha Leaders is your one-stop website for the latest Entrepreneurs and Leaders news and updates, follow us now to get the news that matters to you.

Facebook X (Twitter) Pinterest YouTube WhatsApp
Our Picks
2026 America Innovates | Responsible For All Our Digital Maps, Jack Dangermond Loves The Word ‘Where’

2026 America Innovates | Responsible For All Our Digital Maps, Jack Dangermond Loves The Word ‘Where’

22 May 2026
Indeed chief economist says we’re entering an era of ‘great mismatch’

Indeed chief economist says we’re entering an era of ‘great mismatch’

22 May 2026
The Post-‘The Boys’ Finale ‘Vought Rising’ Trailer Is Here, And Quite Good

The Post-‘The Boys’ Finale ‘Vought Rising’ Trailer Is Here, And Quite Good

22 May 2026
Most Popular
‘You kind of ruined it with your trans obsession’: House points fingers as Smithsonian Women’s museum funding fails

‘You kind of ruined it with your trans obsession’: House points fingers as Smithsonian Women’s museum funding fails

22 May 20260 Views
Current price of oil as of May 22, 2026

Current price of oil as of May 22, 2026

22 May 20261 Views
6 Teachable Moments From An Atlanta Rush Hour Downpour

6 Teachable Moments From An Atlanta Rush Hour Downpour

22 May 20261 Views

Archives

  • May 2026
  • April 2026
  • March 2026
  • February 2026
  • January 2026
  • December 2025
  • November 2025
  • October 2025
  • September 2025
  • August 2025
  • July 2025
  • June 2025
  • May 2025
  • April 2025
  • March 2025
  • February 2025
  • January 2025
  • December 2024
  • November 2024
  • October 2024
  • September 2024
  • August 2024
  • July 2024
  • June 2024
  • May 2024
  • April 2024
  • March 2024
  • February 2024
  • January 2024
  • December 2023
  • March 2022
  • January 2021
  • March 2020
  • January 2020

Categories

  • Blog
  • Business
  • Entrepreneurs
  • Global
  • Innovation
  • Leadership
  • Living
  • Money & Finance
  • News
  • Press Release
© 2026 Alpha Leaders. All Rights Reserved.
  • Privacy Policy
  • Terms of use
  • Advertise
  • Contact

Type above and press Enter to search. Press Esc to cancel.