Alpha Leaders

New Model Reasoning: An Engineer’s Take

By Press Room | 24 January 2025 | 5 Mins Read

The models are coming out fast and furious. It seems like every time we turn around, there are new forms of LLM operations and AI engines to make sense of.

But what do these changes actually do in the industry?

I came across an X post, ostensibly from Dr. Tim Scarfe of Machine Learning Street Talk, in which someone with evident hands-on experience of these technologies explains why the o1-pro model is a breakthrough.

Essentially, Scarfe says, the new model changes the iterative process through which engineers prompt LLMs to perform complex tasks.

“The biggest apparent change with o1-pro is the complexity it can handle in a ‘single shot,’” he writes. “Previously, LLMs could only do ‘so much work’ in a single forward pass, and there were weird restrictions we had to subconsciously internalize due to the self-attention linearization hacks, i.e. you could only ever ask LLMs to address and do work inside an amorphous limited subspace of the context.”

He also points out that the traditional process isn’t actually a ‘single shot,’ but that a parallelized search tree process is in play.

A Postage Stamp of Attention

Scarfe also uses a postage stamp analogy to describe the constrained capability of last-generation attention mechanisms.

“Imagine you had a world map,” he writes, “and in every forward pass of an LLM you could only perform a ‘postage stamps worth’ of computation, and you decided as a prompter where to place the postage stamp on the map. That’s pretty much how LLMs worked before o-series. So we as engineers designed ways to place more postage stamps, or subdivide the map and aggregate the results into something coherent.”

He explains how engineering teams tried to get around these limitations with multi-agent collaboration and other techniques.
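To make that workaround concrete, here is a minimal Python sketch of the "subdivide the map and aggregate" pattern Scarfe describes. It is only an illustration under my own assumptions: the function names (`split_into_chunks`, `map_then_aggregate`), the word-count chunking, and the prompt wording are hypothetical and do not come from his post, and `ask_model` stands in for whatever LLM call you would actually use.

```python
from typing import Callable, List


def split_into_chunks(text: str, chunk_size: int) -> List[str]:
    """Subdivide the 'map' into postage-stamp-sized pieces (by word count)."""
    words = text.split()
    return [
        " ".join(words[i:i + chunk_size])
        for i in range(0, len(words), chunk_size)
    ]


def map_then_aggregate(
    document: str,
    question: str,
    ask_model: Callable[[str], str],  # hypothetical stand-in for an LLM call
    chunk_size: int = 200,
) -> str:
    """Place one 'stamp' per chunk, then merge the partial answers."""
    chunks = split_into_chunks(document, chunk_size)

    # Map step: one model call per chunk, each seeing only its own slice.
    partial_answers = [
        ask_model(f"Question: {question}\nContext: {chunk}")
        for chunk in chunks
    ]

    # Aggregate step: a final call that only sees the partial answers.
    combined = "\n".join(f"- {answer}" for answer in partial_answers)
    return ask_model(
        f"Question: {question}\nCombine these partial answers:\n{combined}"
    )


def fake_model(prompt: str) -> str:
    """Toy model for demonstration: reports how many words it was shown."""
    return f"{len(prompt.split())} words seen"


if __name__ == "__main__":
    text = "lorem ipsum " * 300  # 600 words, so three chunks of 200
    print(map_then_aggregate(text, "What is this about?", fake_model))
```

The point of the sketch is the shape of the work: the engineer decides where each stamp goes and how the results are stitched together. That is the manual orchestration Scarfe says o1-pro reduces.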

“o1-pro now automates this for us with less need for prompt hacking (and/or) engineering from us,” he adds.

He also refers to transformers as “finite state automata,” saying they’re extremely limited, again, in the types of computation they can do in a single forward pass.

Notwithstanding the semantics of automata, that makes sense. (Strictly speaking, ChatGPT has this to say: “(Transformers are) a continuous and parameterized computational framework and thus are outside the classic, discrete automata model.”)

There’s a certain subjectivity there; I just thought that was interesting. Anyway, those who are discovering these model capabilities (and using them) are helping the AI systems to organize their resources in different ways to become more capable and more versatile.

What’s the Difference?

Scarfe also describes the difference the new model makes for users this way: “more verbosity, more diversity and less banality.”

And, at the end of the day, more accuracy.

Let’s look at these criteria in a bit more detail.

Verbosity has to do with the ways that the models speak to us and answer our questions. You can frame it this way: is the LLM a Shakespeare or a kindergartner? As for diversity, when the model can search better at inference, it can deliver wider-ranging results. And banality – well, that has a little bit to do with the uncanny valley. I’ve written about how early LLM results were “simple,” “generic,” in a word, yes, “banal.” In other words, it’s the nuance and complexity of the result that passes a deeper Turing test.

And in terms of accuracy:

“(The new model is) now spreading out 1000 postage stamps on the map, capturing exactly the information which matches and answers my prompt,” Scarfe writes. “The difference is night and day.”

Deep Thoughts from Francois Chollet

At the end of the post, Scarfe references Francois Chollet, a renowned voice in AI research who left Google to work on the ARC Prize. I’ve covered his work in prior posts: the benchmark asks an AI engine to solve pattern recognition problems that humans can handle without too much trouble.

Over on Chollet’s own X feed, you can see that he is optimistic about the progress recent models have made on the ARC problem.

“Today OpenAI announced o3, its next-gen reasoning model,” Chollet wrote Dec. 20. “We’ve worked with OpenAI to test it on ARC-AGI, and we believe it represents a significant breakthrough in getting AI to adapt to novel tasks. It scores 75.7% on the semi-private eval in low-compute mode (for $20 per task in compute) and 87.5% in high-compute mode (thousands of $ per task). It’s very expensive, but it’s not just brute — these capabilities are new territory and they demand serious scientific attention.”

Here are some other interesting statements that Chollet has made lately about the state of the AI industry.

“Computing used to feel fast — everything ran locally, software was mostly in C/C++ and was kept in check by the need to run on all kinds of old hardware. Now any one of my Chrome tabs is using 100x more RAM than a NeXT workstation had in total.” – Sept. 3, 2024

“The current climate in AI has so many parallels to 2021 web3 it’s making me uncomfortable. Narratives based on zero data are accepted as self-evident. Everyone is expecting as a sure thing ‘civilization-altering’ impact … in the next 2-3 years.” – Jan. 8, 2023

And here’s one with great relevance to the markets:

“Software is this weird space where you can spend basically nothing and create a billion dollars of value, or spend a billion dollars and create basically no value.” – Feb. 1, 2022

In Conclusion

Here is some of what I find relevant to today’s engineering world, as we discover new model capabilities. I say discover, not build, because the systems themselves are endowed with capabilities that amaze humans. Watch this space for more on what new models will do in the future.
