Close Menu
Alpha Leaders
  • Home
  • News
  • Leadership
  • Entrepreneurs
  • Business
  • Living
  • Innovation
  • More
    • Money & Finance
    • Web Stories
    • Global
    • Press Release
What's On
Jack Dorsey and Roelof Botha think AI can make middle management obsolete 

Jack Dorsey and Roelof Botha think AI can make middle management obsolete 

3 April 2026
Markets rally hard on Iran’s promise to play nice at Hormuz

Markets rally hard on Iran’s promise to play nice at Hormuz

3 April 2026
Renewable energy transition could accelerate as Iran war shocks oil and gas supply

Renewable energy transition could accelerate as Iran war shocks oil and gas supply

3 April 2026
Facebook X (Twitter) Instagram
Facebook X (Twitter) Instagram
Alpha Leaders
newsletter
  • Home
  • News
  • Leadership
  • Entrepreneurs
  • Business
  • Living
  • Innovation
  • More
    • Money & Finance
    • Web Stories
    • Global
    • Press Release
Alpha Leaders
Home » Nvidia’s Groq bet shows that the economics of AI chip-building are still unsettled
News

Nvidia’s Groq bet shows that the economics of AI chip-building are still unsettled

Press RoomBy Press Room31 December 20254 Mins Read
Facebook Twitter Copy Link Pinterest LinkedIn Tumblr Email WhatsApp
Nvidia’s Groq bet shows that the economics of AI chip-building are still unsettled

Nvidia built its AI empire on GPUs. But its $20 billion bet on Groq suggests the company isn’t convinced GPUs alone will dominate the most important phase of AI yet: running models at scale, known as inference. 

The battle to win on AI inference, of course, is over its economics. Once a model is trained, every useful thing it does—answering a query, generating code, recommending a product, summarizing a document, powering a chatbot, or analyzing an image—happens during inference. That’s the moment AI goes from a sunk cost into a revenue-generating service, with all the accompanying pressure to reduce costs, shrink latency (how long you have to wait for an AI to answer), and improve efficiency.

That pressure is exactly why inference has become the industry’s next battleground for potential profits—and why Nvidia, in a deal announced just before the Christmas holiday, licensed technology from Groq, a startup building chips designed specifically for fast, low-latency AI inference, and hired most of its team, including founder and CEO Jonathan Ross.

Inference is AI’s ‘industrial revolution’

Nvidia CEO Jensen Huang has been explicit about the challenge of inference. While he says Nvidia is “excellent at every phase of AI,” he told analysts at the company’s Q3 earnings call in November that inference is “really, really hard.” Far from a simple case of one prompt in and one answer out, modern inference must support ongoing reasoning, millions of concurrent users, guaranteed low latency, and relentless cost constraints. And AI agents, which have to handle multiple steps, will dramatically increase inference demand and complexity—and raise the stakes of getting it wrong. 

“People think that inference is one shot, and therefore it’s easy. Anybody could approach the market that way,” Huang said. “But it turns out to be the hardest of all, because thinking, as it turns out, is quite hard.”

Nvidia’s support of Groq underscores that belief, and signals that even the company that dominates AI training is hedging on how inference economics will ultimately shake out. 

Huang has also been blunt about how central inference will become to AI’s growth. In a recent conversation on the BG2 podcast, Huang said inference already accounts for more than 40% of AI-related revenue—and predicted that it is “about to go up by a billion times.”

“That’s the part that most people haven’t completely internalized,” Huang said. “This is the industry we were talking about. This is the industrial revolution.”

The CEO’s confidence helps explain why Nvidia is willing to hedge aggressively on how inference will be delivered, even as the underlying economics remain unsettled.

Nvidia wants to corner the inference market

Nvidia is hedging its bets to make sure that they have their hands in all parts of the market, said Karl Freund, founder and principal analyst at Cambrian AI Research. “It’s a little bit like Meta acquiring Instagram,” he explained. “It’s not that they thought Facebook was bad, they just knew that there was an alternative that they wanted to make sure wasn’t competing with them.” 

That, even though Huang had made strong claims about the economics of the existing Nvidia platform for inference. “I suspect they found that it either wasn’t resonating as well with clients as they’d hoped, or perhaps they saw something in the chip-memory-based approach that Groq and another company called D-Matrix has,” said Freund, referring to another fast, low-latency AI chip startup backed by Microsoft that recently raised $275 million at a $2 billion valuation. 

Freund said Nvidia’s move into Groq could lift the entire category. “I’m sure D-Matrix is a pretty happy startup right now, because I suspect their next round will go at a much higher valuation thanks to the [Nvidia-Groq deal],” he said. 

Other industry executives say the economics of AI inference are shifting as AI moves beyond chatbots into real-time systems like robots, drones, and security tools. Those systems can’t afford the delays that come with sending data back and forth to the cloud, or the risk that computing power won’t always be available. Instead, they favor specialized chips like Groq’s over centralized clusters of GPUs. 

Behnam Bastani, founder and CEO of OpenInfer, which focuses on running AI inference close to where data is generated—such as on devices, sensors, or local servers rather than distant cloud data centers—said his startup is targeting these kinds of applications at the “edge.” 

The inference market, he emphasized, is still nascent. And Nvidia is looking to corner that market with its Groq deal. With inference economics still unsettled, he said Nvidia is trying to position itself as the company that spans the entire inference hardware stack, rather than betting on a single architecture.

“It positions Nvidia as a bigger umbrella,” he said. 

This story was originally featured on Fortune.com

Share. Facebook Twitter Pinterest LinkedIn Tumblr Email Copy Link

Related Articles

Jack Dorsey and Roelof Botha think AI can make middle management obsolete 

Jack Dorsey and Roelof Botha think AI can make middle management obsolete 

3 April 2026
Markets rally hard on Iran’s promise to play nice at Hormuz

Markets rally hard on Iran’s promise to play nice at Hormuz

3 April 2026
Renewable energy transition could accelerate as Iran war shocks oil and gas supply

Renewable energy transition could accelerate as Iran war shocks oil and gas supply

3 April 2026
Mercor, a  billion AI startup, confirms it was the victim of a major cybersecurity breach

Mercor, a $10 billion AI startup, confirms it was the victim of a major cybersecurity breach

3 April 2026
Paul Krugman smacks down Trump speech with argument that  gas is ‘less than half’ of the Hormuz hit. Here’s what he’s talking about

Paul Krugman smacks down Trump speech with argument that $4 gas is ‘less than half’ of the Hormuz hit. Here’s what he’s talking about

2 April 2026
Trump wants to add nearly T to the T national debt with military budget, watchdog warns

Trump wants to add nearly $7T to the $39T national debt with military budget, watchdog warns

2 April 2026
Don't Miss
Unwrap Christmas Sustainably: How To Handle Gifts You Don’t Want

Unwrap Christmas Sustainably: How To Handle Gifts You Don’t Want

By Press Room27 December 2024

Every year, millions of people unwrap Christmas gifts that they do not love, need, or…

Walmart dominated, while Target spiraled: the winners and losers of retail in 2024

Walmart dominated, while Target spiraled: the winners and losers of retail in 2024

30 December 2024
Moltbook is the talk of Silicon Valley. But the furor is eerily reminiscent of a 2017 Facebook research experiment

Moltbook is the talk of Silicon Valley. But the furor is eerily reminiscent of a 2017 Facebook research experiment

6 February 2026
Stay In Touch
  • Facebook
  • Twitter
  • Pinterest
  • Instagram
  • YouTube
  • Vimeo
Latest Articles
Paul Krugman smacks down Trump speech with argument that  gas is ‘less than half’ of the Hormuz hit. Here’s what he’s talking about

Paul Krugman smacks down Trump speech with argument that $4 gas is ‘less than half’ of the Hormuz hit. Here’s what he’s talking about

2 April 20260 Views
Trump wants to add nearly T to the T national debt with military budget, watchdog warns

Trump wants to add nearly $7T to the $39T national debt with military budget, watchdog warns

2 April 20260 Views
Apollo and FC Barcelona just proved legacy markets are losing their grip on business

Apollo and FC Barcelona just proved legacy markets are losing their grip on business

2 April 20261 Views
Why American billionaires are abandoning Wall Street for English soccer clubs

Why American billionaires are abandoning Wall Street for English soccer clubs

2 April 20261 Views

Recent Posts

  • Jack Dorsey and Roelof Botha think AI can make middle management obsolete 
  • Markets rally hard on Iran’s promise to play nice at Hormuz
  • Renewable energy transition could accelerate as Iran war shocks oil and gas supply
  • Mercor, a $10 billion AI startup, confirms it was the victim of a major cybersecurity breach
  • Paul Krugman smacks down Trump speech with argument that $4 gas is ‘less than half’ of the Hormuz hit. Here’s what he’s talking about

Recent Comments

No comments to show.
About Us
About Us

Alpha Leaders is your one-stop website for the latest Entrepreneurs and Leaders news and updates, follow us now to get the news that matters to you.

Facebook X (Twitter) Pinterest YouTube WhatsApp
Our Picks
Jack Dorsey and Roelof Botha think AI can make middle management obsolete 

Jack Dorsey and Roelof Botha think AI can make middle management obsolete 

3 April 2026
Markets rally hard on Iran’s promise to play nice at Hormuz

Markets rally hard on Iran’s promise to play nice at Hormuz

3 April 2026
Renewable energy transition could accelerate as Iran war shocks oil and gas supply

Renewable energy transition could accelerate as Iran war shocks oil and gas supply

3 April 2026
Most Popular
Mercor, a  billion AI startup, confirms it was the victim of a major cybersecurity breach

Mercor, a $10 billion AI startup, confirms it was the victim of a major cybersecurity breach

3 April 20263 Views
Paul Krugman smacks down Trump speech with argument that  gas is ‘less than half’ of the Hormuz hit. Here’s what he’s talking about

Paul Krugman smacks down Trump speech with argument that $4 gas is ‘less than half’ of the Hormuz hit. Here’s what he’s talking about

2 April 20260 Views
Trump wants to add nearly T to the T national debt with military budget, watchdog warns

Trump wants to add nearly $7T to the $39T national debt with military budget, watchdog warns

2 April 20260 Views

Archives

  • April 2026
  • March 2026
  • February 2026
  • January 2026
  • December 2025
  • November 2025
  • October 2025
  • September 2025
  • August 2025
  • July 2025
  • June 2025
  • May 2025
  • April 2025
  • March 2025
  • February 2025
  • January 2025
  • December 2024
  • November 2024
  • October 2024
  • September 2024
  • August 2024
  • July 2024
  • June 2024
  • May 2024
  • April 2024
  • March 2024
  • February 2024
  • January 2024
  • December 2023
  • March 2022
  • January 2021
  • March 2020
  • January 2020

Categories

  • Blog
  • Business
  • Entrepreneurs
  • Global
  • Innovation
  • Leadership
  • Living
  • Money & Finance
  • News
  • Press Release
© 2026 Alpha Leaders. All Rights Reserved.
  • Privacy Policy
  • Terms of use
  • Advertise
  • Contact

Type above and press Enter to search. Press Esc to cancel.