AI’s Next Bottleneck: Not the Algorithm, but the Hardware
- Dan Sebastianelli

- May 25
- 5 min read

Artificial intelligence is often discussed as if it lives in the cloud — weightless, invisible, and infinitely scalable. In reality, AI is becoming one of the most physical industries in the world.
Behind every chatbot, coding assistant, image generator, enterprise agent, and recommendation engine sits a massive stack of hardware: CPUs, GPUs, memory chips, networking equipment, cooling systems, data centers, substations, transformers, and power plants.
The next phase of AI competition may not be decided only by who has the best model. It may be decided by who can secure enough compute, enough data center capacity, and enough electricity.
From Software Race to Infrastructure Race
The first wave of generative AI was about models. OpenAI, Anthropic, Google, Meta, xAI, DeepSeek, and others competed over reasoning, speed, cost, and capabilities.
Now the race is shifting. The question is no longer simply: “Who has the smartest AI?”
It is also:
Who can get the chips?
Who can build the data centers?
Who can cool them?
Who can power them?
Who can do all of that faster than competitors?
NVIDIA’s own language makes the point clearly. In its fiscal Q3 2026 results, the company reported record data center revenue of $51.2 billion and CEO Jensen Huang said, “Blackwell sales are off the charts, and cloud GPUs are sold out.”
That statement captures the current AI economy in one sentence: demand is not theoretical. It is colliding with supply.
The GPU Shortage Was Only the Beginning
The most visible bottleneck has been the GPU. Large AI models need massive parallel computing power, and GPUs remain the core engine for both training and increasingly for inference.
But GPUs are not standalone products. They depend on advanced packaging, high-bandwidth memory, specialized substrates, networking equipment, cooling, and power delivery. If any one of those inputs becomes constrained, the whole system slows down.
Memory is now a major stress point. Reuters reported that SK Hynix said all its chips were sold out for 2026, while Samsung had secured customers for its 2026 HBM output. Reuters also reported that major cloud and AI companies were asking suppliers for as much memory as they could deliver, regardless of price.
This matters because AI is no longer just consuming “specialized” chips. It is beginning to absorb capacity across the broader semiconductor supply chain.

CPUs Are Back in the Conversation
For much of the AI boom, investors and executives focused on GPUs. That made sense. GPUs were the scarce asset.
But CPUs are becoming important again.
As AI shifts from training large models to running millions or billions of daily inference tasks, the surrounding compute architecture changes. Agentic AI systems — tools that plan, execute, retrieve data, call software, and perform autonomous workflows — require more than raw GPU horsepower.
They need orchestration. They need memory management. They need networking. They need server CPUs.
AMD CEO Lisa Su recently said the global CPU market is “tight,” driven by stronger-than-expected demand from AI inferencing and agentic AI. AMD is now working with Taiwanese partners to ramp CPU production capacity through 2026 and beyond.
This is an important shift. AI demand is broadening from the GPU layer into the entire data center stack.
The Data Center Becomes the New Factory
The modern AI data center increasingly resembles a factory. Instead of producing cars, steel, chemicals, or electronics, it produces intelligence at scale.
And like factories, AI data centers require land, capital, equipment, utilities, cooling, grid connections, and long-term planning.
The International Energy Agency estimates that data centers consumed about 415 terawatt-hours of electricity in 2024, roughly 1.5% of global electricity consumption. By 2030, the IEA projects that number will more than double to around 945 terawatt-hours, with AI as the most important driver of growth.
That is a striking number. It means the AI boom is no longer just a technology story. It is an energy, infrastructure, and industrial planning story.
McKinsey has described the scale of the buildout in even more physical terms: global data center supply could grow from roughly 70 gigawatts today to about 220 gigawatts within five years, with around 75% of that growth driven by AI.
This is why power availability is becoming a site-selection issue. Data centers are moving to where electricity is available, not simply where fiber networks or tax incentives are attractive.

The next AI bottleneck is not just building more data centers — it is getting them online. Sightline Climate estimates that 30% to 50% of the 2026 data center pipeline may not come online before year-end, even though at least 16 GW of capacity is scheduled across roughly 140 large projects.
Only about 5 GW is already under construction, leaving roughly 11 GW still in the announced stage with no visible construction progress. That gap matters because AI infrastructure is now colliding with the slower realities of power availability, transformers, switchgear, skilled labor, permitting, and local opposition. In other words, the market may be announcing capacity faster than the physical system can deliver it.

Power Is the Ultimate Bottleneck
Chips can be ordered. Buildings can be financed. But electricity infrastructure is harder to move quickly.
Transmission lines, transformers, substations, grid interconnections, and new generation capacity all have long development timelines. The IEA estimates that about 20% of planned data center projects could face delays if grid risks are not addressed. It also notes that building new transmission lines in advanced economies can take four to eight years, while wait times for critical grid components such as transformers and cables have doubled over the past three years.
This is where the AI story becomes much bigger than Silicon Valley.
As AI demand keeps accelerating, utilities, regulators, real estate developers, chipmakers, hyperscalers, and local communities will all become part of the same supply chain.
A shortage of GPUs is inconvenient.
A shortage of power is existential.

Why This Matters for Business
For companies adopting AI, the key lesson is simple: do not assume compute will always be cheap, available, or instantaneous.
Over time, AI models will likely become more efficient. DeepSeek and other model innovations already show that better software can reduce cost. But history suggests efficiency can increase demand rather than reduce it. When something becomes cheaper and more useful, people use more of it.
That means the winners may not simply be the companies with the best AI tools. The winners may be the companies that plan around constraints:
Businesses should think about AI cost, compute access, latency, and data strategy as part of operational planning — not just innovation planning.
Technology vendors should secure resilient supply chains, not just strong models.
Utilities and energy providers should view AI data centers as one of the most important new sources of electricity demand growth.
Investors should watch not only AI software companies, but also the “picks and shovels”: chips, memory, networking, cooling, power equipment, grid infrastructure, and energy generation.

Final Word
The AI boom is real. But it is not floating above the economy. It is landing directly on the physical world.
The next bottlenecks will not be abstract. They will be CPUs, GPUs, HBM memory, data center shells, cooling systems, transformers, substations, and megawatts.
In the first phase of AI, the question was: who has the best model?
In the next phase, the question may be: who has the capacity to run it?
FSG believes this is the key strategic point. AI is no longer just a software revolution. It is an infrastructure race — and infrastructure races are won by those who secure supply early, manage constraints intelligently, and understand that every digital breakthrough eventually depends on physical capacity.
We leave you with this interesting comparison of California vs Texas inspired by
a John Bistline chart.
Forecast data by FSG.





Comments