Corvus
Landscape Topic · c23abb1f

Semiconductor and AI accelerator market

The global market for designing, fabricating, and selling semiconductor devices — with particular emphasis on AI accelerators (GPUs, NPUs/TPUs, custom ASICs, and supporting HBM memory and EUV lithography) used to train and serve machine-learning workloads.

In scope: merchant data-center AI accelerators, hyperscaler in-house ASICs (TPU, Trainium, Maia, MTIA), foundry production at leading-edge nodes (3nm, 2nm), HBM memory, EUV/High-NA lithography, advanced packaging (CoWoS, SoIC), and the U.S./EU/Asia policy regime (CHIPS Act, BIS export controls). Out of scope: consumer/IoT MCUs, automotive analog, discrete power devices, and edge-only inference SoCs except where they intersect AI workloads.

Completed
2026-06-16 20:55 UTC

Bottom Line Up Front

The semiconductor industry hit a record $791.7B in 2025 (SIA, +25.6% YoY) and is tracking toward ~$1T in 2026, with AI accelerators driving most of the growth and NVIDIA capturing an estimated 80–92% of the data-center AI accelerator and discrete-GPU market. The supply chain is brittle: TSMC fabricates ~70% of the world's foundry output and is the sole leading-edge supplier for NVIDIA, AMD, Apple, and Broadcom; ASML is the sole supplier of EUV/High-NA lithography; SK Hynix and Samsung supply the HBM memory that gates accelerator output. Constraints — 3nm/2nm wafer and CoWoS packaging shortages, U.S. export controls (BIS, Oct-2023 update), and concentrated geopolitical risk in Taiwan — likely persist through 2027.

§ 01

What it is

The semiconductor industry is the aggregate of firms designing and fabricating semiconductor devices (transistors, integrated circuits, memory, analog/RF, sensors); the AI-accelerator subsegment is the class of specialized hardware accelerators — GPUs, neural processing units (NPUs / AI accelerators / deep-learning processors), TPUs, and custom ASICs — built to accelerate artificial-intelligence and machine-learning workloads, especially large neural networks and computer vision. GPUs originated for image processing but became the de facto AI compute substrate because their linear-algebra acceleration suits neural-network training and inference. The boundary of the 'AI accelerator market' draws in not only the merchant compute silicon (NVIDIA Blackwell, AMD Instinct, Intel Gaudi, Cerebras WSE, Groq LPU) and hyperscaler in-house ASICs (Google TPU, AWS Trainium, Microsoft Maia, Meta MTIA, Broadcom-designed silicon), but also the upstream value chain that produces them: the leading-edge foundry (TSMC, Samsung, Intel Foundry), the lithography equipment supplier (ASML), the HBM memory producers (SK Hynix, Samsung, Micron), and the advanced-packaging capacity (TSMC CoWoS, SoIC) that integrates compute and memory dies.

§ 02

Who operates in it

The market has three concentric rings. The center is the merchant AI accelerator vendors: NVIDIA dominates with an estimated 86% of AI data-center revenue (late 2025) and ~92% of the discrete GPU market in Q1 2025, on the strength of CUDA and the Blackwell GB200 generation. AMD is the principal merchant challenger with the Instinct MI300X / MI325X / MI355X line. Intel competes through Gaudi accelerators and Intel Foundry Services. Specialist challengers — Cerebras (wafer-scale WSE-3), Groq (LPU inference ASIC) and others — chip away at specific workload niches. The second ring is the hyperscaler/in-house ASIC programs: Alphabet (TPU, since 2015 internally / 2018 externally), AWS (Trainium/Inferentia), Microsoft (Maia), Meta (MTIA) — most fabricated as custom ASICs partnered with Broadcom or Marvell. The outer ring is the upstream supply chain that no accelerator ships without: TSMC (foundry, ~70% global share, sole leading-edge supplier to NVIDIA, Apple, Broadcom, Qualcomm), Samsung and Intel (foundry challengers), ASML (sole EUV/High-NA lithography supplier), and the 'Big Three' memory makers SK Hynix, Samsung, and Micron supplying HBM. Standards bodies (JEDEC for HBM) and regulators (U.S. BIS, EU/Dutch government on EUV exports) sit alongside the firms.

§ 03

How it works

The value chain runs design → wafer fabrication → packaging → systems → cloud. Fabless designers (NVIDIA, AMD, Apple, Broadcom, Qualcomm) develop the chip; they license CPU/GPU IP and EDA tools (Synopsys, Cadence) and tape out to TSMC (and increasingly Samsung Foundry, Intel Foundry). Wafers are patterned using ASML lithography — EUV at leading nodes, with High-NA EUV ramping for sub-2nm. Wafers are then diced and packaged; for AI accelerators this means TSMC's CoWoS 2.5D packaging that stacks compute logic dies next to multiple HBM3e/HBM4 stacks supplied by SK Hynix and Samsung. The packaged accelerator is integrated into server systems (Supermicro, Dell, HPE, Foxconn, custom hyperscaler designs) and racked into data centers. The end customers are hyperscaler cloud operators (Microsoft Azure, AWS, Google Cloud, Meta, Oracle, CoreWeave) and a growing tier of 'neoclouds'. Each step is a potential bottleneck, but at present the binding constraints are CoWoS advanced packaging and HBM supply, not raw wafers. Hyperscalers vertically integrate by designing in-house ASICs with Broadcom or Marvell, fabricated at TSMC — a hedge against NVIDIA pricing and a way to optimize total cost of ownership for their internal workloads.

§ 04

Why it exists

Four reinforcing forces drive the market. Demand: the post-ChatGPT generative-AI build-out, with hyperscalers signaling >$1T in cumulative AI infrastructure capex over the forecast horizon, anchors the demand curve. Software lock-in: NVIDIA's CUDA — released in 2007, a decade ahead of any merchant competitor — created a software moat that makes existing AI workloads costly to port, sustaining NVIDIA's pricing power even as challengers' hardware closes the gap. Physics-and-capital: leading-edge fabrication (3nm, 2nm) and advanced packaging (CoWoS, SoIC) require capex on a scale (TSMC alone is committing ~$56B in 2026) that only 2–3 firms in the world can sustain, creating natural-monopoly economics at the top of the value chain. Policy: the CHIPS Act subsidizes domestic capacity in the U.S. while BIS export controls bifurcate the global market by chip tier and country tier, intentionally constraining adversary access to frontier compute. These forces collectively concentrate value capture in a small number of geographically specific firms.

§ 05

When — the chronology

The industry traces to the 1948 Bell Labs invention of the transistor and the 1958 independent invention of the integrated circuit by Kilby (TI) and Noyce (Fairchild); Moore's-law scaling drove six decades of capacity growth. The AI-accelerator subsegment is far younger: GPUs evolved from graphics-only into general-purpose compute when NVIDIA released CUDA in 2007. Google's TPU (internal 2015, third-party 2018) proved hyperscaler in-house ASICs were viable. The current cycle began with the late-2022 inflection in generative-AI demand following ChatGPT, which pulled NVIDIA H100 demand sharply forward, then accelerated with the March 2024 Blackwell (GB200) announcement. Policy inflections — the U.S. CHIPS and Science Act (signed 9 Aug 2022, $52.7B appropriated) and successive BIS export-control updates (Oct 2022, Oct 2023, and case-by-case revisions in 2026) — reshaped the geography of capacity. The present moment (mid-2026) is a record run: SIA reported $791.7B in 2025 sales (+25.6% YoY) and the industry is tracking toward ~$1T in 2026; NVIDIA Q1 FY2027 revenue hit $81.6B (+85% YoY). Pure-play foundries grew 30% YoY in Q1 2026, with growth concentrated in 3nm/2nm and advanced packaging — the very capacity that is the binding bottleneck.

§ 06

Where

Geographic concentration is the defining structural fact. Silicon Valley (Santa Clara, California) hosts the headquarters of NVIDIA, AMD, and Intel — the merchant AI silicon design center. Hsinchu Science Park, Taiwan, hosts TSMC's headquarters and the bulk of its leading-edge fab capacity; this is the single point of failure of the global AI compute supply chain. Veldhoven, the Netherlands, hosts ASML — the sole EUV/High-NA lithography vendor, the choke point above TSMC. Yeongtong District in Suwon, South Korea hosts Samsung Electronics and Icheon hosts SK Hynix — the two Korean firms that produce the HBM memory NVIDIA Blackwell and AMD Instinct require. The CHIPS Act is reshoring some advanced production to Arizona (TSMC), Ohio (Intel) and elsewhere, but leading-edge logic plus HBM plus EUV remain heavily concentrated in Taiwan, South Korea, and the Netherlands, with no near-term substitute. China is the largest semiconductor buyer (and a major fabricator at trailing nodes) but is excluded from leading-edge AI chips and SME by U.S./allied controls.

§ 07

Players

14 in the space
§ 07b

Chronology

17 events
  1. 1948-12-23 Transistor invented at Bell Labs by Shockley, Brattain, and Bardeen — origin of the semiconductor industry.
  2. 1958-09-12 Integrated circuit independently invented by Jack Kilby (Texas Instruments) and Robert Noyce (Fairchild).
  3. 1969-05-01 Advanced Micro Devices (AMD) incorporated.
  4. 1987-02-21 Taiwan Semiconductor Manufacturing Company (TSMC) founded, pioneering the pure-play foundry business model.
  5. 1993-04-05 Nvidia Corporation founded by Jensen Huang, Chris Malachowsky, and Curtis Priem.
  6. 1994-10-02 ASML Holding N.V. incorporated in the Netherlands.
  7. 2007-06-23 Nvidia officially releases CUDA, establishing a proprietary parallel-computing API that becomes the AI software moat.
  8. 2015-01-01 Google begins using Tensor Processing Units (TPUs) internally, validating purpose-built accelerator ASICs at hyperscaler scale.
  9. 2018-01-01 Google makes Cloud TPUs available to third-party customers; Broadcom Inc. reincorporates in Delaware (current entity).
  10. 2022-08-09 U.S. CHIPS and Science Act signed into law (P.L. 117-167): $280B authorized, $52.7B appropriated for domestic semiconductor manufacturing and R&D.
  11. 2022-10-07 U.S. Bureau of Industry and Security (BIS) issues initial advanced computing and SME export controls targeting China.
  12. 2023-10-17 BIS major update to advanced computing / semiconductor export controls; A800 and H800 (China-specific) chips controlled, expanding restrictions to a broader chip set and SME.
  13. 2024-03-18 Nvidia announces the Blackwell platform (B200, GB200 NVL72), claiming up to 30x LLM inference vs H100 — the H100 successor generation.
  14. 2025-06-12 AMD unveils Instinct MI350X and MI355X at Advancing AI 2025; claims up to 4x generational gain and 35x inference uplift, positioning as the principal merchant alternative to Blackwell.
  15. 2026-02-06 SIA announces global semiconductor sales hit $791.7 billion in 2025, +25.6% YoY versus $630.5B in 2024 — a record.
  16. 2026-04-30 TrendForce confirms 3nm/2nm wafer and 2.5D/3D advanced-packaging (CoWoS) capacity bottlenecks persisting through 2026–2027, gating AI accelerator output.
  17. 2026-05-20 Nvidia reports Q1 FY2027 revenue of $81.6B (+85% YoY), Q2 guide above $87B — continued AI accelerator demand acceleration.
§ 08

Market

The merchant AI accelerator market is highly concentrated at the top — NVIDIA's estimated 80–92% share of data-center AI accelerator and discrete GPU revenue gives it natural-monopoly economics, with margins to match. AMD and Intel split the merchant remainder, with hyperscaler in-house ASICs taking a growing minority of total AI compute spend. The upstream layers are even more concentrated: TSMC at ~70% global foundry share is effectively unrivaled at the leading edge; ASML is a true monopoly in EUV. Memory is a credible oligopoly (SK Hynix, Samsung, Micron). The total semiconductor market grew 25.6% YoY to $791.7B in 2025 and is forecast around $1T for 2026; the data-center accelerator subsegment within that is growing materially faster. The dynamics are AI-led upswing meeting structural bottlenecks: demand outruns CoWoS and HBM capacity, sustaining premium pricing for incumbents while challengers race to certify alternative stacks (ROCm, Triton, OpenAI's stack, vendor-portable inference frameworks).

Size
$791.7 billion global semiconductor sales in 2025 (+25.6% YoY); SIA / WSTS forecast ~$1 trillion in 2026. Data-center accelerator subsegment estimated at $170B in 2025 (MarketsandMarkets), forecast to ~$370B by 2030 at ~17% CAGR. NVIDIA's data-center segment alone tracking >$300B annualized at Q1 FY2027 run-rate.
Segments
Merchant data-center AI accelerators (NVIDIA, AMD, Intel) · Hyperscaler in-house AI ASICs (Google TPU, AWS Trainium/Inferentia, Microsoft Maia, Meta MTIA — typically Broadcom/Marvell-designed) · Specialist accelerators (Cerebras WSE, Groq LPU, SambaNova, Tenstorrent) · Edge / client NPUs (Apple Neural Engine, Qualcomm Hexagon, Intel/AMD CPU NPUs) · HBM memory (SK Hynix, Samsung, Micron) · Leading-edge foundry (TSMC, Samsung, Intel Foundry) · Lithography equipment (ASML monopoly at EUV) · Advanced packaging (TSMC CoWoS, SoIC; Samsung X-Cube; Intel Foveros)
Dynamics
AI-led upswing meets structural bottlenecks: demand exceeds CoWoS/HBM capacity through 2026–2027. Pricing premiums persist for NVIDIA Blackwell. Hyperscalers diversify with in-house ASICs but remain large NVIDIA buyers. U.S. export controls bifurcate the market by tier of country and tier of chip. CHIPS Act reshoring is real (TSMC Arizona) but does not relocate the bottleneck.
§ 09

Outlook

Moderate confidence

Global semiconductor sales are likely to clear $1 trillion in 2026, with AI accelerators and HBM the dominant growth drivers. NVIDIA is highly likely to retain a dominant (>70%) share of merchant data-center AI accelerators through 2026, supported by the CUDA software moat, priority TSMC CoWoS allocation, and the Blackwell-class generation. AMD's MI355X/MI400 family is likely to grow meaningfully in absolute terms while ceding overall share. Hyperscaler in-house ASIC programs are likely to capture a growing minority of total AI-compute spend (Google TPU, AWS Trainium, Microsoft Maia, Broadcom-designed silicon) but unlikely to displace NVIDIA at the top in the near term. TSMC's 3nm and CoWoS bottleneck is likely to persist through 2027; ASML High-NA EUV is unlikely to materially change the constraint picture before 2027. U.S./allied export controls are likely to continue tightening case-by-case rather than loosen, sustaining the Tier-1/2 versus Tier-3 bifurcation. Roughly even chance that a memory-cycle correction or hyperscaler-capex moderation produces a softer 2027 print after the 2025–26 surge. The most consequential downside risk — though low-probability over the forecast horizon — is a disruption to Taiwan operations, which would trigger another global chip shortage and immediately re-price the entire stack.

§ 10

Key Judgments

graded per ICD 203
KJ-01 High Confidence

NVIDIA is highly likely to retain a dominant share (>70%) of the data-center AI accelerator market through 2026, anchored by the CUDA software moat, Blackwell GB200/B300 generation, and priority TSMC CoWoS allocation.

KJ-02 High Confidence

TSMC's foundry monopoly at the leading edge (3nm fully committed, 2nm in HVM, CoWoS constrained through 2027) is the binding supply constraint for the entire AI-accelerator value chain, and a disruption to Taiwan operations would likely trigger another global chip shortage.

KJ-03 Moderate Confidence

Hyperscaler in-house ASIC programs (Google TPU, AWS Trainium, Microsoft Maia, Meta MTIA, Broadcom-designed silicon) are likely to capture a growing minority of merchant-AI-compute spending over the next 2–3 years, though the magnitude of NVIDIA displacement is uncertain.

KJ-04 Moderate Confidence

U.S. export controls on advanced computing chips and SME (BIS, October 2023 update and subsequent revisions) are likely to continue tightening case-by-case rather than loosen, sustaining the bifurcation between Tier 1/2 markets that receive frontier chips and Tier 3 markets that do not.

KJ-05 Moderate Confidence

Global semiconductor sales are likely to clear the $1 trillion milestone in 2026, with AI accelerators and HBM the dominant growth drivers; a roughly even chance that a memory-cycle correction or hyperscaler-capex moderation pulls the figure into a softer 2027.