Models & Foundation Labs

The large language models that power most AI products today are produced by a small number of organisations running some of the most expensive training runs ever attempted. This layer covers the frontier closed labs (OpenAI, Anthropic, Google DeepMind), the open-weight releases that challenge them (Meta's Llama, Mistral), specialist fine-tuning providers, training-data suppliers, and the growing evaluation and safety ecosystem that regulators in the US and EU are beginning to mandate. Capability here is what ultimately determines the value of every layer below.

THE TAKE

Frontier closed labs hold the capability lead; open-weight models are closing the gap and undercutting price. The investable edges sit to the side: licensed training data, plus the evaluation, red-teaming and safety-assurance market that EU and US regulators are now mandating.

Signals

Anthropic holds ~40% of enterprise LLM API spend in 2025, up from ~24% in 2024, ahead of OpenAI at ~27% (Menlo Ventures 2025).
Enterprise GenAI spend tripled to ~$37B in 2025 from ~$11.5B in 2024 (Menlo Ventures 2025).
Anthropic raised at a ~$380B valuation in early 2026, then moved toward an IPO near ~$965B (company announcements 2026).
OpenAI exited 2025 near ~$20B ARR, reaching ~$25B annualized by early 2026 (company / press).

The investment angle

The edge is not picking a winner among frontier labs. It is owning the licensed data, evaluation infrastructure and safety tooling every lab must buy regardless of model-share shifts.

Dominant playerOpenAI / Anthropic / Google DeepMind

ConcentrationAnthropic ~40%, OpenAI ~27%, Google ~21% enterprise spend (Menlo Ventures, 2025)

Key metricFrontier closed labs are the capability chokepoint

GeographyUnited States

Inside this layer, node by node

The atlas data behind this layer: 21 nodes, 1 of them chokepoints. Every node links back into the network map; market figures carry their source.

Frontier closed-model developersL7.1chokepointoligopolyscaling6 companies

Labs developing state-of-the-art proprietary models without releasing weights. API access gates the most capable reasoning, multimodal and agentic functions. OpenAI, Anthropic, Google DeepMind and xAI set market terms; their pricing and rate limits constrain what applications can afford.

Open-weight model developersL7.2fragmentedscaling5 companies

Organisations that train and publicly release model weights, enabling self-hosting, fine-tuning and redistribution. They erode pricing power of closed-model providers and complicate US export controls on AI. Value capture is indirect—structural disruption rather than direct margins.

$21B market · 202534.1% CAGRsource ↗

↑ depends onMLOps and training/serving toolchain Cloud compute platforms AI accelerators and silicon Model training data inputs

↓ feeds intoFine-tuning, model hosting and inference-API providers (MaaS)Fine-tuning services Serverless inference API Dedicated / reserved inference Synthetic data generation Model evaluation, safety and alignment services Evaluation and benchmarking Red-teaming and safety testing Alignment and interpretability research organisations

Fine-tuning, model hosting and inference-API providers (MaaS)L7.3fragmentedscaling10 companies

Providers that host third-party models, offer fine-tuning, and serve via API without developing the models themselves. Inference prices fell roughly 90% from 2022 to 2025, forcing competition on latency, hardware efficiency, and enterprise SLAs. Survivors win on operational execution, not model ownership.

$7.9B market · 202422.6% CAGRsource ↗

↑ depends onOpen-weight model developers Inference serving, engines and runtimes Frontier closed-model developers Cloud compute platforms

↓ feeds intoApplications and inference (demand side)AI-native horizontal platforms Vertical / industry AI Healthcare and life sciences AI Industrial AI and operations optimisation Sales and marketing AI AI-native BPO and managed services Legal and compliance AI Financial services AI Defence, intelligence and national security AI Software development and code AI Customer service and contact-centre AI Creative, media and design AI Scientific research and engineering AI +3 more in the atlas →

Fine-tuning services3 Serverless inference API6 Dedicated / reserved inference1

Model training data inputsL7.4fragmentedscaling18 companies

Curated corpora, human annotation pipelines, and synthetic data for training and fine-tuning models. Public web data is saturating; proprietary and synthetic sources increasingly determine model quality. Data owners and synthetic-generation platforms are capturing emerging value.

$2.8B market · 202427.7% CAGRsource ↗

↑ depends onCurated and licensed training corpora Data labelling and annotation Synthetic data generation

↓ feeds intoFrontier closed-model developers Open-weight model developers Modality- and domain-specialist model developers Image and video generation models Speech and audio models Biology and scientific models Robotics and world models Code and software models

Curated and licensed training corpora6 Data labelling and annotation7 Synthetic data generation5

Modality- and domain-specialist model developersL7.5fragmentedemerging23 companies

Labs building models specialised by modality or scientific domain, not generalist frontier labs. Defensible where proprietary domain data, simulation environments, or expert feedback loops create moats. Biology and robotics are prime examples.

↑ depends onFrontier closed-model developers Model training data inputs MLOps and training/serving toolchain

Image and video generation models5 Speech and audio models5 Biology and scientific models5 Robotics and world models5 Code and software models3

Model evaluation, safety and alignment servicesL7.6fragmentedemerging6 companies

Services that test models for capability, failure modes, and adherence to human intent before deployment. Regulatory mandates in the EU and elsewhere are making this compulsory. Value flows to accredited testing firms and consultancies with government or enterprise contracts.

↑ depends onFrontier closed-model developers Open-weight model developers

Evaluation and benchmarking3 Red-teaming and safety testing1 Alignment and interpretability research organisations2

Companies we track

Alphabet (Google DeepMind)

Gemini frontier lab; ~21% enterprise spend

GOOGL · US

Meta AI

Llama; most-deployed open-weight model

META · US

Anthropic

#1 enterprise LLM share; ~$30B ARR run-rate

Private · US

OpenAI

ChatGPT; consumer AI leader

Private · US

xAI

Grok frontier model

Private · US

Mistral AI

European open-weight champion

Private · FR

Supply chain

Raw inputs

Training computeUS

clusters from layers below

Scale AI / SurgeUS

data & labelling

→

Key suppliers

AnthropicUS

~40% enterprise LLM spend

OpenAIUS

~27% enterprise LLM spend

Google DeepMindUS

~21% enterprise LLM spend

Meta AIUS

open-weight counterweight

→

Buyers

Cloud model marketplacesUS

Bedrock, Azure, Vertex

Application buildersGlobal

demand side