L7
Models & Foundation Labs
The large language models that power most AI products today are produced by a small number of organisations running some of the most expensive training runs ever attempted. This layer covers the frontier closed labs (OpenAI, Anthropic, Google DeepMind), the open-weight releases that challenge them (Meta's Llama, Mistral), specialist fine-tuning providers, training-data suppliers, and the growing evaluation and safety ecosystem that regulators in the US and EU are beginning to mandate. Capability here is what ultimately determines the value of every layer below.
THE TAKE
Frontier closed labs hold the capability lead; open-weight models are closing the gap and undercutting price. The investable edges sit to the side: licensed training data, plus the evaluation, red-teaming and safety-assurance market that EU and US regulators are now mandating.
Signals
- Anthropic holds ~40% of enterprise LLM API spend in 2025, up from ~24% in 2024, ahead of OpenAI at ~27% (Menlo Ventures 2025).
- Enterprise GenAI spend tripled to ~$37B in 2025 from ~$11.5B in 2024 (Menlo Ventures 2025).
- Anthropic raised at a ~$380B valuation in early 2026, then moved toward an IPO near ~$965B (company announcements 2026).
- OpenAI exited 2025 near ~$20B ARR, reaching ~$25B annualized by early 2026 (company / press).
The investment angle
The edge is not picking a winner among frontier labs. It is owning the licensed data, evaluation infrastructure and safety tooling every lab must buy regardless of model-share shifts.
Inside this layer, node by node
The atlas data behind this layer: 21 nodes, 1 of them chokepoints. Every node links back into the network map; market figures carry their source.
Labs developing state-of-the-art proprietary models without releasing weights. API access gates the most capable reasoning, multimodal and agentic functions. OpenAI, Anthropic, Google DeepMind and xAI set market terms; their pricing and rate limits constrain what applications can afford.
Organisations that train and publicly release model weights, enabling self-hosting, fine-tuning and redistribution. They erode pricing power of closed-model providers and complicate US export controls on AI. Value capture is indirect—structural disruption rather than direct margins.
Providers that host third-party models, offer fine-tuning, and serve via API without developing the models themselves. Inference prices fell roughly 90% from 2022 to 2025, forcing competition on latency, hardware efficiency, and enterprise SLAs. Survivors win on operational execution, not model ownership.
Curated corpora, human annotation pipelines, and synthetic data for training and fine-tuning models. Public web data is saturating; proprietary and synthetic sources increasingly determine model quality. Data owners and synthetic-generation platforms are capturing emerging value.
Labs building models specialised by modality or scientific domain, not generalist frontier labs. Defensible where proprietary domain data, simulation environments, or expert feedback loops create moats. Biology and robotics are prime examples.
Services that test models for capability, failure modes, and adherence to human intent before deployment. Regulatory mandates in the EU and elsewhere are making this compulsory. Value flows to accredited testing firms and consultancies with government or enterprise contracts.
Companies we track
Supply chain
Raw inputs
Key suppliers
Buyers