There’s no single canonical name. The most common labels in VC/strategy circles:
Term
Used By
Data Flywheel
Most pitch decks, a16z, Bessemer, SignalFire
Data Network Effects
NFX (though they argue most claims are overstated)
Data Moat / Data-as-a-Moat
Bessemer explicitly
Networked SaaS
SignalFire’s 2024-25 branded framework
Oil Wells
a16z (Aug 2025) — drill deep into one workflow, own the data
System of Record → System of Intelligence
Enterprise software circles
Data Gravity
Abraham Thomas (Pivotal)
In a pitch, I’d say “data flywheel” for the shorthand, “workflow-to-intelligence” for the descriptive label, and cite Bessemer’s vertical SaaS data lesson or a16z’s “oil wells” framing as the canonical VC references.
Companies by Industry
Tier 1 — Canonical Exemplars (most cited in VC discussions)
Global Threat Report + ML models trained on aggregate threat data
Recorded Future ($2.65B → Mastercard)
Threat intelligence platform
Aggregated open/dark web + technical sources via ML/NLP
Coalition ($3.5B val)
Cyber insurance + monitoring
”Active Data Graph” — proprietary risk data for underwriting
Insurance / Energy / Other Verticals
Company
Wedge
Intelligence
Guidewire (~$18B mkt cap)
Core insurance platform
Claims Intel — anonymized claims benchmarking across insurers
Enverus
Energy workflow SaaS
Benchmark data from 95%+ of US energy producers
FBN
Precision ag + seed purchasing
Anonymized seed performance and input pricing benchmarks
ServiceTitan (public)
Field service mgmt for trades
Titan Score — performance benchmarking across contractors
Gong ($7.25B val)
Conversation intelligence for sales
Revenue intelligence benchmarks from 2M+ analyzed deals
Toast (~$20B mkt cap)
Restaurant POS
Aggregate performance benchmarking across merchants
Shopify (~$120B mkt cap)
E-commerce platform
Benchmarks + Shopify Capital lending powered by merchant data
Historical Precedents (pre-SaaS)
Dun & Bradstreet (1840s), Experian/TransUnion/Equifax, FICO — these are the original workflow-to-intelligence companies. Merchants sharing payment data → credit bureaus → scoring → analytics. The model predates SaaS by over a century.
Notable Failures
Carta (2024 trust crisis) — A sales employee used confidential cap table data from a customer (Linear) to pitch secondary stock sales. Multiple companies reported similar behavior. CEO had to exit the secondary trading business entirely. The lesson: data monetization that creates conflicts of interest with the core workflow destroys customer trust. Directly relevant to how you structure the digital twin’s relationship to the compliance product.
NFX’s own company (Tickle) — Collected 24 billion data points from online quizzes. Found “the data wasn’t monetizable.” Lesson: volume alone doesn’t create a monetizable asset. The data must be unique, structured, and have clear buyers.
Zenefits — $4.5B → implosion from compliance violations. In data-rich verticals, compliance failures destroy the data asset by destroying customer trust.
Key Strategic Readings
Bessemer — “Ten Lessons from a Decade of Vertical Software Investing” — Lesson 5 explicitly covers building data businesses from vertical SaaS. Names FICO, Verisk, CoreLogic. Provides a 4-part framework.
NFX — “What Makes Data Valuable” — The most rigorous skeptical framework. Six conditions for genuine data network effects.
a16z — “Oil Wells vs. Pipelines” (Aug 2025) — “Oil wells” drill deep into a single workflow to own the system of record. Directly applicable to the compliance wedge.
Abraham Thomas — “Data and Defensibility” — Rigorous taxonomy of data advantages. Toast as data gravity exemplar.
SignalFire — “Why Networked SaaS is the New AI Business Model” — $1.2T addressable value by 2030.
What This Suggests for TBD
Flatiron Health is the strongest analog — purpose-built clinical workflow wedge designed from day one to generate an intelligence data asset, validated at $1.9B. Altana AI is the closest sector comp ($1B valuation, compliance wedge → supply chain graph, serves CBP). Coupa’s “Community Intelligence” is the best reference for how to frame the value exchange to customers (“your anonymized data improves the product for everyone”).
The Carta lesson is the most important cautionary tale: the data monetization side must never create a conflict of interest with the compliance workflow side. Structural separation matters.