Workflow-to-Intelligence Business Model

Naming the Pattern

There’s no single canonical name. The most common labels in VC/strategy circles:

TermUsed By
Data FlywheelMost pitch decks, a16z, Bessemer, SignalFire
Data Network EffectsNFX (though they argue most claims are overstated)
Data Moat / Data-as-a-MoatBessemer explicitly
Networked SaaSSignalFire’s 2024-25 branded framework
Oil Wellsa16z (Aug 2025) — drill deep into one workflow, own the data
System of Record → System of IntelligenceEnterprise software circles
Data GravityAbraham Thomas (Pivotal)

In a pitch, I’d say “data flywheel” for the shorthand, “workflow-to-intelligence” for the descriptive label, and cite Bessemer’s vertical SaaS data lesson or a16z’s “oil wells” framing as the canonical VC references.


Companies by Industry

Tier 1 — Canonical Exemplars (most cited in VC discussions)

CompanyWedge (Workflow)Intelligence (Monetized Data)
CartaCap table management for 50K+ startupsPrivate market intelligence: valuations, round sizes, comp benchmarks
ADPPayroll for ~1 in 6 US workersNational Employment Report (economists cite it monthly), comp benchmarks
Flatiron Health ($1.9B → Roche)Oncology EHR for clinicsReal-world evidence datasets sold to pharma for drug development
Veeva (~$35B mkt cap)Cloud CRM/content for life sciencesVeeva Data Cloud — HCP reference data, prescriber data, claims
Verisk (~$40B mkt cap)Insurance forms, rating, and actuarial services (since 1971)Claims analytics, property data, risk models across the industry

Fintech / Payments

CompanyWedgeIntelligence
MastercardPayment network (125B+ transactions)SpendingPulse, Test & Learn, Advisors division; acquired Recorded Future ($2.65B)
PlaidBank account linking APIsAggregate transaction patterns → risk scoring, income verification
StripePayment processingRadar fraud detection trained on cross-merchant data
RampCorporate expense managementPrice Intelligence — contract benchmarking against millions of transactions
AdyenEnterprise payment processingRevenueProtect fraud ML trained on cross-merchant data

Healthcare / Life Sciences

CompanyWedgeIntelligence
Tempus AI (public, $693M rev)Genomic diagnostics & clinical decision supportLargest clinical+molecular data library; curated datasets sold to pharma
IQVIA (~$45B mkt cap)CRM/engagement for pharma, clinical trial mgmt1.2B+ de-identified patient records, 53+ PB proprietary data
Komodo HealthHealthcare analytics platformReal-world data foundation for drug distribution and sales strategy

Supply Chain / Logistics

CompanyWedgeIntelligence
Coupa (~$8B acquisition)Source-to-pay procurement SaaS”Community Intelligence” from $8T+ in anonymized spend data
Altana AI ($1B valuation)Supply chain compliance platform (serves CBP)Global trade relationship graph — closest comp to Project TBD
project44Transportation visibilityPredictive ETAs and benchmarks from cross-shipper/carrier data
FlexportDigital freight forwardingSupply chain intelligence from aggregate shipment/customs data
FourKitesSupply chain visibility SaaSPredictive analytics and benchmarking from aggregate tracking data

Real Estate

CompanyWedgeIntelligence
CoStar (~$35B mkt cap)CRE data platform (LoopNet, Apartments.com)Industry-standard CRE analytics — comps, submarket data, tenant intelligence
VTSCRE leasing/asset mgmt (60%+ US Class A office)300M+ data points → real-time supply/demand, pricing intelligence
Yardi MatrixProperty management SaaSRevenue/expense benchmarking, ownership data, supply pipeline

HR / Compensation

CompanyWedgeIntelligence
Pave ($1.6B val)Compensation management for 8,600+ cosReal-time comp benchmarks from actual payroll/cap table data
Burning Glass / LightcastJob posting aggregationLabor market intelligence — skills taxonomy, demand forecasting

Cybersecurity

CompanyWedgeIntelligence
CrowdStrike (~$90B mkt cap)Falcon endpoint securityGlobal Threat Report + ML models trained on aggregate threat data
Recorded Future ($2.65B → Mastercard)Threat intelligence platformAggregated open/dark web + technical sources via ML/NLP
Coalition ($3.5B val)Cyber insurance + monitoring”Active Data Graph” — proprietary risk data for underwriting

Insurance / Energy / Other Verticals

CompanyWedgeIntelligence
Guidewire (~$18B mkt cap)Core insurance platformClaims Intel — anonymized claims benchmarking across insurers
EnverusEnergy workflow SaaSBenchmark data from 95%+ of US energy producers
FBNPrecision ag + seed purchasingAnonymized seed performance and input pricing benchmarks
ServiceTitan (public)Field service mgmt for tradesTitan Score — performance benchmarking across contractors
Gong ($7.25B val)Conversation intelligence for salesRevenue intelligence benchmarks from 2M+ analyzed deals
Toast (~$20B mkt cap)Restaurant POSAggregate performance benchmarking across merchants
Shopify (~$120B mkt cap)E-commerce platformBenchmarks + Shopify Capital lending powered by merchant data

Historical Precedents (pre-SaaS)

Dun & Bradstreet (1840s), Experian/TransUnion/Equifax, FICO — these are the original workflow-to-intelligence companies. Merchants sharing payment data → credit bureaus → scoring → analytics. The model predates SaaS by over a century.


Notable Failures

Carta (2024 trust crisis) — A sales employee used confidential cap table data from a customer (Linear) to pitch secondary stock sales. Multiple companies reported similar behavior. CEO had to exit the secondary trading business entirely. The lesson: data monetization that creates conflicts of interest with the core workflow destroys customer trust. Directly relevant to how you structure the digital twin’s relationship to the compliance product.

NFX’s own company (Tickle) — Collected 24 billion data points from online quizzes. Found “the data wasn’t monetizable.” Lesson: volume alone doesn’t create a monetizable asset. The data must be unique, structured, and have clear buyers.

Zenefits — $4.5B → implosion from compliance violations. In data-rich verticals, compliance failures destroy the data asset by destroying customer trust.


Key Strategic Readings

  1. Bessemer — “Ten Lessons from a Decade of Vertical Software Investing” — Lesson 5 explicitly covers building data businesses from vertical SaaS. Names FICO, Verisk, CoreLogic. Provides a 4-part framework.
  2. NFX — “What Makes Data Valuable” — The most rigorous skeptical framework. Six conditions for genuine data network effects.
  3. a16z — “Oil Wells vs. Pipelines” (Aug 2025) — “Oil wells” drill deep into a single workflow to own the system of record. Directly applicable to the compliance wedge.
  4. Abraham Thomas — “Data and Defensibility” — Rigorous taxonomy of data advantages. Toast as data gravity exemplar.
  5. SignalFire — “Why Networked SaaS is the New AI Business Model” — $1.2T addressable value by 2030.

What This Suggests for TBD

Flatiron Health is the strongest analog — purpose-built clinical workflow wedge designed from day one to generate an intelligence data asset, validated at $1.9B. Altana AI is the closest sector comp ($1B valuation, compliance wedge → supply chain graph, serves CBP). Coupa’s “Community Intelligence” is the best reference for how to frame the value exchange to customers (“your anonymized data improves the product for everyone”).

The Carta lesson is the most important cautionary tale: the data monetization side must never create a conflict of interest with the compliance workflow side. Structural separation matters.