The AI landscape doesn't move in one direction — it lurches. Some techniques leap from experiment to table stakes in a single quarter; others stall against regulatory walls, technical ceilings, or organisational inertia that no amount of hype can dislodge. Knowing which is which is the hard part. The State of Play cuts through the noise with a rigorously maintained index of AI techniques across every major business domain — classified by maturity, evidenced by real-world adoption, and updated daily so you always know where you stand relative to the field. Stop guessing. Start knowing.
A daily newsletter distilling the past two weeks of movement in a domain or two — delivered to your inbox while the index updates in the background.
Each dot marks the weighted maturity of practices within a domain — hover for a brief summary, click for more detail
AI that automatically catalogues datasets, generates metadata, and tracks data lineage across transformations and systems. Includes automated schema documentation and lineage graph generation; distinct from data quality monitoring which checks correctness rather than documenting provenance.
Data cataloguing and metadata management has crossed a critical inflection point in 2026: from nice-to-have to AI blocker. The question has shifted from "does this work" to "can we afford not to?" Vendor platforms are mature—Databricks GA lineage system tables, dbt Catalog, Apache Polaris TLP graduation, and major cloud providers shipping native metadata layers. Gartner's return to publishing a Magic Quadrant after a five-year hiatus confirmed market consolidation around five Leaders (Atlan, Alation, Informatica, IBM, Collibra). The core tension remains unchanged: tool maturity far exceeds organizational adoption discipline. Fewer than 30% of users actively engage post-deployment; Fortune 500 companies still track dependencies in spreadsheets; independent stress tests reveal most platforms fail silently under real governance load. Yet adoption pressure has intensified catastrophically: MIT attributes 95% of AI deployment failures to data governance gaps; EU AI Act mandates data lineage for high-risk systems; text-to-SQL accuracy crashes from 86% (benchmark) to 6% (real enterprise databases without governed metadata). The practice is firmly good-practice tier—proven at scale, economically justified, strategically critical—but constrained by persistent organizational barriers and integration complexity rather than technical capability.
Two-layer market architecture has solidified: technical catalogs (Databricks Unity Catalog, Snowflake Horizon/Polaris, AWS Glue) providing physical metadata and access control; governance catalogs (Atlan, Alation, Collibra, Informatica, IBM) providing business metadata, lineage, and discovery—with dbt Catalog and major tools shipping native metadata layers. Databricks, dbt, and Snowflake all released GA lineage features in April 2026, signaling platform consolidation. Apache Polaris graduated to ASF Top-Level Project (Feb 2026), validating open-source catalog infrastructure maturity. Market leaders dominate: Alation 570+ clients (32 countries), Collibra 5.25B valuation (Raito 2025 acquisition for access governance), Informatica, Atlan ($750M valuation, Gartner Leader advancement 2025-2026). Financial services deployments accelerated: Solidatus (10-100x speed gains) serving BNY, HSBC, LSEG; named case study documented 53% documentation workload reduction in 90 days (Kiwi.com on Atlan). Open-source adoption at scale: DataHub 3,000+ orgs managing 3M assets (Netflix, Visa, Slack, Foursquare, Pinterest); Slack collapsed 6 years of metadata debt in 3 days.
Deployment economics confirmed. American Airlines: 130,000 employees across Unity Catalog/Alation. Financial services: 65%→92% lineage completeness, 15% efficiency gains. US government: mission-critical document governance (20M+ annual). VA government: 75% discovery time reduction (Collibra). Market USD 3.01B (2026, 21.9% CAGR to USD 12.04B by 2033); lineage segment specifically USD 2.10B at 22.2% CAGR to USD 10.45B (2034).
Yet adoption friction persists despite maturity. Collibra implementations: 6-12 months, $100K+/year, fewer than 30% active engagement post-launch. Alation: $198K+ annual cost, 5-6 month implementation, column-level lineage at premium tier—signaling ROI barriers and change management complexity remain primary limiting factors despite vendor maturity. Survey data: 91% report slower search, 60% cite outdated documentation, 74% struggle in 500+ asset organizations (documentation-decay paradox). Practitioners identify platform-specific failures: batch-oriented catalogs fail under real-time governance, AI training validation, and decentralized architectures; siloed discovery/governance/observability tools incompatible with modern operations. The core constraint is not technology but governance discipline and organizational readiness to sustain continuous metadata curation and business user engagement.
— Critical assessment arguing data catalogs and lineage tools are architecturally declining, being subsumed into semantic layers; claims tools are inherently passive and decay without enforcement—important architectural limitation signal.
— Vendor thought leadership on metadata maintenance: automated feedback loops improve AI agent accuracy from 60% to near-100% in production, directly quantifying practice value for AI governance.
— Critical barrier quantification: 63% of organizations lack AI-ready data management practices; Gartner projects 60% of AI projects will be abandoned due to inadequate data foundations.
— IDC-sponsored ROI study quantifying DataHub Cloud deployment impact: 17-18% productivity gains, 91% faster searches, 58% faster incident resolution, up to 25% storage savings per deployment.
— Vendor analysis of Collibra adoption barriers: long implementation cycles, governance-heavy UX, opaque pricing—signals market maturity plateau and evolution pressure toward lighter alternatives.
— Vendor positioning of enterprise data graphs as foundational AI infrastructure; cites Gartner research: 60% of AI projects fail without context infrastructure, reframes AI hallucination as data context problem.
— Vendor GA for AI-powered lineage; cites CDO prioritization (38%) amid regulatory pressure (EU AI Act, GDPR, CCPA), positioning automated lineage as mandatory governance infrastructure.
— DataHub supports 3,000+ organizations managing 3M+ assets with quantified deployment outcomes: 91% faster data searches (50 min → 5 min), 119% more AI/ML models to production, 48% fewer data-related outages.