Perly Consulting │ Beck Eco

The State of Play

A living index of AI adoption across industries — where established practice meets the bleeding edge
UPDATED DAILY

The AI landscape doesn't move in one direction — it lurches. Some techniques leap from experiment to table stakes in a single quarter; others stall against regulatory walls, technical ceilings, or organisational inertia that no amount of hype can dislodge. Knowing which is which is the hard part. The State of Play cuts through the noise with a rigorously maintained index of AI techniques across every major business domain — classified by maturity, evidenced by real-world adoption, and updated daily so you always know where you stand relative to the field. Stop guessing. Start knowing.

The Daily Dispatch

A daily newsletter distilling the past two weeks of movement in a domain or two — delivered to your inbox while the index updates in the background.

AI Maturity by Domain

Each dot marks the weighted maturity of practices within a domain — hover for a brief summary, click for more detail

DOMAIN
BLEEDING EDGEESTABLISHED

Data catalogue, metadata & lineage management

GOOD PRACTICE

TRAJECTORY

Stalled

AI that automatically catalogues datasets, generates metadata, and tracks data lineage across transformations and systems. Includes automated schema documentation and lineage graph generation; distinct from data quality monitoring which checks correctness rather than documenting provenance.

OVERVIEW

Data cataloguing and metadata management has crossed a critical inflection point in 2026: from nice-to-have to AI blocker. The question has shifted from "does this work" to "can we afford not to?" Vendor platforms are mature—Databricks GA lineage system tables, dbt Catalog, Apache Polaris TLP graduation, and major cloud providers shipping native metadata layers. Gartner's return to publishing a Magic Quadrant after a five-year hiatus confirmed market consolidation around five Leaders (Atlan, Alation, Informatica, IBM, Collibra). The core tension remains unchanged: tool maturity far exceeds organizational adoption discipline. Fewer than 30% of users actively engage post-deployment; Fortune 500 companies still track dependencies in spreadsheets; independent stress tests reveal most platforms fail silently under real governance load. Yet adoption pressure has intensified catastrophically: MIT attributes 95% of AI deployment failures to data governance gaps; EU AI Act mandates data lineage for high-risk systems; text-to-SQL accuracy crashes from 86% (benchmark) to 6% (real enterprise databases without governed metadata). The practice is firmly good-practice tier—proven at scale, economically justified, strategically critical—but constrained by persistent organizational barriers and integration complexity rather than technical capability.

CURRENT LANDSCAPE

Two-layer market architecture has solidified: technical catalogs (Databricks Unity Catalog, Snowflake Horizon/Polaris, AWS Glue) providing physical metadata and access control; governance catalogs (Atlan, Alation, Collibra, Informatica, IBM) providing business metadata, lineage, and discovery—with dbt Catalog and major tools shipping native metadata layers. Databricks, dbt, and Snowflake all released GA lineage features in April 2026, signaling platform consolidation. Apache Polaris graduated to ASF Top-Level Project (Feb 2026), validating open-source catalog infrastructure maturity. Market leaders dominate: Alation 570+ clients (32 countries), Collibra 5.25B valuation (Raito 2025 acquisition for access governance), Informatica, Atlan ($750M valuation, Gartner Leader advancement 2025-2026). Financial services deployments accelerated: Solidatus (10-100x speed gains) serving BNY, HSBC, LSEG; named case study documented 53% documentation workload reduction in 90 days (Kiwi.com on Atlan). Open-source adoption at scale: DataHub 3,000+ orgs managing 3M assets (Netflix, Visa, Slack, Foursquare, Pinterest); Slack collapsed 6 years of metadata debt in 3 days.

Deployment economics confirmed. American Airlines: 130,000 employees across Unity Catalog/Alation. Financial services: 65%→92% lineage completeness, 15% efficiency gains. US government: mission-critical document governance (20M+ annual). VA government: 75% discovery time reduction (Collibra). Market USD 3.01B (2026, 21.9% CAGR to USD 12.04B by 2033); lineage segment specifically USD 2.10B at 22.2% CAGR to USD 10.45B (2034).

Yet adoption friction persists despite maturity. Collibra implementations: 6-12 months, $100K+/year, fewer than 30% active engagement post-launch. Alation: $198K+ annual cost, 5-6 month implementation, column-level lineage at premium tier—signaling ROI barriers and change management complexity remain primary limiting factors despite vendor maturity. Survey data: 91% report slower search, 60% cite outdated documentation, 74% struggle in 500+ asset organizations (documentation-decay paradox). Practitioners identify platform-specific failures: batch-oriented catalogs fail under real-time governance, AI training validation, and decentralized architectures; siloed discovery/governance/observability tools incompatible with modern operations. The core constraint is not technology but governance discipline and organizational readiness to sustain continuous metadata curation and business user engagement.

TIER HISTORY

ResearchJan-2020 → Jan-2020
Bleeding EdgeJan-2020 → Jan-2022
Leading EdgeJan-2022 → Jul-2023
Good PracticeJul-2023 → present

EVIDENCE (129)

— Critical assessment arguing data catalogs and lineage tools are architecturally declining, being subsumed into semantic layers; claims tools are inherently passive and decay without enforcement—important architectural limitation signal.

— Vendor thought leadership on metadata maintenance: automated feedback loops improve AI agent accuracy from 60% to near-100% in production, directly quantifying practice value for AI governance.

— Critical barrier quantification: 63% of organizations lack AI-ready data management practices; Gartner projects 60% of AI projects will be abandoned due to inadequate data foundations.

The Business Value of DataHub CloudAdoption Metrics

— IDC-sponsored ROI study quantifying DataHub Cloud deployment impact: 17-18% productivity gains, 91% faster searches, 58% faster incident resolution, up to 25% storage savings per deployment.

— Vendor analysis of Collibra adoption barriers: long implementation cycles, governance-heavy UX, opaque pricing—signals market maturity plateau and evolution pressure toward lighter alternatives.

— Vendor positioning of enterprise data graphs as foundational AI infrastructure; cites Gartner research: 60% of AI projects fail without context infrastructure, reframes AI hallucination as data context problem.

— Vendor GA for AI-powered lineage; cites CDO prioritization (38%) amid regulatory pressure (EU AI Act, GDPR, CCPA), positioning automated lineage as mandatory governance infrastructure.

— DataHub supports 3,000+ organizations managing 3M+ assets with quantified deployment outcomes: 91% faster data searches (50 min → 5 min), 119% more AI/ML models to production, 48% fewer data-related outages.

HISTORY

  • 2020: Early market maturation with Alation and Collibra establishing leadership through ecosystem partnerships; 500+ enterprise customers in financial services and pharma; recognized as critical for compliance and impact analysis but faced widespread documentation bottlenecks and scalability questions.
  • 2021: Vendor landscape expanded with Microsoft (Azure Purview integration), open-source alternatives (Amundsen, Spline adoption at enterprises), and new SaaS entrants; category consolidation began with Azure Data Catalog deprecation; despite maturation, enterprises still maintained fragmented metadata repositories indicating deployment complexity remained high.
  • 2022-H1: Ecosystem solidified with Databricks GA of lineage in Unity Catalog, major vendors (Alation, Collibra, Cloudera) integrating lineage as core feature; adoption accelerated at technology leaders (AWS Glue/Spline, academic deployments for compliance), yet critical assessments emerged questioning design-time lineage ROI—signaling that despite vendor maturity, real-world deployment remained operationally complex and value realization uneven.
  • 2022-H2: Vendor ecosystem continued expansion with Collibra adding Data Marketplace and cloud-native features, Alation achieving $100M ARR and 25% Fortune 100 penetration; analyst focus shifted to operational integration (Forrester's "DataOps" framing); however, IDC survey revealed only 28% of organizations widely adopted data intelligence, exposing the persistent gap between strategic recognition and real-world deployment maturity.
  • 2023-H1: Real-world adoption accelerated: Crocs deployed Alation for cloud migration governance, confirming data catalog utility for operational transformation; data.world released generative AI-powered governance features, signaling vendor focus on automating metadata discovery; Dresner Advisory's 2023 study (7th edition) documented market maturation across customer segments. Yet automation remained the unresolved frontier—survey data indicated 70% of enterprises still relied on manual governance practices despite mature catalog vendors.
  • 2023-H2: Vendor leadership solidified with Alation entering Japan market (CTC partnership, 10B JPY target) and Collibra named Forrester Leader Q3 2023; real-world deployments scaled (RaceTrac: 256M annual transactions with Alation/Databricks; Swapfiets: 125 metrics governance via Atlan). Yet implementation barriers hardened: consulting assessments documented glossary design failures, inadequate curation, and chronic underestimation of resourcing; critical industry analyses highlighted manual toil, continuous data change outpacing catalog refresh, and lack of distributed architecture, exposing fundamental structural limitations despite tool maturity.
  • 2024-Q1: Vendor cloud-native expansion continued (AWS-Alation PrivateLink for HIPAA-compliant deployments) and public-sector adoption grew (University of Colorado multi-department Collibra rollout); yet adoption barriers remained unchanged. Gartner research cited 40% catalog program failure rate driven by lack of business engagement; practitioner analyses documented persistent design-time user neglect and metadata scope ambiguity; international case studies reported 30% usage drops in real deployments. Lineage for real-time systems emerged as unresolved frontier beyond traditional batch catalog architectures. Manual governance reliance persisted at 70% despite vendor maturity; the practice remained locked in its core tension between universal strategic recognition and persistent operational implementation barriers.
  • 2024-Q2: Vendor innovation accelerated in automation and real-time lineage: Alation launched Workflow Automation bots (Completeness, Compliance) addressing persistent manual governance bottleneck; Microsoft Purview redesigned governance layers with business domains and data products; Oracle expanded global regional coverage; OpenLineage standard addressed batch-centric lineage limitations for streaming. Azure Data Catalog retirement completed consolidation toward unified governance. However, adoption barriers persisted unchanged—70% manual governance reliance, design-time user neglect, metadata scope ambiguity—with no evidence of resolution. Vendor expansion signaled market maturation and competitive pressure, but addressable barriers remained organizational and cultural rather than technical. Practice remained at good-practice tier: proven enterprise deployment yet constrained by operational complexity and unclear lineage ROI.
  • 2024-Q3: Vendor maturation accelerated: W3C published DCAT 3 formal recommendation (Aug 2024) establishing standardized metadata interoperability; Informatica, Collibra, and Atlan all named Leaders in Forrester Wave Q3 2024, highlighting analyst recognition shift toward AI-augmented governance and observability. Informatica's July 2024 AI-powered inferred lineage launch addressed discovery at scale (1000+ sources); research confirmed 82% of organizations still lack governance/catalog solutions, yet enterprises with 100+ catalog users reported 74% data trust correlation. Critical assessments from observability vendors highlighted persistent usability barriers and insufficient integration with complementary practices. Adoption barriers remained unchanged despite vendor innovation: organizational readiness, design-time user engagement, and ROI clarity remained primary limiting factors. Good-practice tier held firm.
  • 2024-Q4: Cloud-native lineage matured: AWS launched Data Lineage GA in DataZone with OpenLineage compatibility (December 2024), bringing automated lineage capture to cloud enterprises. Real-world deployments documented concrete scale and ROI: Alation case studies showed a delivery service uncovering quality issues on Snowflake and an EU supermarket chain scaling to 4,000+ users with $12.4M savings. Market consolidation continued with major vendors emphasizing AI-powered discovery and federated architectures. Despite ecosystem maturity and analyst recognition, adoption barriers persisted: 82% of organizations still lacked catalog solutions, manual governance at 70%, and design-time engagement remained critical bottleneck. Good-practice tier confirmed—proven deployment capability yet constrained by organizational readiness barriers.
  • 2025-Q1: Vendor analyst recognition advanced: Collibra named Gartner Leader for Data and Analytics Governance Platforms (Feb 2025); enterprise deployments continued scaling with DACH retail (Collibra) and financial services (lineage completeness 65%→92%, 15% efficiency gain) seeing concrete ROI. Technical innovation addressed metadata complexity: Apache Iceberg research demonstrated automated synchronization across multi-platform environments (Snowflake/Databricks), reducing management burden. Yet implementation challenges persisted: practitioner analysis of 150+ deployments identified critical failure modes (poor planning, technology-first approaches, adoption neglect), confirming that governance barriers remained organizational and cultural rather than technical. Analyst recognition of AI-driven governance acceleration signaled evolving market expectations toward intelligent metadata discovery. Practice remained at good-practice tier: deployments showed continued scale and value realization, but mass-market adoption remained blocked by persistent organizational complexity and design-time engagement barriers.
  • 2025-Q2: Vendor ecosystem maturation continued with product-led innovation: Ataccama released v16.1 with automated lineage visualization and cloud-native support; Alation demonstrated enterprise-scale deployment at American Airlines (130,000 employees, Unity Catalog integration, automated metadata extraction). Market consolidation signaled leadership: Alation at 570+ clients across 32 countries ($1.7B valuation), Collibra, Informatica, and Atlan anchored competitive positions. Open-source ecosystem showed maturity gaps (lineage, quality frameworks, RBAC) despite broader adoption. Yet adoption barriers persisted unchanged: manual governance reliance at 70%, implementation complexity documented in consultancy analyses, industry statistics cited 80% governance initiative failure rates and $15M annual losses from poor data visibility. Automation benefits claimed by vendors (60% audit time reduction, 70% lineage accuracy) lacked independent validation. Practice remained at good-practice tier: demonstrated real-world scale and value but constrained by persistent organizational implementation barriers and resourcing complexity.
  • 2025-Q3: Vendor innovation focused on dark data and consumption patterns: Collibra launched Data Usage capability for Snowflake (July 2025), addressing the persistent challenge that 66% of enterprise data remains unused. Real-world deployments continued at scale: transportation sector firm completed MVP rollout with structured metamodel and lineage configuration; global life sciences company deployed Data Marketplace with business-aligned governance and multi-step access controls. Practitioner analyses from September onward emphasized adoption timelines and critical success factors—distinguishing pilot success from organization-wide adoption, signaling market maturation around implementation methodology. However, critical assessments from Dataedo and practitioners documented persistent failure modes (slow progress, trust deficits, incomplete documentation, user engagement collapse) mirroring pre-2025 barriers. Collibra Workflows implementations documented common pitfalls (misaligned objectives, performance bottlenecks, resource underestimation). Yet concurrent practitioner analyses showed data catalogs remain essential for AI readiness, with Gartner research citing 60% of AI projects fail without AI-ready data practice—positioning catalogs as foundational to enterprise AI strategy. Adoption barriers remained organizational (change management, governance discipline, documentation sustainability) rather than technical. Practice remained at good-practice tier: real deployments confirmed value at enterprise scale, but barriers to mass adoption persisted unchanged.
  • 2025-Q4: Government-scale deployments and ecosystem maturation reinforced practice value while exposing continued integration requirements. US federal government deployed Alation for mission-critical financial document governance (20M+ annually) with 98% faster login and 95% faster onboarding, confirming production-scale adoption in regulated sectors. AWS marketplace validated Collibra platform with independent customer achieving 30% manual work reduction over 2-year deployment. Open-source ecosystem advanced: Magda v5.0.0 released with AI chatbot and hybrid search capabilities, supporting 100,000+ datasets historically. Vendor research highlighted persistent barriers: Alation's dominance (550+ customers, $1.7B valuation) constrained by high implementation complexity and premium pricing; mid-market ROI modeling (750% first-year ROI for 150-user bank deployment) remained vendor-optimistic without independent validation. Critical assessments emerged on data quality integration: leading vendors emphasized that metadata catalogs alone fail without automated quality validation, as traditional approaches accumulate metadata faster than teams can validate it, leading to decay and user distrust. Adoption barriers crystallized around integration and governance discipline rather than technical capability. Good-practice tier maintained: demonstrated government and enterprise scale, yet practice remained constrained by organizational implementation barriers and the persistent requirement for complementary data quality platforms.
  • 2026-Jan: Vendor innovation accelerated with AI-powered automation and analyst recognition. Informatica named Gartner Leader for Data & Analytics Governance Platforms (Jan 2026), signaling analyst validation of integrated governance maturity. NTT Docomo deployed Alation's machine learning data catalog with Documentation Agent and Workflow Automation, demonstrating production-scale governance automation. Alation research confirmed 71% organizational adoption of formal governance programs, with McKinsey data indicating structured approaches deliver use cases 90% faster and reduce costs by 30%. Yet adoption barriers persisted: independent evaluations documented Collibra implementations requiring 6-12 months with $100K+/year costs; critical assessments revealed fewer than 30% of users actively engage post-implementation, with poor data quality costing organizations $12.9M annually and data professionals losing 20% of project time to discovery inefficiencies. Independent ROI analyses quantified benefits (60% discovery time reduction, 90% support ticket decrease) validating economic case despite implementation barriers. Good-practice tier held: demonstrated production-scale deployments and analyst recognition, yet structural adoption challenges persisted unchanged.
  • 2026-Feb: Analyst ecosystem and market metrics signaled continued maturity. Gartner's Magic Quadrant for data catalogs returned after 5-year hiatus (Nov 2025), naming Leaders as Atlan, Alation, Informatica, IBM, and Collibra, with market shift toward active metadata orchestration platforms. Global data lineage market forecast expansion to USD 65.5B by 2035 (CAGR 25.6%) with 51% current adoption—indicating strong ecosystem maturation. Real-world deployments continued: global life sciences firm migrated legacy Data Marketplace to Collibra, managing 300+ data publications for 100+ users. Yet critical adoption barriers persisted: Fortune 500 companies still managing data dependencies in Excel (exposing AI governance maturity gaps), and practitioners documented repeated lineage implementation failures (dashboard-only thinking, untracked manual steps, broken ownership) causing slow delivery and user distrust. Metadata governance for AI emerged as critical constraint: text-to-SQL accuracy dropped from 86% (academic benchmark) to 6% (real enterprise databases without AI-ready metadata), with 47% of knowledge workers making major decisions based on hallucinated AI output. Practice remained at good-practice tier: mature vendor ecosystem and analyst recognition confirmed, yet organizational barriers—integration requirements, governance discipline gaps, and AI-readiness constraints—remained primary adoption limiting factors.
  • 2026-Mar: Deployment scale and regulatory pressure intensified simultaneously: DataHub surpassed 3,000 organisations with Slack collapsing six years of metadata debt in three days; EU AI Act formally mandates data lineage for high-risk AI systems, creating compliance-driven adoption pressure. Databricks research documents catalog users generating 12x more AI agents; MIT analysis attributes 95% of AI deployment failures to data governance gaps. Solidatus AI Lineage Assistant reached GA (10x-100x speed gains, serving BNY, HSBC, LSEG); Google consolidated Data Catalog into Dataplex Universal Catalog (Jan 2026) with column-level lineage and AI semantic search. Market confirmed at USD 3.01B (2026, 21.9% CAGR to USD 12.04B by 2033), while independent stress tests documented most catalog platforms failing silently under real governance load — confirming that tooling maturity has not resolved organisational adoption barriers.
  • 2026-Apr: Market inflection moment: data governance and metadata management elevated to primary AI blocker status for first time in analyst history. Gartner projects 75% organizational adoption of active metadata practices by 2027; MIT Sloan/McKinsey/Deloitte 2026 surveys confirm data infrastructure as top barrier to AI production (39% deployment rate vs 88% pilot adoption); 60% abandon AI projects due to governance gaps. Vendor ecosystem maturation accelerated: dbt Catalog reached GA with column-level lineage and automated metadata discovery; Databricks GA lineage system tables (table_lineage, column_lineage) shipped as part of Unity Catalog; Informatica recognized as Gartner Leader; IBM watsonx.data intelligence GA shipping automated lineage export to Collibra; Alation Business Lineage GA enabling non-technical users to understand impact analysis at scale. Market architecture solidified into two distinct layers — technical catalogs (Unity Catalog, Polaris, AWS Glue) and governance catalogs (Atlan, Alation, Collibra) — with documented case study of Kiwi.com achieving 53% documentation workload reduction in 90 days on Atlan. Adoption friction persisted: survey data showed 91% report slower search, 60% cite outdated documentation, 74% struggle in 500+ asset organizations, and Alation's $198K+ annual cost with 5-6 month implementation remain primary ROI barriers. Lineage market segment demonstrated rapid expansion ($2.10B at 22.2% CAGR to 2034) driven by AI governance and regulatory compliance. Good-practice tier confirmed: mature vendor ecosystem with analyst recognition and government-scale deployments validated; organizational readiness and integration complexity remain constraining factors on adoption velocity.
  • 2026-May: Deployment evidence validated practice maturity while architectural limitations emerged. DataHub confirmed 3,000+ organizations managing 3M+ assets with IDC-quantified outcomes (91% faster searches, 119% more AI/ML to production, 48% fewer data-related outages); Informatica, Collibra, and Alation confirmed market consolidation with regulatory drivers (EU AI Act, GDPR, CCPA) mandating governance. Automated feedback loops in metadata maintenance (Alation) demonstrated improvement of AI agent accuracy from 60% to near-100% in production, directly quantifying catalog value for AI governance. Critical assessments documented persistent adoption barriers: Alation and Collibra implementations require 6-12 months with $100K+/year costs; fewer than 30% of users actively engage post-launch; 63% of organizations still lack AI-ready data management practices. Architectural limitations surfaced: batch-oriented catalog designs fail under real-time governance and decentralised AI governance requirements; critical opinion argued traditional catalogs are architecturally declining as metadata tooling becomes subsumed into semantic layers. Good-practice tier held: proven deployment at scale and economic ROI validated, but structural adoption barriers and architectural evolution pressures constrain velocity toward category maturity.

TOOLS