Perly Consulting │ Beck Eco

The State of Play

A living index of AI adoption across industries — where established practice meets the bleeding edge
UPDATED DAILY

The AI landscape doesn't move in one direction — it lurches. Some techniques leap from experiment to table stakes in a single quarter; others stall against regulatory walls, technical ceilings, or organisational inertia that no amount of hype can dislodge. Knowing which is which is the hard part. The State of Play cuts through the noise with a rigorously maintained index of AI techniques across every major business domain — classified by maturity, evidenced by real-world adoption, and updated daily so you always know where you stand relative to the field. Stop guessing. Start knowing.

The Daily Dispatch

A daily newsletter distilling the past two weeks of movement in a domain or two — delivered to your inbox while the index updates in the background.

AI Maturity by Domain

Each dot marks the weighted maturity of practices within a domain — hover for a brief summary, click for more detail

DOMAIN
BLEEDING EDGEESTABLISHED

Budget variance analysis & narrative explanation

GOOD PRACTICE

TRAJECTORY

Stalled

AI that analyses budget variances and generates narrative explanations of why actuals deviated from plan. Includes automated waterfall decomposition and natural language variance commentary; distinct from financial reporting which presents results rather than explaining variances.

OVERVIEW

AI-driven budget variance analysis has moved past proof-of-concept into proven, accessible tooling. The practice automates what was once one of the most labour-intensive steps in the close cycle: decomposing actual-versus-plan deviations into their drivers (price, volume, mix) and generating narrative explanations fit for management review. GA features from Microsoft, IBM, Pigment, and HighRadius now handle this end-to-end, and a Workiva survey of nearly 1,500 finance professionals found 91% reporting that AI improved the timeliness of financial decisions.

The question facing finance teams is no longer whether the technology works, but whether their data and processes are ready for it. Vendor capability is mature; the constraint is organisational. Data quality, multi-entity complexity, and the enduring need for human review on publication-ready narratives set the pace of rollout. For teams with clean, well-governed planning data, variance automation delivers measurable close-cycle compression. For those without it, the tooling outpaces the foundation.

CURRENT LANDSCAPE

The vendor ecosystem has consolidated around a handful of production-grade platforms, with the May-June 2026 window marking a clear inflection toward deployment at scale. Microsoft's Variance Analysis Agent, formally GA in April 2026 and now integrated into Excel via 365 Copilot, has become the most widely accessible entry point—deployed within existing tools for Pro/Business Standard subscribers with immediate adoption across enterprise base. Pigment's Analyst Agent, launched November 2025, continues to drive automation across cost centres; customers including Coca-Cola, Unilever, ServiceNow, and Supercell cite days of manual work eliminated per cycle. Carta's deployment with Pigment achieved 80% reduction in data aggregation time. IBM Planning Analytics added AI-driven price/volume decomposition in early 2026. HighRadius scales past 1,000 deployments. V7 Labs' Business Performance Analysis Agent reduces monthly variance reporting from 1-2 days to 10-15 minutes. Arbo and SkyStem provide new GA options, establishing automation as table-stakes across FP&A vendors. ChatFin, positioned for CFO-led month-end close automation, demonstrates the shift from dedicated variance tools to integrated close-cycle agents that handle variance narratives as a component of end-to-end close workflows.

Late May 2026 adoption inflection: A Consero survey of 102 PE/VC-backed CFOs (mid-May) ranked management reporting and variance analysis as the #1 AI use case in finance at 32% adoption with 3-6 month payback—the fastest payback window of all finance workflows. 42% report broad or fully embedded AI in finance (up 20 points year-over-year), and 75% see ROI within 12 months. Vertical Edge AI's synthesis of Deloitte research (1,300+ finance leaders at $1B+ revenue companies) documents that variance narratives have moved from forecast to deployed stage within a single fiscal year, with 63% of major finance functions fully deploying AI. KPMG's parallel survey of 1,013 senior finance leaders across 20 countries confirms AI delivers strongest gains in judgment-heavy work—decision-making quality at 70% and speed at 71%—the exact competencies variance explanation demands. This convergence of adoption signal, deployment acceleration, and evidence that variance narratives rank as the fastest-paying finance AI use case represents the transition to mainstream adoption: the technology is proven, deployments are scaling, and finance leadership has moved from "should we?" to "how fast can we?"

The architectural and organizational constraints shaping scaling remain significant. Analyst time allocation reveals a fundamental bottleneck: FP&A teams spend 80% of their time gathering and reconciling data across systems, not on analysis—meaning variance automation that does not solve data integration upstream provides limited value. Successful deployments (Gävle Energi, Carta) demonstrate operational improvements when variance tools integrate with controlled close processes, but most finance organizations lack the data governance foundation to move beyond pilots. A Bain survey of 951 companies documents the variance problem directly: 37% of organizations targeted 11–20% cost savings from their AI initiatives, while nearly 40% of those landed in the 0–10% bucket instead—missing variance budgets by 30+ percentage points. This gap reflects both measurement discipline and execution uncertainty: companies struggle to baseline "before" states and isolate causal AI impact from organizational change. Successful practitioners (see Christophe Atten case) treat variance automation as a confidence-calibrated human-in-the-loop workflow, flagging low-confidence outputs for investigator review rather than aiming for full autonomy.

Even as adoption accelerates, fundamental constraints persist. Peer-reviewed research (Stanford AI Index, published Science, MIT CSAIL, May 2026) documents that AI models collapse on identical tasks when facts are reframed: GPT-4o drops from 98.2% accuracy to 64.4% (34-point collapse), DeepSeek R1 from 90% to 14.4% (76-point collapse). The AI Incident Database recorded 362 incidents in 2025 (55% increase from 2024), with 1,436 documented court cases involving AI-generated hallucinations. Vikas Malpani's June 2026 analysis estimates $67.4B in aggregate hallucination costs across 2024 enterprise deployments (direct losses $18.2B, operational cleanup $21.5B, reputational $27.7B), with 47% of enterprise AI users admitting they made major business decisions based on hallucinated content. These failure modes directly threaten variance narrative reliability: plausible-sounding explanations of budget drivers can be factually fabricated without detection. Gartner's 2026 Hype Cycle assessment rates domain-specific financial models—the category that includes AI-trained variance analysis—as still in adolescence, 2-5 years away from mainstream maturity, despite widespread GA product availability suggesting otherwise. The maturity gap is fundamental: variance analysis requires deterministic outputs (same inputs → same answer every time) with auditability and repeatability, but LLMs are probabilistic by design, incompatible with those requirements without external data grounding and validation layers. Production AI agent research (Princeton study, June 2026) reveals that accuracy improvements do not translate to reliability improvements: models become more accurate but exhibit unpredictable behavior, high sensitivity to minor prompt variations, and instability in step sequencing—critical risks when explanations must be auditable and defensible.

At the organizational level, the CFO accountability bar has sharpened in mid-2026. Surveys show 70% of finance executives are ready to cut AI budgets if business targets miss, and 73% report unmet AI expectations from 2025 investments. Only 28% of organizations see measurable financial impact from AI despite 92% deploying tools, revealing a persistent perception-to-reality gap. Finance-specific barriers compound technology challenges: Glenn Hopper's analysis shows only 1 in 14 CFOs (7%) report that AI investments made strong impact, despite 60% of teams having AI deployed; only 17% of finance professionals use AI in core workflows despite 56% using it somewhere (up from 28% in 2023)—indicating shadow AI usage and insufficient integration with decision workflows. Organizational adoption barriers dwarf technical ones: user proficiency accounts for 38% of AI implementation difficulty vs. only 16% technical issues (Prosci survey of 1,107 organizations), and training investment lifts adoption from 25% to 76%—yet most organizations underestimate change management requirements. Governance failures—inconsistent data definitions, missing baseline metrics, unaccountable pilots—remain the primary barrier to scaling variance automation, outpacing technical limitations. 70% of enterprise AI projects fail to reach production; a Caxy Interactive analysis documents five structural killers: data fragmentation (57% of organizations unprepared), UX gaps between demos and production, security/compliance burden, cost spirals, and organizational readiness gaps. Workiva's survey of 1,497 finance professionals found a 32-point gap between CFO claims of AI adoption and controller reports of actual deployment: presentation-layer automation (dashboards, narratives) masks unchanged manual data preparation and reconciliation underneath. Human-in-the-loop review remains standard practice for published variance narratives. Data governance maturity and organizational discipline—not vendor selection—determine whether teams convert pilots to production value. The 2026 inflection in adoption metrics reflects proven tooling and mature vendor ecosystems; scaling that adoption depends on solving the organizational and governance constraints that have consistently stalled finance AI at the pilot stage since 2023.

TIER HISTORY

ResearchJan-2023 → Apr-2024
Bleeding EdgeApr-2024 → Jul-2025
Leading EdgeJul-2025 → Oct-2025
Good PracticeOct-2025 → present

EVIDENCE (97)

— Synthesizes 2026 adoption research (Prosci n=1107, WalkMe/SAP, Gallup, SurveyMonkey): user proficiency accounts for 38% of difficulty vs 16% technical; training lift 76% vs 25% adoption; leadership trust gap (-1.50 to +1.65 on trust scale)—directly applicable to variance analysis adoption barriers.

— Quantifies AI failure economics: 95% of pilots show zero P&L impact, $67.4B estimated 2024 hallucination costs (direct, operational, reputational), 47% of AI users made major decisions on hallucinated content; identifies reliability tax as hidden cost structure.

— FP&A practitioner deployed LLM-based variance automation: 47-minute setup, 80% clean output, 20% flagged low-confidence, 6 hours/month saved, ROI paid back in first close cycle; demonstrates successful confidence-calibrated human-in-the-loop deployment.

— Wolters Kluwer (major accounting software vendor) distinguishes GenAI strengths (unstructured analysis, pattern finding) from limitations (structured calculations, repeatability); flags hallucination, variability, transparency risks; emphasizes human oversight requirement for audit defensibility.

Gävle Energi Case StudyCase Studies

— Swedish energy company deployed Pacera Mercur for integrated variance analysis across business areas; results: reduced manual work, improved planning quality, drill-down functionality enabled analysis across projects and facilities—named deployment with operational improvements.

— Bain survey of 951 companies: 37% targeted 11-20% cost savings, ~40% achieved only 0-10% instead—direct evidence of budget variance problem; paradox: 90% increasing budgets despite missing targets; only 7% run fully autonomous agents.

— AI Journal describes practical transformation: $10M manufacturing variance decomposed into cost increases (40%), production inefficiencies (35%), currency (25%) in seconds vs. days; role shifts from writer to investigator as AI automates 70-80% data-gathering work.

— Princeton study of 14 AI agent models over two years: accuracy improves significantly but reliability (stable, predictable behavior) improves much more slowly; models show sequential instability and high sensitivity to minor prompt variations—critical for variance narrative generation systems.

HISTORY

  • 2023-H1: Variance analysis automation integrated into record-to-report platforms (HighRadius, Workiva); fintech companies (Ramp) demonstrated variance management case studies; adoption remained concentrated in large enterprises with mature FP&A functions.
  • 2024-Q1: ClickUp launched dedicated AI agents for budget variance analysis; academic research documented 23% variance reduction and 35% accuracy gains in construction project deployments; adoption barriers persisted including data quality, organizational resistance, and implementation complexity.
  • 2024-Q2: Anaplan deployments scaled to mid-market (Abilene Christian University, 6-week production implementation); independent analyst validation emerged (Nucleus Research: 2 months annual work saved); Gartner survey identified budget variance explanation as #1 finance GenAI priority; public sector began AI-assisted workflows; human-in-the-loop verification remained industry standard, limiting pure automation potential.
  • 2024-Q3: Platform maturity expanded (HighRadius, Workiva, D365 Copilot); industry adoption signals strengthened (KPMG survey: 71% AI in finance, 57% exceeding ROI expectations; 66% of finance leaders see GenAI as most impactful for variance explanation); vendor confidence reflected in product roadmap investments; critical practitioner analysis highlighted persistent data preparation and causal reasoning bottlenecks limiting automation potential.
  • 2024-Q4: Mainstream platform integration accelerated: Microsoft Business Central added GA variance analysis Power BI reports (November); D365 Finance deployed AI co-pilots for variance-to-narrative workflows (December); KPMG US survey showed 78% of companies piloting/using AI for financial planning. Adoption remained constrained by data quality requirements and mid-market complexity rather than technical feasibility.
  • 2025-Q1: HighRadius and Workiva continued platform maturity (HighRadius reporting 1100+ customers with 95% forecast accuracy claims; Workiva powering public sector budget automation including City of Dubuque's production budget book workflow). Broader AI adoption uncertainty signaled by enterprise ROI timelines extending beyond six months; data quality challenges cited by 85% of finance leaders as primary barrier. Practice remained technically mature but adoption paced by organizational readiness and data governance rather than vendor availability.
  • 2025-Q2: Workiva reported 30% of customer base enabled AI features in production (May 2025 earnings call), signaling acceleration from prior periods and vendor monetization confidence. Competitive landscape expanded with specialized variance tools (Vena Copilot, Planful Predict) explicitly marketed for AI-assisted variance explanation. Practitioner adoption signals emerged (57% of finance professionals piloting or using AI). Organizational bottlenecks remained: data quality, human review requirements, and causal reasoning complexity in multi-entity models continued to pace mid-market adoption.
  • 2025-Q3: HighRadius scaled to 1000+ customers with AI-powered variance insights (99% accuracy, 60%+ automation); Microsoft Copilot for Finance entered public preview with variance analysis in Excel. Adoption expanded to 79% of FP&A teams using AI. Critical limitations documented: AI struggles with nuanced commentary, causal reasoning in multi-entity models, and stakeholder confidence. Human-in-the-loop verification remained industry standard, constraining pure automation ROI for mid-market organizations.
  • 2025-Q4: Microsoft Copilot for Finance 'Automate Variance Analysis' reached GA in October 2025, enabling recurring end-to-end variance automation with notifications. Pigment launched Analyst Agent in November for departmental variance narratives. However, economic headwinds emerged: Deloitte survey (October) found only 6% of AI projects broke even in under one year; MIT study found 95% of organizations saw zero GenAI ROI. Practice remained technically mature but ROI timelines and scaling challenges constrained mid-market adoption growth.
  • 2026-Jan: IBM Planning Analytics Workspace (v3.1.3) delivered GA variance analysis with AI-driven driver decomposition (price/volume) and AI summaries; continued mainstream platform maturity with minimal new deployment announcements signaling consolidation around existing vendor ecosystem.
  • 2026-Feb: Ecosystem consolidation accelerated with steady product advancement across major vendors: Workiva survey (1,497 respondents) confirmed 91% of finance leaders believe AI improved decision timeliness and 65% actively use AI in disclosures, suggesting mainstream production adoption. Pigment and Copilot for Finance offered production-grade variance automation with documented efficiency gains (30-40% close cycle reduction). Critical headwinds persisted: Gartner predicted 60% of AI projects lacking data readiness would be abandoned through 2026, and broad MIT/Gartner research showed 95% of organizations saw zero GenAI ROI. Implementation complexity and ROI expectations remained the constraining factors rather than technical capability.
  • 2026-Mar: Governance failures emerged as the primary adoption barrier, not technology: LedgerUp analysis revealed 32-point perception gap between CFOs claiming AI adoption and controllers reporting unchanged manual workflows. Finance industry analysis documented 80% overall AI project failure rate, with root cause identified as governance and data foundation gaps, not technology limitations. Accounting VC analysis cited Yale Fin-RATE research showing 18.6% accuracy degradation on temporal financial reasoning tasks critical to variance analysis. Named customer deployment (Carta/Pigment) documented 80% reduction in data aggregation time with 12-week implementation, validating production viability for data-ready organizations. Market consolidation continued with Aleph FP&A comparison showing AI-powered variance explanation as 2026 table-stakes capability across seven major platforms.
  • 2026-Apr: V7 Labs launched a production Business Performance Analysis Agent compressing monthly variance reporting from 1-2 days to 10-15 minutes, reinforcing the time-savings case for narrowly-scoped deployments. Ecosystem expansion included Datarails (product GA for AI-assisted variance narrative drafting), Microsoft Dynamics 365 Copilot (25-30% close cycle gains), and reinforced Microsoft Copilot for Finance adoption across enterprise base. Critical measurement data emerged: CFO.com benchmark showed best-in-class AI models fail 1 in 5 accounting tasks; Claryx documented 41% hallucination rate in financial NLP; Bain found 83% of CFOs planning AI increases but only 15-25% have scaled to production; Deloitte released 2026 framework for measuring AI ROI as organizations struggle with baseline gaps and accountability voids. The ROI gap sharpened: 70% of enterprise AI projects fail to reach production, with narrowly-scoped automation (54% success) significantly outperforming large-transformation initiatives (22% success) — validating focused, incremental rollout strategy for most finance teams.
  • 2026-May: Alteryx released a GA starter kit for AI-driven budget variance diagnosis with threshold alerting and narrative generation; independent tool benchmarks showed Claude at 64.4% vs Copilot at 4.4% on variance explanation (Wall Street Prep benchmark), with Copilot flagged for data fabrication risk when semantic model gaps exist. A Mercor benchmark of frontier models across 25 financial tasks found 16-20pp accuracy drops for visual inputs and systematic mathematical reasoning failures, adding to growing evidence that model reliability for variance narratives requires careful scope control and data grounding. A university facilities case study documented AI variance detection within 48 hours versus a 5-month manual lag, preventing $1.4M in downstream costs. Consero survey of 102 PE/VC-backed finance leaders (mid-May) positioned variance analysis and management reporting as the #1 ranked AI use case at 32% adoption with 3-6 month payback, signaling mainstream transition from bleeding-edge to good-practice adoption; 42% reported broad/fully embedded AI (up 20 points YoY), and 75% seeing ROI within 12 months. New GA products from Arbo and SkyStem brought vendor count to seven major platforms with native variance analysis automation. Enterprise budget burndown accelerated (71% of companies exceeded 2026 AI budgets in four months), making real-time variance monitoring and cost forecasting operationally acute. Independent analysis documents bimodal ROI distribution (early adopters 171% average, late majority 95% failure within 6 months) and Stanford AI Index confirming governance gaps (not technology) as binding constraint to production deployment, underscoring critical importance of controls infrastructure.
  • 2026-Jun: Peer-reviewed reliability research (Stanford AI Index, Science, MIT CSAIL) documented fundamental hallucination collapse: GPT-4o drops 34 points (98.2%→64.4%) and DeepSeek R1 drops 76 points (90%→14.4%) on identical tasks reframed differently, with 362 AI incidents in 2025 (55% increase)—directly threatening variance narrative reliability. Gartner Hype Cycle 2026 rates domain-specific financial variance models as "adolescent," 2-5 years from mainstream maturity despite widespread GA product availability. KPMG's survey of 1,013 senior finance leaders across 20 countries confirms AI delivers strongest gains in judgment-heavy work (70% decision quality, 71% speed)—the exact competencies variance explanation demands—while Deloitte synthesis (1,300+ leaders) confirms variance narratives "moved from forecast to deployed in single fiscal year" at 63% of major finance functions. ChatFin and Microsoft Copilot (Variance Analysis Agent, GA April 2026 in Excel) continue mainstream platform availability; CFO accountability bar sharpened with 70% ready to cut AI budgets if targets miss and 73% reporting unmet expectations from 2025 investments. Practitioner deployment confirmed confidence-calibrated human-in-the-loop as the working model: a named FP&A deployment achieved 80% clean variance output with 20% flagged low-confidence, saving 6 hours/month with ROI in the first close cycle; separately, Bain survey data (951 companies) quantified the AI cost-savings variance problem itself—37% of organizations targeted 11-20% savings but ~40% landed in the 0-10% bucket, missing their own AI budget targets by 30+ percentage points.