The AI landscape doesn't move in one direction — it lurches. Some techniques leap from experiment to table stakes in a single quarter; others stall against regulatory walls, technical ceilings, or organisational inertia that no amount of hype can dislodge. Knowing which is which is the hard part. The State of Play cuts through the noise with a rigorously maintained index of AI techniques across every major business domain — classified by maturity, evidenced by real-world adoption, and updated daily so you always know where you stand relative to the field. Stop guessing. Start knowing.
A daily newsletter distilling the past two weeks of movement in a domain or two — delivered to your inbox while the index updates in the background.
AI for financial operations, reporting, planning, and risk management. Over half the practices are good practice: fraud detection, expense management, invoice processing, and financial forecasting have mainstream adoption. Regulatory compliance and audit automation are advancing. The domain is tightly clustered around good-practice with minimal bleeding-edge — finance favours proven, auditable tools over experimental ones.
Finance and accounting is a domain where AI has won every technical argument and lost most organizational ones. Across the sixteen practices we track, the vendor ecosystem is mature, the tooling is generally available, and the ROI case studies are abundant. Supplier and spend analytics has reached established status -- 92% adoption among procurement organizations, with Coupa managing $472 billion in transactions and the Hackett Group ranking spend analytics as 2026's number one transformation initiative. Invoice processing, cash flow prediction, collections automation, expense management, fraud detection, budget variance analysis, financial forecasting, and insurance underwriting all sit at proven maturity with GA platforms from the major ERP and fintech vendors. The technology question is settled across most of this domain.
What is not settled is whether organizations can absorb what they have bought. The execution gap is the defining feature of Finance & Accounting AI in mid-2026, and it is widening rather than narrowing. An Oliver Wyman survey of 500 CFOs controlling 12% of global market capitalization found only 8% have deployed AI agents at scale, with 74% still in planning or piloting. Gartner reports that only 7% of CFOs see strong AI impact -- a 93% disappointment rate. PwC's Global CEO Survey of 4,454 leaders found 56% reporting zero financial returns from AI despite significant investment. The pattern repeats across every practice: 87% of CFOs say AI is critical to strategy, yet only 21% of deployed solutions deliver tangible value. The number 73% appears with uncomfortable frequency -- 73% of enterprise AI projects fail to deliver ROI, 73% of data leaders identify data quality as the number one barrier, 73% of GenAI implementations fail according to a 140-deployment analysis.
The exceptions prove the structural logic. EY has deployed agentic AI to 130,000 auditors across 160,000 engagements, processing 1.4 trillion journal entries annually -- the largest professional services agent deployment to date. Allstate's ALLIE began closing insurance policies live in three states in May. Oracle shipped 1,000 agentic agents across its Fusion customer base. Walmart's autonomous procurement agent negotiated 2,000 supplier contracts with a 68-72% agreement rate and 3% cost savings. These are real deployments at genuine scale, but they share a common trait: they occur inside organizations with enterprise-grade data governance, dedicated AI teams, and multi-year investment programs. The gap between these vanguard institutions and the median finance function is not closing. It is, by most measures, getting wider.
No practice changed tier or trend in this scan cycle, which is itself significant: stability across sixteen practices suggests the domain has settled into a structural equilibrium where the forces driving adoption and the forces constraining it are roughly balanced.
The most consequential new evidence centers on the Big Four's production-scale commitments. EY's deployment of agentic AI across 160,000 audit engagements marks a shift from optional tooling to embedded infrastructure at the profession's highest level. KPMG launched a GA financial close AI assistant powered by Google Gemini Enterprise and integrated with Workday. Against this, KPMG is simultaneously cutting 10% of its US audit partners while expanding AI pilots -- a signal that the technology is restructuring professional services headcount, not merely augmenting it.
A DualEntry Labs benchmark of 19 AI models on 101 real accounting workflows established an accuracy ceiling that constrains the entire domain: the top performer, Claude Opus 4.7, reached 79.2% accuracy, with month-end close tasks scoring just 50%. No model exceeded 70% overall. This is the structural reason financial close automation remains stalled at 2% full adoption and why human-in-the-loop review persists as standard practice. Meanwhile, MIT and Stanford researchers studying 277 accountants at 79 firms documented the productivity upside: AI enabled a 7.5-day faster month-end close and shifted 8.5% of time from routine processing to analytical work, but 62% of participants expressed error concerns.
On the regulatory front, the CFPB issued a final rule eliminating disparate impact liability under ECOA -- a seismic shift for AI credit scoring that removes a key fair-lending enforcement mechanism. The EU AI Act's August 2026 compliance deadline for high-risk credit scoring systems is now three months away. Seven UK professional accounting bodies jointly codified AI ethics standards via formal conduct rules, and the IRS expanded to 125+ AI/ML models (up from 54 in 2024) for enforcement, despite staff reductions.
Upstart's Q1 2026 earnings confirmed continued production momentum -- 425,000 loans originated, 173.6% accuracy advantage over FICO -- but a simultaneous securities class action alleged that its Model 22 fundamentally failed to account for macroeconomic factors, overstated accuracy, and caused $70 million in missed revenue guidance. The litigation crystallizes the model governance risk that shadows every AI deployment in high-stakes financial workflows.
The 93% disappointment rate is structural, not transitional. Gartner's finding that 93% of CFOs report disappointment with AI impact, PwC's finding that 56% of CEOs see zero financial returns, and the Bain data showing only 31% of CFOs report positive outcomes are not early-adoption growing pains. Root cause analysis from TechFastForward identifies the failure mode as "AI without a home" -- technology deployed without operational ownership, governance frameworks, or measurement infrastructure. Only 18% of organizations track AI ROI, and 82% of boards lack AI ROI measurement capability. The gap is not closing because most finance functions have not invested in the organizational architecture required to make AI work: clean data, unified process definitions, change management capacity, and accountability structures. Vendor capability advances -- Oracle's 1,000 agentic agents, SAP's AI-native Ariba, BlackLine's Verity suite -- make no difference without this foundation.
Accuracy ceilings bind the domain at a fundamental level. The DualEntry benchmark establishing a 79.2% accuracy ceiling across AI models on real accounting tasks, with month-end close at 50%, is not an engineering defect that better models will fix. Peer-reviewed research from OpenAI and Georgia Tech has proven that hallucinations are mathematically inevitable in current architectures. Independent benchmarks document 4.2-19.1% hallucination rates across frontier models, with citation accuracy the worst-performing dimension at 12.4%. For a domain where a single misclassified revenue recognition entry triggers restatement risk, and where ASC 606 requires 100% precision on contract modifications, this is a hard ceiling. It explains why 97% of finance professionals demand human oversight of AI outputs and why financial close remains 97% manual despite six years of vendor investment.
Regulatory fragmentation is reshaping platform architecture, not just compliance cost. The EU AI Act (August 2026 enforcement), Colorado AI Act (June 2026), CFPB fair-lending changes, COSO internal control frameworks for GenAI, state-by-state insurance mandates ($107 million in fines in January 2026 alone), and IRS enforcement expansion are not a single regulatory trend -- they are a multi-jurisdictional compliance surface that forces different architectural decisions for different markets. Compliance-by-design enforcement -- hard constraints encoded into platform logic rather than probabilistic guidance -- has become the operational baseline for collections, insurance claims, and credit scoring. Organizations that treated regulatory compliance as a cost center now find it is an architectural requirement that shapes which AI capabilities they can deploy and where.
The Big Four pivot signals a labor market restructuring, not an augmentation story. EY deploying agentic AI to 130,000 auditors, KPMG cutting audit partners while expanding AI, Big Four graduate openings falling 44% year-over-year, and automation penetration ranging from 38% to 80% by task type -- these are not efficiency gains. They are the early signals of a structural labor reallocation in professional services. The firms that built their business models on leverage ratios and billable hours are rebuilding around AI-augmented throughput. For finance leaders evaluating audit and advisory relationships, the implication is that the service delivery model they are buying is changing underneath them.
Insurance AI is breaking its own economic model. AI precision pricing threatens the foundational pooling model of insurance. State Farm and Allstate withdrew from California homeowners markets because granular AI models revealed their legacy pricing had systematically undercharged high-risk properties. Class-action litigation against major carriers alleges AI claims systems use proxy discrimination through voice analysis, geolocation, and browser history. UnitedHealth faces discovery orders revealing the scope of its algorithmic claim denial system, which independent analysis found had a 90% error rate. The insurance industry is simultaneously the sector with the most mature AI deployment evidence (Aviva's 80+ models delivering 60 million pounds in annual value, AIG targeting 500,000 submissions processed) and the sector where AI deployment is creating existential business model and litigation risk.
EY Launches Enterprise-Scale Agentic AI to Redefine the Audit Experience (product-ga) — The largest professional services agent deployment to date — 130,000 auditors across 160,000 engagements — is the primary counter-evidence to the 93% disappointment narrative, but its existence depends on multi-year investment infrastructure that most finance functions do not have. https://www.ey.com/en_my/newsroom/2026/04/ey-launches-enterprise-scale-agentic-ai-to-redefine-the-audit-experience-for-the-ai-era
The Best AI Model Still Fails 1 in 5 Accounting Tasks (industry benchmark) — The DualEntry Labs finding that Claude Opus 4.7 tops out at 79.2% accuracy, with month-end close tasks at 50%, is the structural ceiling that explains why financial close remains 97% manual despite six years of vendor investment — not a capability gap that next-model releases will fix. https://www.cfo.com/news/the-best-ai-model-still-fails-1-in-5-accounting-tasks-Claude-Opus-OpenAI-GPT/818100/
CFOs of 12% of Market Cap on AI, Growth, and Transformation (industry-report) — Oliver Wyman's survey of 500 CFOs controlling 12% of global market capitalization, finding only 8% have deployed AI agents at scale with 74% still in planning or piloting, is the most authoritative quantification of the execution gap at the top of the finance function. https://www.oliverwymanforum.com/agenda-cfo/playbook-cfo-strategy-growth-cost-ai.html
KPMG Introduces Month-End Close AI Assistant (product-ga) — A Big Four firm shipping GA agentic close tooling powered by Google Gemini Enterprise and integrated with Workday confirms the technology is production-ready at the profession's highest tier — while simultaneous partner headcount cuts signal this is a restructuring event, not an augmentation story. https://www.cpapracticeadvisor.com/2026/04/23/kpmg-introduces-month-end-close-ai-digital-assistant/182146/
Judge Orders UnitedHealth to Disclose Details of Algorithmic Coverage Denial Tool (news-coverage) — A federal discovery order forcing UnitedHealth to expose the full development and deployment record of nH Predict crystallizes the litigation risk shadowing every AI deployment in high-stakes financial workflows, and sets a precedent for discovery scope that other carriers now face. https://www.business-humanrights.org/en/latest-news/usa-judge-orders-unitedhealth-to-disclose-details-of-its-use-of-an-algorithmic-tool-in-class-action-over-alleged-ai-driven-coverage-denials/
CFPB's New Fair Lending Rule Is Out — No Surprises But Big Changes (industry-report) — The April 2026 elimination of disparate impact liability under ECOA removes the primary statistical discrimination enforcement mechanism for AI credit scoring — a seismic regulatory shift that widens the deployment window for algorithmic underwriting while stripping a key consumer protection in the same move. https://www.bakerdonelson.com/cfpbs-new-fair-lending-rule-is-out-no-surprises-but-big-changes
Upstart Holdings Q1 2026 Earnings Call Transcript (adoption-metric) — Upstart's production metrics — 425,000 loans originated, 173.6% accuracy advantage over FICO, 3.5% more approvals at equivalent risk — represent the most detailed public disclosure of an AI credit model's operational performance, making it a critical benchmark for the entire sector. https://www.insidermonkey.com/blog/upstart-holdings-inc-nasdaqupst-q1-2026-earnings-call-transcript-1754867/
Pomerantz Law Firm Files Class Action Against Upstart Holdings (news-coverage) — Filed five days after the earnings call above, the allegation that Upstart's Model 22 overstated accuracy and caused $70 million in missed revenue guidance crystallizes the model governance risk that makes even well-performing AI credit systems litigation targets the moment macro conditions shift. https://www.afslaw.com/perspectives/alerts/federal-court-orders-broad-discovery-against-uhc-ai-coverage-denial-lawsuit
AI Insights May 1 2026: Allstate's ALLIE Closing Policies Live in Three States (case-study) — Allstate's ALLIE represents the clearest evidence that agentic insurance automation has crossed from pilot to production transaction-level deployment — while the same carrier has withdrawn from California homeowners entirely, illustrating how AI precision can simultaneously enable and destroy market participation. https://insuranceindustry.ai/ai-insights-may-1-2026/
The IRS Now Uses AI to Flag Audits: What Practitioners Need to Know (opinion) — IRS expansion to 125+ AI/ML models under formal IRM 10.24.1 governance — up from 54 in 2024 — demonstrates that the federal government's primary enforcement body has moved AI from experiment to operational infrastructure faster than most private finance functions, despite significant staff reductions. https://www.nexairi.com/article/Accounting/irs-ai-audit-selection-practitioners-guide/