Perly Consulting │ Beck Eco

The State of Play

A living index of AI adoption across industries — where established practice meets the bleeding edge
UPDATED DAILY

The AI landscape doesn't move in one direction — it lurches. Some techniques leap from experiment to table stakes in a single quarter; others stall against regulatory walls, technical ceilings, or organisational inertia that no amount of hype can dislodge. Knowing which is which is the hard part. The State of Play cuts through the noise with a rigorously maintained index of AI techniques across every major business domain — classified by maturity, evidenced by real-world adoption, and updated daily so you always know where you stand relative to the field. Stop guessing. Start knowing.

The Daily Dispatch

A daily newsletter distilling the past two weeks of movement in a domain or two — delivered to your inbox while the index updates in the background.

AI Maturity by Domain

Each dot marks the weighted maturity of practices within a domain — hover for a brief summary, click for more detail

DOMAIN
BLEEDING EDGEESTABLISHED

Tax preparation & optimisation support

LEADING EDGE

TRAJECTORY

Stalled

AI that assists in tax preparation, identifies optimisation opportunities, and ensures compliance with tax regulations. Includes multi-jurisdiction tax calculation and transfer pricing support; distinct from regulatory filing which submits forms rather than calculating tax positions.

OVERVIEW

AI-assisted tax preparation is a leading-edge practice defined by a sharp bifurcation: enterprise platforms with human oversight are delivering proven ROI, while general-purpose LLMs applied directly to tax compliance remain too unreliable for professional use. Forward-leaning vendors and firms have moved past pilots into production deployments that materially reduce preparation time, compress compliance cycles, and cut audit exposure. Most organisations have not followed. Strategic intent runs high -- roughly four in five tax professionals see AI as transformative -- but fewer than two in five have committed resources. The gap is not mere inertia. LLMs still fail systematically on tax law interpretation, producing hallucinated case citations and documented error rates above 40% on expense and claim categorisation. The IRS has issued no formal guidance on AI-prepared returns, leaving practitioners fully liable for AI-assisted errors. Consumer trust is actively retreating. The result is a practice where the vanguard is extracting real value through structured, human-supervised workflows, while accuracy, regulatory, and liability barriers keep broad adoption firmly out of reach.

CURRENT LANDSCAPE

Deployment momentum has accelerated sharply. Intuit now operates 3 million AI agents processing 237 million transactions monthly and achieving median $12K tax liability reductions per filer; TurboTax Live revenue reached $2B in FY2025 at 47% growth. Thomson Reuters CoCounsel Tax achieved general availability with 32% average task time savings across research workflows; organizational adoption of agentic AI reached 34% of tax firms (up from 21% a year prior). Consumer mainstream adoption has surged: Adobe survey shows 26% of US workers used AI for tax filing in 2026, up from 11% in 2024 (136% YoY growth), validated by OpenAI's report of 4x surge in ChatGPT tax-related queries year-over-year. Professional governance maturity is crossing critical threshold: seven professional accounting bodies (ICAEW, ICAS, ACCA, AAPA, STEP, CTA, Tax Faculty) jointly codified AI ethics standards via PCRT (Professional Conduct in Relation to Taxation) in April 2026, moving from guidance to formal conduct standards; US regulatory framework is now active across AICPA professional competence requirements, SEC financial reporting standards, PCAOB audit inspection, and state laws (IL, CO, CA, NYC).

However, accuracy and liability barriers remain unresolved. Peer-reviewed research (Boston University TaxCalcBench) confirms frontier LLMs compute fewer than one-third of federal returns correctly with 82% prediction variance on input changes; independent benchmarking (ORCA V2) documents 78.3% ChatGPT instability on tax calculations. A Dext survey of 500 accountants found 50% aware of businesses suffering direct financial losses from AI-generated tax advice, with 40% of practitioners losing 4-10 hours weekly fixing AI errors and 43% warning of business insolvency risks. Practitioner-documented failures show AI hallucinating Form 1023 rules and PLR summaries in real tax work with leading 2025–2026 models. The gap between adoption growth and deployment readiness persists: only 18% of organisations track AI ROI despite widespread deployment; 61% of tax departments prioritize time-savings over business outcomes. The IRS has yet to issue formal liability guidance on AI-prepared returns. Bifurcated landscape solidifies: enterprise platforms with human oversight delivering documented ROI and governable workflows, while consumer-facing and unvetted LLM use poses systemic accuracy and compliance risk that regulatory frameworks have not yet addressed.

TIER HISTORY

ResearchJan-2020 → Jan-2020
Bleeding EdgeJan-2020 → Jan-2021
Leading EdgeJan-2021 → present

EVIDENCE (133)

— 2026 adoption guide for CPAs/EAs: AI tax planning software reduces prep time by 70%; 47% of tax professionals now prioritize AI investment (vs 18% using no automation).

— Major product GA: Intuit launched TurboTax as native connector in Claude (April 23, 2026), embedding tax prep with real-time refund estimates into conversational AI workflows.

— Editorial briefing identifies April 29-30, 2026 convergence point: AICPA credentialing (AI core competency), Intuit/Claude GA, Black Ore Tax Autopilot launch—signaling profession-wide inflection.

— AICPA official guidance codifying professional accountability standards for CPAs using AI tools and protecting client data (SSTS 1.3/1.4)—formalizing governance into practice requirements.

— AICPA guidance on AI automation of tax reporting and withholding (core prep functions)—covers AI agents, data classification, reconciliation, and real-time compliance risk flagging.

— Academic critical assessment (Texas A&M) documenting structural risks: hallucinated legal authority, inability to comprehend anti-abuse doctrines, attorney-client privilege gaps—key barriers to reliance.

— Tax law expert analysis: AI must be treated as instrumentality, not agent; identifies governance framework needs and privilege waiver risks in tax practice.

— Production deployment: TaxWorld's Ezylia AI tax assistant (RAG-based) achieved 97.5% resolution rate, 98% accuracy, 2,000+ daily queries, 500+ hours saved per week across small/mid-sized UK/Ireland practices.

HISTORY

  • 2020: TurboTax reached 42.7M U.S. units with 11% YoY growth; TurboTax Live expert-assisted service grew ~70%. ONESOURCE deployed at enterprise scale (KPMG, Global Tax Management) with documented close-cycle improvements. AICPA published automation roadmap highlighting ROI and implementation barriers. Consumer Free File adoption remained low at 2-3% despite IRS backing.

  • 2021: TurboTax Live sustained momentum with 90%+ YoY customer growth; Tax Knowledge Engine scaled to 37M+ consumers. H&R Block achieved market share gains with Block Experience digital strategy. ONESOURCE expanded to 10,000+ companies globally; TR launched Determination Anywhere edge-computing platform. Intuit's exit from IRS Free File (amid dark-pattern controversy) exposed accessibility barriers despite mainstream AI adoption in consumer and enterprise segments.

  • 2022-H1: TurboTax Live accelerated with 30% revenue growth ($1B annual run rate) and 20% customer growth; TR's Checkpoint Edge gained adoption among smaller CPA practices. Tax leaders signaled inflection point—70% of enterprises projected regulatory system access within 3 years, driving automation adoption. Critical trust barriers surfaced: Intuit faced $141M settlement for deceptive Free File steering; academic case studies warned AI fails on social/emotional context for vulnerable taxpayers; software testing found reliability gaps in tax calculations. Blue J Legal deployed ML for tax court outcome prediction (56-74% confidence), signaling frontier adoption in litigation planning.

  • 2022-H2: Enterprise and government deployments validated AI tax automation ROI. Informatica deployed ONESOURCE for global indirect tax automation with IT time savings; Forrester study showed 120% three-year ROI with 75% invoice error reduction. CPA market consolidation: UltraTax CS (20%), Drake (17.1%), Lacerte (13.3%), ProSeries (12.8%) dominated. OECD data: 50%+ of tax administrations using/planning RPA, with government case studies showing 97% time savings. Consumer trust remained challenged: TurboTax Virtual Expert Platform scaled to 730M annual AI interactions but user reports documented calculation failures triggering IRS audits. Tension persisted between validated enterprise ROI and consumer-facing reliability gaps.

  • 2023-H1: GenAI adoption sentiment accelerated with enterprise investment intentions reaching peak—KPMG survey shows 40% of C-suite leaders planning $10M+ AI investment in tax departments and 99% citing AI as 'next frontier.' However, deployment gaps remained stark: Thomson Reuters found only 4% of tax professionals actively using GenAI despite 78% optimism. Critical accuracy barriers became visible—ChatGPT tested 100% incorrectly on real tax scenarios, and TurboTax user reports documented ongoing calculation errors triggering IRS audits. ICAEW highlighted GPT-4's nascent tax capabilities (demonstrated at launch) alongside professional displacement concerns. Sentiment-to-deployment gap widened, suggesting adoption barriers despite enterprise enthusiasm.

  • 2023-H2: Vendor ecosystem matured with product announcements: Thomson Reuters unveiled Checkpoint Edge AI Assistant and SurePrep TaxCaddy auto-categorization (scheduled for 2024 GA), while Intuit Assist launched with fine-tuned financial LLMs for tax and accounting. Enterprise strategic commitment remained high—KPMG December survey rated Gen AI as top emerging technology with 100% expecting very high organizational impact. However, professional adoption remained cautious: Thomson Reuters corporate tax department survey documented awareness and planning but persistent skepticism about GenAI readiness. CPA practitioners reported GenAI accurate on stable historical rules but unreliable for recent tax law changes, underscoring accuracy-to-trust gap. Narrative remained unchanged: high enterprise optimism, low practitioner deployment, and critical accuracy barriers limiting immediate adoption.

  • 2024-Q1: Enterprise GenAI adoption accelerated with 55% of large tax departments ($2B+) either using (29%) or exploring (26%) AI tools; AICPA issued revised Standards on Standards for Tax Services (effective Jan 2024) formalizing professional standards for AI use. Forrester's March 2024 study validated ONESOURCE indirect tax ROI at $3.8M over 3 years with invoice error reduction (3% to <1%) and 50% team efficiency gains. However, critical production failures emerged: TurboTax's Intuit Assist and H&R Block's AI Tax Assist documented >30% inaccuracy rates in consumer deployments with millions of users. California DTFA announced government-scale GenAI deployment for tax advisory (10K peak daily calls) with accuracy risk mitigation. Regulatory environment tightened (FTC continued enforcement on Intuit). Narrative remained bifurcated: enterprise platforms delivered documented ROI while consumer-facing GenAI systems failed accuracy tests, widening trust and compliance barriers.

  • 2024-Q2: Practitioner adoption data exceeded sentiment surveys: Intuit QuickBooks survey (June 2024) found 98% of 707 US accountants had used AI for client services in prior year, with 69% using it for data entry and 65% for client portfolio management; 57% planned additional AI investment. Enterprise deployments continued: Orica deployed ONESOURCE for global tax provisioning with improved controls and compliance across jurisdictions. Consumer-facing accuracy failures persisted: IRS Taxpayer Advocate Service formally cautioned against relying solely on AI tax advice (June 2024), citing Washington Post findings of 30-50% chatbot error rates. Market dynamics shifted: Intuit lost 1M free TurboTax users to IRS direct file competition, though TurboTax Live (AI-enhanced expert-assisted) grew to $1.4B revenue with 24M customers using AI for explanations. Thomson Reuters survey (May 2024) showed 77% of tax professionals believe AI applicable, with 60% of corporates saying it should be used. Narrative: professional-level adoption accelerated with practical ROI documented, while consumer-facing GenAI chatbots highlighted accuracy and regulatory risks, reinforcing the bifurcated two-tier landscape.

  • 2024-Q3: Product ecosystem matured with major vendor releases: Thomson Reuters launched Checkpoint Edge with CoCounsel AI-assisted research (July 2024, US GA with 70% beta user satisfaction). Intuit deployed Claude via Amazon Bedrock in TurboTax for the 2024 tax season, scaling AI-driven explanations to millions of consumers (July 2024), with Q3 earnings showing 24M customers actively using Intuit Assist. Enterprise strategic commitment reached peak: KPMG September 2024 survey of 500 C-suite executives found 88% view GenAI as vital for Pillar Two global minimum tax compliance and 98% plan to invest in tax AI within 12 months. However, critical deployment challenges surfaced: Tax Executives Institute roundtable (August 2024) with practitioners from P&G, BDO, and Thomson Reuters identified persistent adoption barriers—data quality issues, rapidly evolving tax regulations creating a 'moving target' for AI models, and overwhelming workload preventing upskilling. AWS Marketplace reviews (August 2024) of ONESOURCE showed mixed real-world deployment outcomes: users praised accuracy and ERP integration but cited implementation complexity and cost as barriers, reflecting the gap between vendor claims and field experience. Narrative remained bifurcated: consumer platforms expanding reach (TurboTax 24M users, Claude deployment) and professional tools advancing, yet enterprise tax departments navigating significant operational hurdles despite peak strategic intent to invest.

  • 2024-Q4: Enterprise adoption matured with EY survey showing 87% of CTOs believe GenAI drives efficiency (up from 15% in 2023), though 75% remain in exploratory phases; ONESOURCE completed GA integration with SAP and IDC positioned Thomson Reuters as market leader. Consumer-facing systems deteriorated: H&R Block faced $7M FTC settlement for deceptive practices (November 2024); TurboTax Live deployed Claude via Bedrock to 24M users. International adoption validated: Polish peer-reviewed research shows 80%+ of EU tax advisors accept GenAI and would pay for specialized systems (€2.75M market estimate). Regulatory environment tightened: AICPA Standards enforced professional judgment requirements and IRS continued cautioning against sole reliance on AI tax advice. Core tension persisted: enterprise platforms with human oversight and ERP integration delivered documented efficiency gains, while general-purpose LLM chatbots posed systematic accuracy and liability risks.

  • 2025-Q1: GenAI adoption sentiment accelerated sharply with tax sector nearly tripling enterprise usage (8% to 21%) and 71% of professionals believing GenAI should be integrated into daily work; Intuit reported $90M annualized AI efficiencies and 90% data automation in TurboTax; Forrester study validated 148% ROI for enterprise Direct Tax deployment. However, critical accuracy and regulatory concerns emerged—Veriprajna documented systematic 100% failure rates in major LLMs for tax compliance tests, and CPA assessments found frequent calculation errors. Regulatory scrutiny intensified as IRS began patterns-based enforcement on AI-generated returns, widening gap between optimism and production-ready confidence.

  • 2025-Q2: Professional deployment accelerated with TurboTax Live revenue up 47% and AI agents reducing customer time by 12%; H&R Block and OpenAI announced partnership for 60,000 tax professionals. Thomson Reuters expanded ONESOURCE to Brazil and reported firms with AI strategies 2x more likely to see revenue growth. However, implementation barriers sharpened: 92% believed AI applicable but only 20% deployed enterprise-wide; 59% unsure vendors delivered promised AI; only 17% had formal AI policies. Tax law firms warned of AI limitation—inability to track 35 new IRS Practice Units in five months—with $10,000+ penalty risks. Implementation tensions deepened with ROI measurement gaps (only 20% measured) and skills development challenges limiting broader adoption despite peak strategic intent.

  • 2025-Q3: Deployment maturity advanced with consumer-facing platforms scaling and optimization tools expanding; Intuit reported FY2025 TurboTax Live revenue of $2.0B (47% growth) with AI agents reducing customer completion time by 12% and over 50% of returns filed in under one hour. H&R Block's AI-enhanced "Second Look" tax review service saw tenfold increase in new client adoption, identifying discrepancies in 25% of returns, signaling scaling of AI-driven tax optimization beyond preparation. Thomson Reuters documented real-world ROI with case examples: 60% error reduction in AI-automated document categorization and 25 minutes saved per return through intelligent processing. However, accuracy and reliability concerns deepened; AI chatbot failures documented (£13K loss from hallucinated case law, tax firm testing showing >50% inaccuracy rates), and tax attorneys warned of continued limitations—inability to handle FBAR nuances and complex international tax rules. The bifurcated landscape persisted: enterprise platforms delivering documented efficiency gains while consumer-facing AI systems remained unreliable, widening trust and regulatory risk gaps as IRS enforcement on AI-generated returns continued escalating.

  • 2025-Q4: Strategic enterprise adoption intent peaked with EY's global survey of 1,200 tax leaders (53 jurisdictions) showing 90% turning to GenAI for risk management and tax compliance strengthening. Intuit completed full deployment of AI agents to 2.8M customers (October 2025) with documented efficiency: 12 hours monthly savings and 5-day payment acceleration. Major vendor consolidation: H&R Block and OpenAI launched joint GenAI solution targeting 60,000+ tax professionals with deployment for 2026 tax season. However, technical maturity concerns deepened critically—Veriprajna's whitepaper documented fundamental LLM failure mode (Consensus Error) showing major systems fail consistently on tax law interpretation, creating unacceptable compliance risk. Tax practitioners reported persistent accuracy failures: hallucinated case law, 50%+ inaccuracy on complex questions, blind spots on FBAR and international rules, and unresolved liability gaps. Trust barriers hardened: 47% of tax firms desired AI but feared implementation (Thomson Reuters), with data security and regulatory liability cited as top barriers. The bifurcated landscape solidified: enterprise platforms with human oversight continued advancing with documented ROI, while technical limitations of general-purpose AI and unresolved liability made reliance on LLM chatbots legally untenable for professional tax compliance work.

  • 2026-Jan: Vendor momentum accelerated with Thomson Reuters launching ONESOURCE Sales and Use Tax AI (January 2026) with documented ROI—65% time reduction, 75% audit risk cut, $60K annual enterprise savings, and 30-day to 11-day compliance cycle compression. Intuit sustained deployment at 2.8M customers with margin expansion (FY2025 $18.8B revenue, 340bp margin expansion). Adoption intent remained high but uneven: Thomson Reuters survey showed 79% of tax pros believed in AI impact yet only 37% invested, with early adopters projected to unlock $52K per professional annually. However, deployment accuracy risks surfaced critically: Dext survey (500 Canadian accountants) documented 50% aware of financial losses from AI tax advice, with specific error rates (44% expense, 43% tax claims, 35% payroll); CPA analysis documented IRS regulatory vacuum, taxpayer liability, and data privacy risks. Bifurcated landscape persisted: productized enterprise platforms delivered ROI, professional adoption sentiment was high, yet regulatory uncertainty and accuracy risks remained deployment barriers.

  • 2026-Feb: Vendor ecosystem matured with Thomson Reuters launching CoCounsel Tax GA and Deloitte research confirming AI moved from speculation to operational necessity in tax departments. Forrester validated ONESOURCE Direct Tax ROI at 148% ($2.8M over three years with 50% time reduction), while Wolters Kluwer survey showed 40% of North American firms already using AI-driven tax research with 80% planning increased investment. However, consumer trust retreated sharply: only 37% of taxpayers would consider AI over a professional (down from 43%), signaling adoption barriers. Practitioner warnings escalated: Kennas documented hallucinated case citations in tribunal proceedings, 64% of businesses required professional cleanup of AI-generated advice, and systematic accuracy failures persisted despite enterprise vendor momentum. Bifurcated landscape hardened with professional-tier platforms demonstrating ROI while consumer confidence and accuracy reliability remained critical barriers.

  • 2026-Mar: Professional adoption reached mainstream thresholds — Thomson Reuters data shows 86% of tax professionals integrating AI weekly and a Pennsylvania CPA survey found 93% of large firms engaged, with 30-70% time savings documented; Intuit scaled to 3M+ AI agent users categorising 237M monthly transactions, while enterprise case studies (Amazon 99.6% ML accuracy, Zoom automated invoice processing) confirmed production-grade deployment. Countervailing signals emerged simultaneously: TurboTax growth decelerated to 8% as emerging competitors captured 12% market share, a US Tax Court case (Clinco v. Commissioner) recorded audit loss from hallucinated citations, and the IRS 2026 Dirty Dozen guidance explicitly warned against sole reliance on AI for complex tax questions.

  • 2026-Apr: Vendor ecosystem matured with Intuit expanding ecosystem integrations — TurboTax GA on Claude.ai and ChatGPT platforms reaches 40M users with 90% automation of form data entry and median $12K tax liability reductions; Thomson Reuters CoCounsel Tax reached GA with 32% average task time savings and agentic AI adoption among tax firms rising to 34% (from 21% a year prior). Professional adoption acceleration validated: Thomson Reuters 2026 AI in Professional Services Report shows org-wide adoption surge to 40% (from 22% in 2025), agentic AI at 15% with 77% expected by 2030; AICPA 2025 Report documents some firms achieving >80% automation of individual return preparation with human-in-loop verification via confidence-threshold flagging. Professional governance crossed a critical threshold: seven professional accounting bodies (ICAEW, ICAS, ACCA, AAPA, STEP, CTA, Tax Faculty) jointly codified AI ethics standards via PCRT (Professional Conduct in Relation to Taxation), moving from guidance to formal conduct standards; US regulatory framework simultaneously went active across AICPA professional competence requirements, SEC financial reporting standards, PCAOB audit inspection, and state laws (IL, CO, CA, NYC). However, practitioner adoption challenges intensified: Dext survey shows 50% of accountants aware of client losses from AI advice, 40% spending 4-10 hrs/week fixing errors, 43% forecasting business insolvencies; peer-reviewed accuracy research (Boston U TaxCalcBench) confirms frontier LLMs compute <1/3 of returns correctly with 82% prediction variance on input changes; independent ORCA V2 benchmark documents 78.3% ChatGPT instability on tax calculations. Agentic AI milestone: Basis AI achieved end-to-end partnership return automation without human keyboard input; firms deploying on 1040s. Critical strategic gap: 61% of tax departments prioritize efficiency over business outcomes (time savings vs. revenue growth), with only 18% tracking AI ROI—signaling maturity gap between adoption velocity and business value realization. Bifurcated landscape solidified: enterprise platforms with human oversight delivering documented ROI alongside unresolved systemic accuracy barriers limiting broader professional adoption.

  • 2026-May: A convergence of adoption and governance signals defined the month. Intuit reported 3M customers actively using its AI platform with revenue reaching $4.7B; 47% of tax professionals now prioritise AI investment (versus 18% using no automation), with AI reducing preparation time by 70% per industry benchmarks. The AICPA codified professional accountability standards for AI tool use under SSTS 1.3/1.4, formalising governance requirements into practice obligations. Against this, academic analysis (Texas A&M) documented structural LLM failures on tax law — hallucinated legal authority, inability to comprehend anti-abuse doctrines, attorney-client privilege gaps — reinforcing that human oversight remains mandatory for complex compliance work despite rapid production-scale deployment.