The AI landscape doesn't move in one direction — it lurches. Some techniques leap from experiment to table stakes in a single quarter; others stall against regulatory walls, technical ceilings, or organisational inertia that no amount of hype can dislodge. Knowing which is which is the hard part. The State of Play cuts through the noise with a rigorously maintained index of AI techniques across every major business domain — classified by maturity, evidenced by real-world adoption, and updated daily so you always know where you stand relative to the field. Stop guessing. Start knowing.
A daily newsletter distilling the past two weeks of movement in a domain or two — delivered to your inbox while the index updates in the background.
AI for managing contracts, regulation, governance, and organisational risk. Contract review and e-discovery are good practice with proven ROI; regulatory monitoring and due diligence are advancing steadily. Most of the domain sits at leading-edge — adoption is constrained by liability concerns and the need for domain-expert validation rather than by tooling gaps.
Legal AI has reached a paradoxical maturity. The adoption metrics are unambiguous: 92% of lawyers across the US, China, and nine EU countries now use AI tools daily, 87% of general counsel report active deployment, and 78% of Am Law 200 firms have integrated AI into workflows. Contract review, clause extraction, e-discovery, and template-based drafting cross into proven practice with validated returns -- 98% accuracy on production contract portfolios, 89-90% time reduction on standard drafting, and triple-digit ROI documented by Forrester and independent benchmarks. Thomson Reuters CoCounsel occupies a million seats across 107 countries. Relativity's aiR suite has logged over 240 million defensible document predictions. Harvey, the bespoke-drafting leader, reached $300M in annual recurring revenue at an $11B valuation, serving 142,000-plus lawyers. The technology argument, for most of this domain, is settled.
Yet the domain's defining characteristic in mid-2026 is not capability but stagnation at scale. The clear majority of the practices tracked here are stalled in their advancement trajectory -- including several that reached mainstream adoption years ago. The pattern is consistent: organisations buy tools but fail to embed them. Only around a quarter have fully implemented AI governance; 95% of enterprise generative-AI pilots deliver no measurable ROI; and the abandonment ratio from proof-of-concept to production runs as high as 33-to-4, with an average sunk cost of $7.2M per failed initiative. Legal-specific tool usage has, in places, declined as practitioners default to general-purpose alternatives. This is an implementation crisis, not a technology deficit, and the structural causes are now well documented.
The explanation is threefold. First, governance remains catastrophically underdeveloped: a 735-to-1 ratio of AI capability spend to governance and security spend reflects systematic under-investment in the infrastructure production deployment requires, and the consequences arrive in production -- a large share of autonomous-agent deployments have been rolled back or forced offline after governance failures surfaced only once they were live. Second, hallucination has hardened from a quality concern into a regulatory and liability event: courts now sanction attorneys directly, with the Ninth Circuit suspending lawyers and the Florida Supreme Court mandating written certification that cited cases exist. Third, the billable-hour model fragments the domain -- law firms capture efficiency internally while in-house teams, freed from that constraint, pull ahead, and 60% of in-house leaders report seeing no cost reduction passed to clients from their firms' AI gains. Financial-services compliance, by contrast, operates a generation ahead: SymphonyAI cuts manual AML investigation effort by 90%, Silent Eight processes over 100 million investigations across 150-plus markets, and tier-1 banks run agentic monitoring at scale. Those deployments demonstrate what is achievable -- and set regulatory expectations that less-resourced organisations now cannot ignore.
The headline movement this cycle is a reversal, not a leap. Three of the domain's most-watched practices -- autonomous compliance monitoring, bespoke contract generation, and contract clause extraction and risk assessment -- lost their "advancing" momentum and reverted to stalled. This is significant precisely because these were the practices gaining ground. Bespoke drafting illustrates the dynamic cleanly: Harvey scaled to $300M ARR and 142,000-plus lawyers, productivity gains held at an 89% reduction in time-to-first-draft, and corporate adoption jumped from 44% to 87% year on year -- yet the trend stalled because liability and ROI gaps hardened faster than capability advanced. Only 23% of adopters report high comfort with AI drafting, 82% do not measure ROI, and regulators across the FTC, California, and the EU have settled that deployer liability rests with the organisation, not the vendor. The same pattern recurred in autonomous compliance monitoring, where 86-89% of agentic pilots stalled or were shelved and the Federal Reserve's SR 26-2 withdrew formal guidance for generative and agentic systems pending new standards.
Underneath that reversal, the cycle's new research converges on a single, sharper claim: the governance-adoption mismatch is structural, not transitional. Fresh work from MIT, the AAA, and Gartner quantifies it -- governance failures, not model quality, account for 77% of project failures; 40% of enterprises will decommission autonomous agents by 2027; and an AAA survey of 500 senior legal and compliance leaders found that enforced governance frameworks rise from 1% among limited AI users to 60% among extensive users, confirming that deeper adoption without governance infrastructure predicts failure. Regulatory enforcement, meanwhile, moved from imminent to active. The SEC brought $42M in AI-washing charges, NYC's Local Law 144 bias-audit enforcement issued over $2M in violations on its first day, Delaware's Caremark fiduciary-oversight doctrine was applied to AI systems, and the EU AI Act's 2 August deadline now sits weeks away. Four practices held their advancing momentum -- audit anomaly detection, both flavours of regulatory-change monitoring, and risk-register horizon scanning -- a reminder that where the work is read-only and human-in-the-loop, AI continues to gain ground even as the autonomous frontier retreats.
The advancing frontier is retreating, not expanding. This cycle's three trend reversals share a cause: the practices that touch autonomous decision-making and direct client liability -- bespoke drafting, autonomous monitoring, clause-risk scoring -- hit governance and liability ceilings precisely as their tooling matured. Bespoke drafting confines itself to lower-risk, high-volume work with mandatory attorney review despite $11B-valued vendors; 86-89% of agentic compliance pilots stalled or were shelved. The practices still advancing -- audit anomaly detection, regulatory-change monitoring, horizon scanning -- are the ones where AI assists and a human decides. The dividing line is no longer capability; it is whether the organisation can own the output.
Governance is a structural deficit, not a transitional gap. The investment imbalance is now quantified at a 735-to-1 ratio of AI capability spend to governance spend, and the consequences land in production: a large share of autonomous-agent deployments have rolled back after failures surfaced only once live, against a backdrop where 90% of teams claim deployment readiness but 75% have experienced rollbacks. The AAA's finding that enforced frameworks scale from 1% to 60% with adoption depth confirms the mismatch is not random. Only around a quarter of organisations have fully implemented governance programs, and IBM's survey of 2,000 C-level executives finds just 26% can enforce the AI security strategy they have written -- a 51-point enforcement gap documented by multiple independent sources.
Hallucination has become a regulatory and liability event, not a quality concern. Independent trackers document well over 1,000 cases globally of AI-fabricated citations, accelerating at 30-50 new incidents a month, with sanctions ranging from $1K to $86K. The Ninth Circuit suspended two attorneys and imposed fines for nonexistent cases; the Florida Supreme Court now requires written certification that cited cases exist. Domain-trained tools remain the problem, not the cure: Princeton's benchmark put a frontier general model at 6.57% but Lexis+ AI at 17% and Westlaw at 34%. Sullivan & Cromwell -- an elite firm that advises OpenAI -- filed a brief with roughly 40 fabricated citations despite comprehensive controls, demolishing the assumption that governance frameworks alone solve the problem. The liability framework is shifting from user error to tool architecture.
Financial-services compliance sets expectations the rest of the domain cannot meet. SymphonyAI's agentic AML deployment cuts manual investigation effort by 90% with 99% false-positive reduction; WorkFusion's Tara agent processes over a million alerts daily across leading institutions; the Federal Reserve extended SR 11-7 model-risk governance to agentic systems and the FCA's review of 150-plus firms now demands evidence-based system tuning. These production-scale systems demonstrate the achievable ceiling -- but they also harden regulatory expectations for entity resolution, explainability, and auditable reasoning that organisations without comparable capital and data infrastructure cannot satisfy. Enforcement underlines the stakes: TD Bank's $1.3B penalty, OFSI's penalties on Deutsche Bank and Bank of Scotland, and Danish prosecutors seeking a record €880M from Nordea all trace to monitoring failures.
Regulatory fragmentation has hardened into incompatible regimes. The US chose deregulation (Executive Order 14110 revoked); the EU moves to binding enforcement with penalties up to €35M or 7% of global turnover on 2 August 2026; China requires state algorithmic approval. Unified multinational compliance programs are, on the Cloud Security Alliance's assessment, no longer feasible. The burden compounds because organisations must comply with regulations governing their AI tools while using those same tools to monitor a regulatory landscape generating 234 daily alerts across 1,374 regulators in 190 countries. The EU AI Act classifies litigation outcome prediction and most autonomous compliance monitoring as high-risk, requiring conformity assessment -- and surveys find the large majority of enterprises unable even to generate a basic inventory of their AI systems.
Ninth Circuit on AI Hallucinations: 17%-33% error rates in Westlaw/Lexis legal AI tools (case-study) — The court's own citation of peer-reviewed error rates for domain-trained legal AI tools destroys the industry assumption that legal-specific products are safer than general models, and the attorney suspensions turn what was a quality debate into a professional liability event. https://reason.com/volokh/2026/06/03/ninth-circuit-on-ai-hallucinations/
LePhantomCite: LLM Citation Hallucinations Benchmark (research-paper) — Princeton's finding that Lexis+ AI hallucinates at 17% and Westlaw at 34% against a frontier general model at 6.57% is the empirical anchor for the summary's claim that domain-trained tools are not the cure; it shifts the liability argument from user practice toward tool architecture. https://blog.citp.princeton.edu/2026/05/27/can-ai-reduce-burdens-on-courts-by-automatically-verifying-citations/
Florida Supreme Court Rules on AI Hallucinations (Effective June 15) (news-coverage) — Moving from case-by-case sanction to a standing court rule requiring written certification that cited cases exist represents the transition from anecdote to systemic regulatory response, and signals what other jurisdictions will follow. https://www.insurancejournal.com/news/southeast/2026/06/02/872039.htm
AI Agent Workforce Governance: The Enterprise Management Framework for 2026 (opinion) — The 86-89% pilot-stall rate for agentic AI is the single most important number in the summary's "advancing frontier retreating" argument; it quantifies precisely where autonomous compliance monitoring lost momentum this cycle. https://trusenta.com.au/blog/ai-agent-workforce-governance-enterprise
IBM Study: 67% of CIOs Own AI They Can't Control (adoption-metric) — A 2,000-executive survey across 33 geographies documenting that 77% report adoption outpacing governance and only 26% can enforce their AI security strategy provides the cross-industry evidence base for the summary's 735-to-1 governance-spend ratio claim. https://www.beri.net/article/2026-06-08-ibm-ai-control-gap-67-percent-cios-accountable
Harvey Revenue, Valuation & Funding — Sacra (adoption-metric) — $300M ARR at 400% YoY growth reaching 142,000-plus lawyers in 60-plus countries is the clearest proof that capability and adoption maturity in bespoke drafting are settled — which makes the simultaneous stall in trend advancement a structural, not commercial, problem. https://sacra.com/c/harvey/
AML Roundup — Record €880M Fine Against Nordea (adoption-metric) — The largest AML penalty sought in European history arrives as the summary describes finserv as a generation ahead of the rest of the domain; it demonstrates that the enforcement stakes driving that advanced investment are real and escalating, not hypothetical. https://kyc360.com/knowledge-hub/resources/aml-roundup-5th-june-2026-prosecutors-seek-record-880m-aml-fine-against-nordea
Celent Solution Brief — Agentic AI in Action, SymphonyAI (industry-report) — Independent analyst validation of up to 80% false-positive reduction through multi-agent AML workflows is the specific finserv benchmark the summary cites to show what is achievable — and to set the regulatory expectations less-resourced organisations now must meet. https://www.symphonyai.com/resources/analyst-report/financial-services/celent-agentic-ai-sensa-agent-flow/
Governance, Risk, and Compliance Platforms, Forrester Wave Q2 2026 (industry-report) — The Wave's finding that AI delivers "minimal value" in GRC platforms and continuous controls monitoring is in "embryonic stage" directly undercuts vendor claims and confirms the summary's point that governance infrastructure lags capability by a structural margin, not a transitional one. https://www.forrester.com/blogs/announcing-the-forrester-wave-governance-risk-and-compliance-platforms-q2-2026/
Why AI in Compliance Is Currently Hard to Deploy in Every Major Jurisdiction (opinion) — Eight parallel EU regulatory instruments with no horizontal coordination, mapped against US deregulation and UAE/Singapore divergence, is the practitioner-level evidence behind the summary's "incompatible regimes" claim and explains why unified multinational programs are no longer feasible. https://sheinebrief.substack.com/p/why-ai-in-compliance-is-currently