The AI landscape doesn't move in one direction — it lurches. Some techniques leap from experiment to table stakes in a single quarter; others stall against regulatory walls, technical ceilings, or organisational inertia that no amount of hype can dislodge. Knowing which is which is the hard part. The State of Play cuts through the noise with a rigorously maintained index of AI techniques across every major business domain — classified by maturity, evidenced by real-world adoption, and updated daily so you always know where you stand relative to the field. Stop guessing. Start knowing.
A daily newsletter distilling the past two weeks of movement in a domain or two — delivered to your inbox while the index updates in the background.
Each dot marks the weighted maturity of practices within a domain — hover for a brief summary, click for more detail
AI that autonomously scores contract risk, generates assessment reports, and recommends accept/reject/negotiate decisions. Includes automated risk scoring and recommendation generation; distinct from risk flagging which highlights issues for human assessment rather than making recommendations.
Autonomous contract assessment -- AI that scores risk, generates reports, and recommends accept/reject/negotiate decisions -- has moved from bleeding-edge experiment to leading-edge production standard in high-volume triage, yet the practice faces a hard tier ceiling beyond routine work. Global adoption has normalized rapidly: 92% of lawyers across 10 countries now use AI daily; 87% of general counsel employ AI; 52% of in-house teams actively use or evaluate contract review AI (quadrupled since 2024). Deployments deliver measurable ROI for high-volume screening -- 40-60% efficiency gains, 75%+ time savings, 300-450% reported ROI. Yet regulatory frameworks and deployment realities impose binding constraints. EU AI Act (Annex III) classifies autonomous contract assessment as high-risk, mandating human oversight mechanisms and conformity assessment by December 2027; Article 14 design requirements mean oversight must be architectural (human-in-the-loop, not rubber-stamped approval), making fully autonomous assessment regulatory non-compliant in major markets. Production governance maturity lags significantly: Icertis' May 2026 survey of 1,000+ corporate legal practitioners found 47% would not detect unauthorized AI action until days or weeks, only 26% confident in AI accuracy for high-stakes decisions, and 40% accountability fragmented. Hallucination incidents have spiked: 1,200+ documented cases globally, with $145K in court sanctions in Q1 2026 alone. Autonomous scoring tools exhibit documented algorithmic bias. Contractual data-access barriers and governance gaps prevent 78% of agentic AI pilots from reaching production. The tier-defining tension is structural: the practice excels at high-volume first-pass screening where human review is downstream, but regulatory requirements and accuracy-on-complexity ceilings prevent autonomous decision-making on contested or complex agreements without solving governance, fairness, and liability exposure.
Production deployments at scale demonstrate the economic momentum, though maturity diverges sharply between routine screening and autonomous decision-making. Icertis Vera (June 2026 GA) introduces portfolio-wide autonomous risk assessment against business events across 1/3 Fortune 100 customer base; Microsoft Cloud Operations achieves 2-hour-to-15-minute contract-to-PO cycles through autonomous Icertis-SAP Ariba integration. LegalMind AI's production deployment automates 70% of workload across 3,400 contracts monthly through an eight-step autonomous pipeline (normalization, extraction, template comparison, risk scoring, compliance checking, summary, queue prioritization), compressing 4.2-hour reviews to 38 minutes and reducing infrastructure costs 76%. Skopx case studies document mid-market deployments achieving 80% time reduction (2.1 hours to 25 minutes) and financial services achieving 97.3% regulatory compliance accuracy with 0.89 correlation to attorney assessments. Concord's engine processes 10k+ contracts monthly with 94% autonomous risk-spotting accuracy, compressing review from 92 minutes to 26 seconds per contract. Inkvex's independent validation on 327 real contracts confirmed 94% catch rate of high-severity flags with 99% catch on auto-renewal clauses and 95% on liability caps. Vendor ecosystem consolidation is deepening: Icertis serves 250+ Fortune 500 customers with $350M ARR and 30%+ Fortune 100 penetration; LinkSquares reports 1,300+ teams managing 13M contracts with 800k+ hours saved. Global adoption has normalized: Wolters Kluwer's 810-lawyer survey across 10 countries shows 92% use AI daily, 62% report 6-20% time savings, and 61% are confident in AI-driven workflows.
However, autonomy maturity claims diverge sharply from deployment reality. Stanford Law School research (June 2026) demonstrates AI outperforms law professors 75% on contract law reasoning, but this bench-mark clarity masks persistent autonomy barriers. A production AI adjudication platform processing 23,000 cases reports that 100% human review remains required and user-preferred—even at scale and with mature systems. Axiom's survey of 500+ legal leaders across 8 countries shows only 31% at wide-scale autonomous deployment despite 96% adoption in some form; two-thirds remain piloting with 43% citing accuracy/reliability as top barrier. Conga's survey of 250 CLM professionals found 92% still require human review of AI outputs with governance and trust as biggest scaling barriers. Stanford AI Index 2026 benchmarks hallucination rates of 22-94% across 26 leading models, with best-performing models delivering incorrect answers in roughly 20% of responses—a reliability ceiling that directly constrains autonomous legal assessment viability for high-stakes decisions. The autonomy-maturity divergence is not a technology problem but a governance reality: production systems marketed as autonomous are operationally dependent on human oversight gates, escalation rules, and review thresholds that bind them to hybrid human-AI architectures rather than genuine autonomy.
The advancement barriers, however, are hardening rather than softening. Brittney Ball's April 2026 research documents 1,200+ AI hallucination incidents in legal proceedings globally (roughly 10 per day), with $145K in court sanctions in Q1 2026 alone and indefinite attorney suspension for filing 57 defective AI-generated citations. Thomson Reuters analysis identifies the strategic risk: 80% of legal professionals see AI as transformational, yet only 38% expect near-term organizational change, and Gartner projects over 40% of agentic AI projects will be discontinued by 2027. Real-world deployment data shows 17-34% error rates in production despite 95%+ accuracy benchmarks; governance and infrastructure gaps prevent 78% of agentic pilots from reaching production. Regulatory constraints compound the ceiling. EU AI Act Article 14 establishes that human oversight is a design requirement, not a staffing workaround; organizations achieving 98%+ approval rates without meaningful human judgment are regulatory non-compliant. Academic research demonstrates the underlying tension: higher autonomy compresses viable agency in regulated contexts—organizations cannot simultaneously maximize autonomous decision-making and satisfy regulatory human oversight mandates. Independent benchmarking establishes realistic performance ceilings: 50-75% of clause changes autonomous in steady state (vs. vendor claims of higher coverage), with 70-80% playbook coverage meaning 20-30% of changes always require human judgment. The bias vulnerability identified in January 2026 law review research persists: autonomous scoring tools systematically favor corporations over individuals in negotiation, creating direct liability exposure. Contractual data-access restrictions force reliance on generic models rather than fine-tuned deployment. Autonomous decision-making on complex or disputed agreements remains out of scope for all but the most risk-tolerant teams.
— Axiom survey of 500+ legal leaders across 8 countries: only 31% at wide-scale deployment, 66% piloting, 43% cite accuracy/reliability as top barrier. Contradicts leading-edge maturity claims.
— Stanford Law School blind evaluation: AI outperformed 16 law professors 75% on contract law reasoning (~3,000 comparisons). Only 3.53% AI answers flagged harmful vs 12.06% professor answers—demonstrating autonomous assessment capability threshold.
— Icertis Vera GA (June 2026): portfolio-wide autonomous risk assessment against business events; Vera Analytics drives strategic decisions and compliance monitoring across 1/3 Fortune 100 customer base.
— PocketOS incident: AI agent deleted production database and all backups autonomously. Real failure mode exemplifying accountability and governance gaps in autonomous systems.
— Microsoft Cloud Operations deployed Icertis integrated with SAP Ariba for autonomous assessment: reduced contract-to-PO time from 2 hours to 15 minutes with automated summaries and AI-driven approval routing.
— Two named deployments: mid-market SaaS 80% time reduction (2.1hr→25min), financial services 62% cost reduction with 97.3% regulatory compliance accuracy and 0.89 correlation with attorney assessments.
— LegalMind AI deployment: 70% workload automation, 4.2h→38min per contract, 76% infrastructure cost reduction, 3,400 contracts/month. Eight-step autonomous pipeline (normalization, extraction, template comparison, risk scoring, compliance, summary, prioritization) demonstrates end-to-end autonomous assessment at scale.
— Stanford benchmark: hallucination rates 22-94% across 26 leading models; best-performing model delivers incorrect answers in ~20% of responses. Foundational reliability ceiling constraining autonomous legal assessment viability.