The AI landscape doesn't move in one direction — it lurches. Some techniques leap from experiment to table stakes in a single quarter; others stall against regulatory walls, technical ceilings, or organisational inertia that no amount of hype can dislodge. Knowing which is which is the hard part. The State of Play cuts through the noise with a rigorously maintained index of AI techniques across every major business domain — classified by maturity, evidenced by real-world adoption, and updated daily so you always know where you stand relative to the field. Stop guessing. Start knowing.
A daily newsletter distilling the past two weeks of movement in a domain or two — delivered to your inbox while the index updates in the background.
AI for cross-functional workflow automation, document processing, and business process optimisation. Evenly split between good-practice and leading-edge: RPA and document extraction are mature; intelligent process mining and autonomous workflow orchestration are still proving out. One practice remains at research stage. Momentum is low — most practices are stalled, with gains coming from incremental automation rather than architectural shifts.
Operations is the domain where the gap between what AI can do and what organisations actually do with it is widest — and where, this fortnight, that gap finally got a number attached to it from the vendors' own books. The technical question is settled. AI-augmented robotic process automation, intelligent document processing, predictive maintenance, quality vision, scheduling solvers and process mining all work, are generally available from consolidated vendor stacks, and deliver documented returns at named enterprises: Forrester's study of 287 enterprise agent deployments puts average ROI at 540% within 18 months; pharma packaging lines hit 99.8% defect detection at 315% ROI; accounts payable runs at 60% touchless with cost-per-invoice falling from roughly $13 to under $3. McKinsey now counts 45% of the Fortune 500 running production AI agents, up from 8% in 2024 — the fastest enterprise software adoption curve in 25 years. The capability is real and the economics are proven in the right place.
The binding constraint, uniformly across all thirteen practices we track here, has moved off the technology and onto the organisation. The single most cited diagnosis this cycle is brutal: 79% of enterprises claim AI agent adoption, but only 11% have agents running in production at scale, and 88% of pilots never reach production at all. Gartner's inaugural agentic AI Hype Cycle forecasts that 40% of agentic projects will be cancelled by 2027. The reasons are monotonously consistent and almost never about model quality: data silos and data quality (cited as root cause in 70–85% of failures), absent governance (only around one in five organisations has a mature model for autonomous agents), orchestration immaturity, and a refusal to redesign jobs and processes around the new capability. Confluent's survey of 4,625 IT leaders is the sharpest version of the pattern: of the 32% running agents in production, 77% report stalled projects and 61% report abandonment — and 74% blame data infrastructure, not the AI.
The result is a domain that is bimodal, not stalled. Value concentrates ferociously in bounded, well-governed domains with stable processes — invoice processing, claims triage, HR intake, IT incident routing, machine-vision defect detection — where the top quartile of deployments clears 800% ROI. Everything that requires crossing system boundaries, normalising fragmented data, or routing genuinely unstructured work remains pilot-stage for the broad market. The distinguishing feature of this domain is that the leaders are now pulling away on execution discipline rather than tooling: they have the same platforms everyone else has, but they built governance, clean data pipelines and process redesign first. Operations is where "buy the platform" stops working and "fix the substrate" begins.
The headline of this scan is vendor financial proof colliding with structural failure data in the same fortnight. UiPath posted its first GAAP operating profitability — $28M on $418M revenue (up 17% year-on-year), ARR at $1.9B — with its CEO explicitly framing agentic products as moving "from pilot to production" at customer sites. Automation Anywhere disclosed 6 million agent executions in production, against 1,500 a year earlier: a roughly 4,000-fold scale-up that marks the pilot era closing. Against that, three independent datasets landed describing the same wall — Gartner's first agentic AI Hype Cycle (40% cancellation forecast by 2027), the "agentwashing" diagnosis (79% claim, 11% deliver), and Confluent's finding that 77% of production agents stall on data infrastructure. The vendors are profitable; their customers are mostly stuck. That is the story.
Underneath, the practices held position with one exception. Multi-system data synchronisation slipped from advancing to stalled: despite Microsoft's GA of low-latency Dataverse-to-Fabric sync (1M+ records/hour) and SAP's strategic Reltio acquisition, the integration substrate is not improving fast enough — only 27% of an average enterprise's 957 applications are integrated, and 50% of deployed agents still operate in isolation. Multimodal document understanding remains the sole advancing practice, but even there June research (MM-Snowball at ICML, FinDocMRE, attribution hallucination in Gemini-3.1-Pro) is shifting the binding constraint from raw model capability to architectural composition and interactive reliability. Elsewhere the additions reinforce rather than move: Honeywell's connected-sites footprint grew 32-fold since 2020 to 324,000 sites with a named Nvidia deployment (100,000 assets, 90% nuisance-alarm reduction); Google Document AI and AWS Bedrock Data Automation shipped GA document services; a Fortune 500 IDP deployment was documented wrongly approving $4.2M in invoices on miscalibrated confidence thresholds. Stability here is itself the signal: the constraints diagnosed a quarter ago are the same constraints, now better evidenced.
Vendor profitability now coexists with customer failure — and the two are not in contradiction. UiPath's first GAAP profit and Automation Anywhere's 4,000-fold execution growth are real proof that platforms have matured. But the same fortnight produced Gartner's 40% cancellation forecast and Confluent's finding that 77% of production agents stall. Vendors monetise the 11% who execute well plus the 79% still buying licences to pilot; the buyer's problem — turning a licence into production value — is unsolved at scale.
The constraint is the substrate, not the model. Across every practice, failure traces to data quality, silos and integration: 70–85% of failures in RPA, 74% of stalls in Confluent's data, 85% of IDP failures, 81% of process-mining respondents citing governance, only 27% of enterprise applications integrated. The downgrade of multi-system data synchronisation to stalled is the canary — the layer everything else depends on is not improving fast enough, which caps how far any agentic deployment above it can scale.
Silent failure is the production risk nobody has priced. A documented Fortune 500 IDP deployment wrongly approved $4.2M in invoices because confidence thresholds drifted without surfacing evidence; auditors put net-financial-exposure failure rates above 60%. Multimodal models exhibit "attribution hallucination" — correct answers with fabricated sourcing. In document, quality and exception-handling practices alike, the failure mode that matters is no longer the obvious error but the confident, undetected one, which is precisely why governance-first, human-in-the-loop architectures are becoming mandatory rather than optional.
Orchestration is the missing abstraction layer. Traditional orchestration (Airflow, Camunda, Step Functions) delivers 400%+ ROI reliably; agentic orchestration stalls — IDC finds 50% of organisations have 10+ agents but only 7% run them in full production, and ServiceNow's index shows 59% past pilots but only 9% with meaningful autonomous workflows. The market is converging on "durable by default" deterministic engines (Temporal, Mistral Workflows, Cloudflare Workflows V2) wrapping bounded AI calls, after multi-agent LLM systems showed 41–87% failure rates. Coordination, not intelligence, is the bottleneck.
Regulation is hardening the human-in-the-loop into a legal requirement. FDA 21 CFR Part 11 now mandates human review and explicit model governance for AI quality decisions; the EU AI Act's August 2026 provisions touch worker-management scheduling and document audit trails; 75% of FDA AI-related inspection findings cite inadequate SOPs. The practices furthest from mainstream — quality control, vendor management, process documentation — are precisely those where compliance is crystallising fastest, raising the floor on what "production-ready" means and widening the gap for organisations without documented, governable process infrastructure (84% still have none).
The Agentwashing Crisis: 79% Claim Adoption But Only 11% Ship to Production (adoption-metric) — The hardest single number in this domain: the 68-point gap between claiming and delivering operationalises the entire summary's "bimodal, not stalled" thesis and names the five structural barriers that separate the two groups. https://agentmarketcap.ai/blog/2026/06/07/agentwashing-crisis-enterprise-ai-agents-2026
Reaching Production Isn't the Finish Line: 77% of Deployed Agents Face Project Stall (adoption-metric) — Confluent's 4,625-IT-leader survey extends the adoption paradox past pilots into production: even organisations that shipped agents find 77% stall on data infrastructure, directly evidencing the "constraint is the substrate" tension. https://diginomica.com/reaching-production-isnt-finish-line-agentic-ai-its-where-problems-start
UiPath Achieves First GAAP Profitability as Agentic Products Move to Production (adoption-metric) — Vendor financial proof that platforms have matured: $418M revenue, $1.9B ARR, first operating profit. The contrast between this milestone and the same fortnight's failure data is the central tension the summary is built around. https://automationtoday.net/featuredarticles/uipath-reports-profitable-quarter-as-agentic-ai-products-move-into-production/
Automation Anywhere Reports 6M Agent Executions in Production (opinion) — A 4,000-fold scale-up in twelve months is the vendor-side signal that the pilot era is closing; the gap between this figure and customer stall rates illustrates how vendors monetise both the 11% who execute and the 89% still buying licences. https://automationtoday.net/featuredarticles/automation-anywheres-kuruganti-says-agentic-ai-needs-more-than-just-agents/
Gartner's First Agentic AI Hype Cycle: 40% Project Cancellation Forecast (industry-report) — Gartner naming "agentwashing" and forecasting 40% cancellation by 2027 is the canonical structural-failure signal for the fortnight; its inaugural status means this framing will set the analyst vocabulary for the next two years. https://xpander.ai/blog/gartner-hype-cycle-for-agentic-ai-what-it-means-for-ai-agent-development-platforms
Intelligent Document Processing in 2026: 7 Production Patterns (opinion) — The documented Fortune 500 case of $4.2M in wrongly approved invoices from miscalibrated confidence thresholds is the clearest evidence of "silent failure" as the production risk nobody has priced — the system kept approving, no one noticed. https://www.velsof.com/ai-automation/intelligent-document-processing-production-patterns/
Most Agentic AI Projects in Production Have Stalled Over Data Problems (adoption-metric) — HelpNetSecurity's write-up of the Confluent data (74% blame data infrastructure, not AI) is the cleanest direct citation for the "constraint is the substrate" key tension and the reason multi-system data synchronisation was downgraded to stalled. https://www.helpnetsecurity.com/2026/06/18/report-agentic-ai-in-production/
The Orchestration Gap: Why Process Automation Stalls in Operationally Complex Industries (research-paper) — Peer-reviewed research naming orchestration as the missing abstraction layer — distinct from capability limits — and identifying constraint enforcement, legacy bridging, and human approval routing as the load-bearing gaps that deterministic engines must fill. https://arxiv.org/abs/2606.19790v1
Augury Report: Industrial AI Reaches a Tipping Point (adoption-metric) — The 42%-to-14% tripling of enterprise deployers who have scaled AI across more than half their facilities, combined with data quality rising to #1 barrier (+20pp YoY), is the manufacturing-specific evidence for the bimodal split between leaders pulling away and the broad market stuck on substrate. https://www.augury.com/media-center/press/augury-report-industrial-ai-reaches-a-tipping-point/
Why AI Pilots Stall: What Must Be True to Scale AI in Manufacturing (industry-report) — NAMES 2026 summit synthesis names workflow redesign as the #1 success factor (61% of respondents) and documents four failure modes that are never about model quality — the uncomfortable truth that "buy the platform" stops working and "fix the substrate" begins. https://board.org/manufacturing/resources/why-ai-pilots-stall-what-must-be-true-to-scale-ai-in-manufacturing/