The AI landscape doesn't move in one direction — it lurches. Some techniques leap from experiment to table stakes in a single quarter; others stall against regulatory walls, technical ceilings, or organisational inertia that no amount of hype can dislodge. Knowing which is which is the hard part. The State of Play cuts through the noise with a rigorously maintained index of AI techniques across every major business domain — classified by maturity, evidenced by real-world adoption, and updated daily so you always know where you stand relative to the field. Stop guessing. Start knowing.
A daily newsletter distilling the past two weeks of movement in a domain or two — delivered to your inbox while the index updates in the background.
Each dot marks the weighted maturity of practices within a domain — hover for a brief summary, click for more detail
AI voice agents that replace traditional IVR menus with natural conversational phone support experiences. Includes voice-first customer service and natural language call routing; distinct from text-based chatbots which operate in written channels.
Voice AI for IVR replacement has crossed from leading-edge selectivity into geographic and sectoral expansion while governance readiness remains the binding constraint on broader organizational transition. Technological viability is established: production deployments in banking, government, healthcare, and logistics achieve 26-point improvement in first-call resolution (Zillow 69%→95%) and 85-94% cost reduction (Indian banks ₹63-114 vs ₹4-10 per call), with 90-95% autonomous resolution confirmed at 1.4M-call scale (NextPhone). Yet 74% of deployed agents are subsequently rolled back or shut down (Sinch survey of 2,527 executives), with paradoxically higher rollback rates (81%) among organizations with mature failure-detection infrastructure—indicating governance discovery, not capability deficiency, triggers disengagement. The practice is no longer constrained by model quality or platform features (major cloud platforms—AWS, Google, Zendesk—bundle production-ready capabilities; deployment timelines compressed from 6-12 months to weeks), but by organizational barriers: 84% of organizations fail AI compliance audits pre-deployment; integration complexity with CRM and payment systems remains a primary production blocker; architectural limitations (real-time orchestration failures at scale, latency stacking exceeding 1-second natural conversation threshold) expose infrastructure layer constraints that platform-level improvements cannot solve. Production readiness exists for structured, high-volume scenarios (contact center triage, government services, healthcare appointment scheduling) with organizations deploying disciplined governance, phased escalation design, and mature operational practice achieving 60-80% containment and 40-60% cost reduction. Broader organizational adoption remains blocked by integration complexity, governance readiness, compliance risk tolerance, and infrastructure orchestration barriers rather than technology maturity.
Voice AI for IVR replacement has crossed into mainstream enterprise adoption with production-scale deployments demonstrating viability in structured, high-volume scenarios—yet organizational and infrastructure barriers are crystallizing as the binding constraints preventing broader transition. Industry milestone: voice AI agents crossed 1 billion customer calls per month globally in February 2026, representing 400% growth from Q1 2025, with platform vendors (Vapi, ElevenLabs, Retell, Bland) collectively processing 500M+ calls monthly at sub-second latency. Production ROI remains compelling: voice AI costs $0.40-0.50 per call versus $6-8 for human agents, Zillow achieved 26-point FCR lift (69%→95%) on complex test cases, and 67% of Fortune 500 companies now run production voice AI with 340% year-over-year implementation growth. Geographic expansion signals adoption is no longer US/EU-centric: Indian banking deployments (Small Finance Bank ₹8.9 crore annual savings with 1.2-month payback, Mid-Size Private Bank ₹58.8 crore with 9-day payback, Large PSU Bank ₹199 crore with 5-day payback) demonstrate 85-94% cost reduction and 4-6% cross-sell uplift beyond offshore labor-cost parity. Production-scale validation from 1.4M+ real business calls confirms 90-95% autonomous resolution viability at commercial scale; vertical analysis shows resolution variance (Ecommerce 70-84%, SaaS 55-65%, Fintech lower) and cost savings ranging 39-47% depending on use case.
Vendor consolidation has accelerated around native speech-to-speech architectures eliminating intermediate ASR→LLM→TTS bottleneck. Five9 launched production Voice AI Agents June 24, 2026 with Exact Sciences achieving 45% autonomous containment and 60% lower handling time; AWS expanded Amazon Connect with dynamic voice/language personalization; Zendesk deployed human-like voice agents with $200M AI ARR; Google and Microsoft bundled voice as core CCaaS offering. Platform maturity compressed deployment timelines from 6-12 months to weeks; adoption now extends beyond enterprise to SMBs (GetDandy serving 10K+ small businesses, up from zero in 2023). Adoption surveys show 92% of organizations (3,000 consumers + 600 leaders, US/UK/Germany) have implemented or piloted AI in customer service; 80% of consumers willing to engage voice AI; 66% still prefer human agents.
Yet paradoxically, high failure rates are accelerating despite capability maturity. Sinch's 2026 survey of 2,527 enterprise decision-makers found 74% rolled back or shut down deployed AI agents after production launch, with rollback rates climbing to 81% among organizations with most mature governance frameworks—indicating better monitoring detects systemic issues that platform technology cannot solve. Five specific failure modes documented: (1) edge cases (agents hallucinate confident incorrect responses on unexpected inputs), (2) governance gaps (organizations with monitoring detect failures; unmonitored agents fail silently), (3) integration debt (agents unable to access CRM/scheduling/billing systems become voice-enabled chat, not operationally useful), (4) latency/performance collapse at scale (latency stacking 150ms ASR + 800ms LLM + 200ms TTS = 1.15s exceeds 1-second natural conversation threshold; POCs handle 10 concurrent calls successfully; 500 concurrent calls exceed performance ceiling), (5) escalation architecture failures (agents lacking context transfer to humans create worse customer experience). Lab-to-production gaps are substantial: third-party evaluation platform documented systematic failure modes (VAD false-triggers on background noise, speaker diarization errors, transcription accuracy collapse with signal-to-noise ratio drop, workflow state corruption from background speaker interference) across multiple customer engagements. Real-world handoff analysis reveals critical gap: 83% of consumers report they repeat themselves after AI-to-human transfer despite organizations claiming context preservation infrastructure.
Governance readiness has emerged as the primary adoption constraint. 84% of organizations fail AI compliance audits pre-deployment; week-7 procurement stall-out occurs when data governance is questioned; 96% of GDPR penalties trace to data governance gaps rather than malicious conduct. 84% of AI teams spend >50% of time building safety and compliance infrastructure rather than improving customer experience. Root pattern: vanguard organizations deploying in government, banking, healthcare with disciplined governance, mature operational practice, and phased escalation design achieve 60-80% containment and 40-60% cost reduction; broader organizational transition remains blocked by compliance readiness, integration complexity (CRM/payment system data flow), governance infrastructure burden, and organizational change management rather than core AI capability gaps. McKinsey research shows only 23% of agentic deployments achieve successful scaling; governance overhead erodes ROI in majority of cases. Adoption pressure from leadership is high (92% of organizations implementing/piloting), but execution barriers remain structural.
— Sinch survey of 2,527 enterprise leaders: 74% rollback/shutdown deployed voice agents; 81% among mature governance orgs. Key failure drivers: customer data exposure (>30%), hallucination (22%), diagnosis gaps (16%); infrastructure satisfaction predicts success better than guardrails.
— Five9 launches production Voice AI Agents June 24, 2026; Exact Sciences 45% autonomous containment with 60% lower handling time; PODS projected 100K+ calls/year-end; enterprise voice AI market $62B by 2034 (29.5% CAGR).
— Large survey (3,000 consumers + 600 leaders, US/UK/Germany): 92% implemented/piloted AI; 80% willing but 66% prefer human; critical signal—83% repeat themselves after AI-to-human handoff despite orgs claiming context preservation, revealing handoff architecture gap.
— Third-party evaluation platform documents systematic lab-to-production gaps: VAD failures on background noise, speaker diarization errors, transcription accuracy collapse with SNR drop, workflow state corruption. Solutions include synthetic multi-speaker scenario generation and CI-integrated evaluation.
— Production-scale validation: 1.4M+ real business calls, 90-95% autonomous resolution, four-dimension accuracy framework (ASR, intent, task completion, sentiment), validating IVR replacement viability at commercial scale across diverse businesses.
— Vertical variance quantified: Ecommerce 47% cost savings (70-84% resolution), SaaS 39% (55-65% resolution), Fintech lower due to compliance; named deployments (Anthropic 1,700 hours saved month-1, Topstep 65% on 150K calls) validate use-case-dependent containment ceilings.
— Geographic expansion: Small Finance Bank ₹8.9 crore annual savings (1.2-month payback), Mid-Size Private Bank ₹58.8 crore (9-day payback), Large PSU Bank ₹199 crore (5-day payback); 85-94% cost reduction; 61-75% autonomous resolution; 4-6% cross-sell uplift signals IVR replacement adoption beyond US/EU.
— Industry milestone: 1B AI voice calls/month globally (Feb 2026), 400% growth from Q1 2025; 85% first-call resolution vs 72% human baseline; healthcare 30%, BFSI 25%, retail 20% of volume; cost $0.40-0.50 per AI call vs $6-8 per human call.