The AI landscape doesn't move in one direction — it lurches. Some techniques leap from experiment to table stakes in a single quarter; others stall against regulatory walls, technical ceilings, or organisational inertia that no amount of hype can dislodge. Knowing which is which is the hard part. The State of Play cuts through the noise with a rigorously maintained index of AI techniques across every major business domain — classified by maturity, evidenced by real-world adoption, and updated daily so you always know where you stand relative to the field. Stop guessing. Start knowing.
A daily newsletter distilling the past two weeks of movement in a domain or two — delivered to your inbox while the index updates in the background.
Each dot marks the weighted maturity of practices within a domain — hover for a brief summary, click for more detail
AI that monitors service level indicators and predicts SLA breaches before they occur, enabling proactive intervention. Includes predictive SLA risk scoring and early warning systems; distinct from APM which monitors application health rather than business-level commitments.
Predicting SLA breaches before they happen has transitioned from vendor feature to operationalised capability in large enterprises and SaaS platforms, yet remains inaccessible to mainstream IT operations. The vanguard -- LINE, United Airlines, Agos Ducato, BT Digital -- run sophisticated agentic and ML-based SLA prediction workflows integrated with ITSM platforms, achieving measurable breach prevention and MTTR improvements. New Relic, Dynatrace, and emerging platforms like Lyzr and StackOne now ship GA breach prediction as core observability features. However, mainstream adoption faces a persistent barrier: the gap between platform capability (prediction algorithms are proven) and organisational readiness (data quality, SRE maturity, integration discipline, and tool consolidation) continues to widen. Industry data shows 60% of MSPs have formalised SLA management programs and 70% of IT professionals prioritise SLO-based monitoring, signalling ecosystem maturity and mainstream awareness; yet implementation complexity and integration friction remain the binding constraints. For most mid-market and smaller teams, SLA breach prediction remains a purchased but undeployed vendor feature.
The vanguard is producing measurable operational wins at scale. Dynatrace-ServiceNow integrations have reached GA for autonomous incident workflows; Agos Ducato (Credit Agricole) achieved 30-point lift in critical transaction success (65%→95%) and 30-second latency reduction. United Airlines operates ~800 Dynatrace-monitored applications with documented top on-time performance. New Relic shipped SRE Agent (full incident lifecycle automation) and reported 25% faster incident resolution, 80% higher deployment frequency, and 27% less alert noise among AI-enabled operations teams. In May 2026, technological maturity continued advancing: a large telecom operator (25M subscribers) deployed ML-based SLA breach prediction achieving 40% breach reduction and $3.5M annual penalty savings; Dynatrace released Intelligence GA as the first agentic operations system combining deterministic SLO insights with autonomous remediation. New agentic breach prediction platforms emerged: StackOne deployed AI agents predicting breach probability by monitoring ticket burn rate and queue depth; Lyzr released 'Breach Predict' agents with customer reports of 30% critical incident reduction; LINE (Japanese platform) deployed SLI/SLO-centric observability with automated breach detection tied to user-facing SLA targets. Peer-reviewed research (May 2026, arXiv) demonstrated transformer-based breach prediction achieving 30-minute advance warning for data center colocation SLAs using per-customer multi-head attention models.
This activity masks a widening bifurcation. SaaS observability vendors (New Relic, Dynatrace, Chronosphere, emerging agentic platforms) achieved production breach prediction with enterprise deployments; mainstream ITSM platforms (ServiceNow on-premise, Jira Service Management) retain calculation accuracy gaps, automation reliability issues, and class-imbalance problems blocking prediction. Industry adoption metrics are maturing: 60% of MSPs now operate formal Customer Success programs with structured SLA management; 70% of IT professionals prioritise SLO-based monitoring; 52-74% of tech companies and telcos deployed AI monitoring capabilities; GitLab publicly documented error budgets as operational release-gating mechanism at a leading-edge tech company; a 10-vendor SLA tracking ecosystem (Zendesk, Freshdesk, ServiceNow, Datadog, etc.) has standardised real-time breach alerting as table-stakes -- yet these metrics reflect widespread threshold-alerting adoption, not breach prediction. The barrier remains organisational: McKinsey data shows 6% of organisations achieve meaningful AI ROI; ServiceNow Predictive Intelligence documentation lists 20+ implementation failure modes (data quality, label corruption); Broadcom surveys find 98% of IT teams cite automation/integration issues as root cause of SLA breaches, not inadequate tooling. Organisational readiness gaps -- data quality discipline, SRE maturity, integration architecture, business alignment -- constrain deployment of proven prediction capabilities across the broader market, even as platform vendors accelerate agentic AI shipping and market growth (13.7% CAGR, USD 1.38B in 2024 to projected USD 4.21B by 2033) continues.
Critical blockers to autonomous deployment were documented by independent practitioners: infrastructure hygiene (data quality, staging/production parity) must precede agentic automation; organizations cannot delegate SLA breach prevention to AI agents without first achieving operational maturity (clean pipelines, unified tooling, SRE discipline); 80-90% of AI agent projects fail in production due to unrealistic assumptions about infrastructure readiness, not algorithm limitations. This fundamental asymmetry—vendor platform maturity exceeding organizational deployment readiness—is the defining constraint preventing SLA breach prediction from crossing from leading-edge practice (SaaS vendors, Fortune 500 early adopters) into mainstream operations.
— Public tech company deploying error budgets as operational enforcement mechanism—when consumed, triggers policy changes and gates release velocity, demonstrating mainstream adoption of SLO-driven SLA management at leading-edge organization.
— Five predictive breach detection techniques (burn-rate monitoring, pattern analysis, ML risk scoring, queue analysis, dependency risk) with claimed 30-50% breach reduction, but acknowledges that most organizations remain at reactive threshold-alerting levels.
— Regulatory-driven three-tier SLO framework mapping to FDIC, EU DORA, and SEC requirements; error budget governance gates releases based on breach risk, demonstrating high-stakes SLA management in compliance-driven domains.
— Salesforce GA feature forecasting SLA breach likelihood in healthcare prior authorization workflows, enabling predictive intervention to identify delay factors before SLA breaches occur.
— Peer-reviewed transformer-based SLA breach prediction framework achieving 30-minute advance warning for data center colocation SLAs (power, temperature, humidity) with structured role-specific output schemas for finance, ops, and compliance.
— Maturity framework positioning SLA evolution as Reactive→Predictive→Autonomous with AI-driven breach detection as transformational capability; aligns ITIL 5 practices with predictive SLA management.
— Production-ready SLO implementation guide with Prometheus/Grafana queries, error budget burn-rate calculations, and multi-service SLO composition patterns demonstrating mainstream technical adoption.
— LY Corporation (LINE messaging platform) production deployment of SLI/SLO framework across critical services, defining critical user journeys, SLI targets (p99.9 latency, 99.999% success), and dashboard-driven SLO ownership model.