Perly Consulting │ Beck Eco

The State of Play

A living index of AI adoption across industries — where established practice meets the bleeding edge
UPDATED DAILY

The AI landscape doesn't move in one direction — it lurches. Some techniques leap from experiment to table stakes in a single quarter; others stall against regulatory walls, technical ceilings, or organisational inertia that no amount of hype can dislodge. Knowing which is which is the hard part. The State of Play cuts through the noise with a rigorously maintained index of AI techniques across every major business domain — classified by maturity, evidenced by real-world adoption, and updated daily so you always know where you stand relative to the field. Stop guessing. Start knowing.

The Daily Dispatch

A daily newsletter distilling the past two weeks of movement in a domain or two — delivered to your inbox while the index updates in the background.

AI Maturity by Domain

Each dot marks the weighted maturity of practices within a domain — hover for a brief summary, click for more detail

DOMAIN
BLEEDING EDGEESTABLISHED

Customer support chatbots — scripted

GOOD PRACTICE

TRAJECTORY

Declining

Rule-based or decision-tree chatbots that handle common customer queries through predefined conversation flows. Includes button-driven flows and FAQ matching; distinct from LLM-powered chatbots which generate responses dynamically rather than following scripts.

OVERVIEW

Scripted chatbots are a mature, proven automation tool for narrow customer support tasks -- and a technology whose growth cycle has ended. Rule-based and decision-tree systems that route queries through predefined flows reached good-practice status around 2021, with GA tooling from Zendesk, Dydu, and Zoho, documented ROI across verticals, and widespread enterprise deployment. The economics are settled: deflection rates of 40--70% on Tier 1 volume at a fraction of human agent cost make the business case straightforward for FAQ handling, order status, and returns. Banking alone has reached 88--92% adoption among major institutions. The defining tension is not whether scripted bots work -- they do, reliably, within well-scoped boundaries -- but whether they remain strategically relevant. LLM-powered alternatives now resolve broader query types at higher rates, and vendors including Zendesk have shifted investment toward AI agents, with Zendesk formally deprecating its legacy Answer Bot builder by end of 2026. Scripted systems persist where predictability, compliance, and low cost matter most, but the ceiling is explicit: rule-based systems achieve only 20-35% resolution versus 80%+ for AI-powered alternatives, and maintenance costs scale exponentially with ruleset growth. This is a practice to operate and optimise in narrowly-scoped use cases, not to bet on for growth.

CURRENT LANDSCAPE

Vendor support is consolidating and strategically retreating. Zendesk formally deprecated its legacy Answer Bot builder in February 2025 with end-of-life scheduled for December 2026, signaling the major incumbent's exit from rule-based scripted approaches. Dydu secured EUR 6.3M in early 2026 to scale its rule-based and conversational AI systems, sustaining 160-plus enterprise projects across energy, finance, telecom, and public sector clients including EDF, SNCF, and Orange, but frames itself as offering both rule-based and conversational AI rather than pure scripted. Zoho maintains its SalesIQ Answer Bot for small and mid-market buyers. The installed base is large and operationally active, even as new investment flows elsewhere.

Deployment economics remain compelling within defined scope, documented by June 2026 evidence across geographies. A D2C brand in Bengaluru achieved 60% cost reduction via rule-based chatbot handling 70% of queries (covering 5 core FAQ topics); Ozonetel research shows Muthoot's WhatsApp chatbot delivering 2.5X order value growth and Mahindra's KisanBot reaching Tier-3 farmers in markets previously unreachable through traditional channels. Industry ROI framework remains consistent: McKinsey-IDC benchmarks document 25-40% cost reduction, 9-12 month payback, and 180-320% 3-year ROI. Conservative practitioner methodology estimates 40-55% realistic containment for mixed-query bases, delivering £30k-£55k annual savings for UK SMEs with 2-5 month payback on implementation costs. Industry benchmarks put scripted interactions at $0.10--$0.50 each versus $5--$12 for human agents. These are real, sustained savings within narrow, well-scoped use cases. Customer satisfaction tells the other side of the story: only 29% of banking chatbot users report satisfaction despite a 70% return rate, and 35% of small business chatbot projects fail outright due to implementation complexity. The gap between deployment and satisfaction remains the practice's persistent weakness.

The competitive case against scripted systems is now quantified and explicit. Industry benchmarks document rule-based systems achieving only 20-35% resolution versus 80%+ for AI-powered systems, with 78% of scripted interactions requiring escalation. Production evidence from June 2026 documents the execution gap: a DECTA study of UK banking apps found chatbots resolve only 11.4% of blocked-payment inquiries—the core value-add task—with 50% requiring human escalation and only 5.4% of customers trusting chatbots compared to 65.2% trusting humans. Zendesk's legacy Answer Bot is documented at 2-3% deflection, 'insufficient for meaningful support automation.' A meta-signal emerged: Sinch's survey of 2,500+ enterprise leaders found 74% of deployed enterprise chatbots are pulled offline and relaunched since deployment, revealing high production failure rates and gaps between promised and delivered outcomes. Systemic measurement challenge: 61% of enterprises cannot demonstrate actual ROI due to failure to establish baselines before deployment, indicating adoption barriers extend beyond technology to organizational maturity. For organizations already running scripted bots in well-scoped banking Tier 1 and internal support niches, the ROI justifies continued operation and optimization. For new deployments, the strategic calculus strongly favours LLM-powered and agentic alternatives capable of broader scope.

TIER HISTORY

ResearchJan-2016 → Jan-2016
Bleeding EdgeJan-2016 → Jan-2020
Leading EdgeJan-2020 → Jan-2021
Good PracticeJan-2021 → present

EVIDENCE (143)

— Zendesk Answer Bot deprecation timeline: August 31, 2026 development stops, December 10, 2026 full shutdown. Scripted article-recommendation system absorbed into generative AI agents, signaling industry shift away from rule-based toward LLM-powered approaches.

— Critical failure analysis: chatbot containment vs resolution gap (78% reported vs 41% actual). Documents four scripted-system failure modes: frustration cascade detection (missed 3-5 turns before escalation), intent mismatch (20-35% initial misclassification), deflection trap, post-bot escalation tax.

— Multi-source benchmark (18 sources: Gartner, Zendesk, Forrester, Intercom, HubSpot, IBM, Salesforce, NICE): rules-based systems 28-38% containment vs AI-powered 52-65%, a 20-35 percentage-point performance gap. Rules-based systems show industry-specific ceilings (healthcare 28-40%, B2B professional services 22-35%).

— Benchmarking framework distinguishing legacy/rule-based chatbots (10-25% true resolution) from standard AI assistants (40-60%) and agentic systems (70-85%). Clarifies deflection vs resolution gap: legacy bots function as intake/routing layers, not problem-solving systems.

— Explicit performance segmentation: 'Rule-based chatbots and deflection tools resolve 30-40%' vs AI agents 70-85%. Identifies deflation-as-metric trap where 100% deflection rate can mask 0% resolution rate, illustrating why scripted system effectiveness is systematically mismeasured.

— Direct comparative benchmark: traditional rule-based bots deflect ~15% of tickets vs 60-80% for modern LLM-based agents. Gartner data shows >45% AI deflection but only ~14% genuine self-service resolution, establishing the performance ceiling for scripted systems.

— Zendesk's legacy Answer Bot (rule-based scripted system) documented at only 2-3% deflection rate, 'insufficient for meaningful support automation,' signaling market obsolescence of rule-based approaches and vendor deprecation signals.

— WhatsApp chatbot deployment guide recommends rule-based for repetitive tasks (FAQs, order tracking, appointments) with 60-80% Tier 1 containment target; named outcomes: Muthoot (2.5X order growth), Mahindra KisanBot reaching Tier-3 farmers.

HISTORY

  • 2016: Chatbot hype reached peak with Zendesk's Automatic Answers GA, Microsoft Tay experiment, Facebook Messenger bots, and 80% enterprise adoption intent, offset by critical analysis of UX limitations and failure cases in real deployments.

  • 2017: Zendesk Answer Bot GA (Aug) marked ecosystem maturity with production deployments at Dollar Shave Club (4.5K tickets/month) and GogglesNMore (70% deflation). Facebook's autonomous chatbots failed at 70% of requests, reinforcing viability of scripted approaches. Retail sector saw high abandonment rates (Everlane, 73% consumer intolerance) but successful deployments demonstrated 25–70% deflation when scoped tightly. Shift from experimental to early-production phase with clear use-case boundaries.

  • 2018: Vendor ecosystem solidified with 14+ commercial chatbot solutions supporting scripted workflows. Dollar Shave Club sustained Answer Bot deployment, scaling to 9,000 monthly resolutions (14% rate). Consumer adoption reached 15% of US adults, but implementation success hinged on organizational readiness (knowledge base maturity, existing automations) rather than technology capability. Scripted bots remained confined to narrow use cases (FAQ, order status); broader expansion awaited more sophisticated AI approaches. Early-production phase continued with concentrated early-adopter base.

  • 2019: Zendesk Answer Bot expanded to all web and mobile channels; customers collectively solved 1M+ tickets and saved 225K agent hours. Enterprise intent surged: 70% of organizations without bots planned deployment within 12 months. Major banking client handled 300K questions daily. Implementation challenges surfaced: Forrester predicted 60% of deployments would lack effective human handoff; 50.7% of users cited inability to reach live agents as top frustration. Scripted bots proved viable at scale for high-volume, low-complexity tasks but implementation success remained dependent on organizational process maturity rather than product capability.

  • 2020: Public sector adoption validated scripted chatbot viability: UK Government Digital Service published guidance with DVLA and MoJ case studies showing measurable contact reduction. International commercial deployments achieved at scale—Väre (Finland) reached 63% automation rate; Garden Games processed 15K+ emails annually with 40%+ intent accuracy. Proof points reinforced core thesis: scripted bots excel at narrow, high-volume transactional tasks when properly scoped, but organizational readiness and human escalation design remained the primary adoption constraint.

  • 2021: Market adoption reached critical mass with 65% of 2,800 European companies deploying chatbots. Harmonie Mutuelle case study demonstrated production viability at scale (500 dialogues/month, 80% comprehension). Zendesk continued product evolution with language model improvements and 17-language support, driven by 95% COVID-era request surge. Peer-reviewed research revealed design trade-offs: humanlike chatbots reduced satisfaction for angry customers when expectations were unmet. Scripted bots consolidated as proven technology with clear organizational prerequisites (data protection compliance, clean data structures, effective escalation), but fundamental limitations (no learning, no personalization) persisted. The tier remained leading-edge—adoption was now mainstream for well-scoped use cases, with clear understanding of success factors and constraints, but broader scaling awaited either organizational maturity improvements or algorithmic advances.

  • 2022-H1: Vendor platforms continued maturation with Zendesk transitioning from Answer Bot usage metrics to Automated Resolution pricing and integrating Flow Builder for low-code workflow automation. Consumer adoption remained strong (82% usage, 70% satisfaction for basic queries) but user experience limitations persisted: independent survey found 32% rarely felt understood, 30.8% abandoned interactions, and 60%+ cited frustration at chatbot handoff. Production deployments confirmed enduring value in narrow, high-volume use cases (Agence Française de Développement's HR chatbots), while peer-reviewed research documented the core constraint—inability to handle unanticipated inputs. Tier transitioned to good-practice as adoption became normalized, technology accessible to business users, and success factors well understood, yet algorithmic constraints remained.

  • 2022-H2: Market adoption reached mainstream scale: 54% of enterprises used chatbots for customer interactions, with 38% planning deployment. Production deployments validated ROI—Crosscard/Viabuy's Zendesk bot resolved 10K+ questions annually, SNCF handled 10M+ questions with eight languages, PwC achieved 97% comprehension on internal support. Market forecasts confirmed ecosystem maturity: $10.5B projected by 2026 from $2.9B in 2020 (23.5% CAGR). However, persistent user experience barriers remained: Zendesk research showed 60% customer disappointment, 55% reported inaccurate responses, and 50% frustrated with bots' inability to recognize limitations. Scripted bots continued to excel at narrow, high-volume use cases but broader adoption awaited organizational maturity or algorithmic advances.

  • 2023-H1: Multi-industry production deployments confirmed via Dydu (BNP Paribas, Total Energies, DSAC aviation, French prefecture) and hybrid adoption in regulated industries (Switzerland study). Enterprise deployment held at ~19% with 62% planning. However, customer adoption remained limited: Gartner survey found only 8% used chatbot recently and 25% would repeat; Capterra retail study showed 50%+ negative experiences and low conversion (17% product search). Early ChatGPT disruption signals emerged. Adoption growth plateaued despite awareness; tier remained good-practice as organizational process readiness remained the binding constraint.

  • 2023-H2: Enterprise adoption intent surged (71% of executives aimed for full automation by 2027) but customer-facing deployment stalled—50% of B2C support managers cited perception of chatbots as "cold and static," while industry experts documented high cost and low customer satisfaction; deployment shifted toward internal use cases (HR, IT). Dydu Observatory showed 92% consideration among professionals (up from 48% in 2021), but ChatGPT awareness reached 94% with 34% using LLMs, signaling competitive displacement. Zendesk Answer Bot remained technically limited—NLP mapping with tunable thresholds but no learning capability. Critical tension: widespread enterprise intent clashed with documented customer dissatisfaction, implementation complexity, data privacy concerns, and emerging LLM-powered alternatives. Tier remained good-practice; adoption growth plateaued as organizational readiness and scope boundaries remained the primary adoption constraint.

  • 2024-Q1: Production deployments continued across internal support workflows: Agence Française de Développement scaled Dydu HR and recruitment chatbots to 3,000 employees with 90%+ qualified interaction rate, while Dydu added nine new enterprise deployments (tax advisory, energy, public finance sectors) by March. Research validation affirmed technical maturity: peer-reviewed hybrid rule-based system achieved 98% accuracy and 97% precision on 1,684 e-business queries, with response time optimization (41 seconds vs. 20 minutes). Market metrics held steady at 92% enterprise adoption consideration, with 8/10 satisfaction but "reliability of responses" persisting as primary barrier. Vendor analysis showed 300-400% average ROI with 65% ticket automation in successful deployments. Critical barriers remained: documented limitations (incorrect responses, inability to handle complex queries, empathy deficits, security concerns) continued to constrain customer-facing expansion. Tier remained good-practice; LLM disruption signals intensified with ChatGPT adoption reaching 34% among professionals, pressuring scripted systems from above while organizational process maturity continued limiting broader adoption from below.

  • 2024-Q2: Vendor platforms continued incremental improvements (Zoho's Answer Bot 2.0) alongside sustained enterprise adoption patterns. Market adoption metrics held steady (74% FAQ preference, 39% B2C chatbot involvement), but growth had stalled; customer satisfaction barriers remained (74% found chatbots useful for specific tasks but uptake constrained by perceived limitations). Real-world deployment challenges persisted, with platform configuration and integration complexity documented as practical barriers to expansion. Competitive displacement from large language models accelerated, shifting strategic focus from rule-based expansion toward integration choices and narrowly-scoped use-case optimization. Tier remained good-practice as core value proposition (high-volume, low-complexity automation) remained proven and accessible to business users, yet economic and competitive pressures intensified.

  • 2024-Q3: Production deployments validated scripted chatbot viability in specialized domains: Mass General Brigham's rule-based return-to-work chatbot handled 5,575 users with 71.6% meeting criteria, reducing daily OHS calls from 633 to 115 (82% reduction) with wait times dropping from 28 to 6 minutes—demonstrating operational impact in healthcare. Peer-reviewed evidence confirmed persistent maturity constraints: healthcare chatbots offered delivery and administrative benefits but faced ethical, technical, and UX limitations; consumer adoption remained limited (only 30.1% likely to use health chatbots despite interest in voice-based support). Vendor platform evolution accelerated with Zendesk repositioning knowledge base chatbots toward AI agent integration (promoted as "next evolution"), signaling incumbent shift toward LLM-powered alternatives. Documented chatbot failures (DPD poetry incident, Chevy price manipulation) highlighted deployment risks from inadequate guardrails and human oversight gaps. Tier remained good-practice—scripted bots delivered proven value in narrow, well-scoped use cases with measurable operational savings—but the competitive landscape shifted decisively toward LLM-powered approaches, constraining further growth of rule-based systems.

  • 2024-Q4: Scripted chatbot adoption remained stable in internal support workflows with evidence of sustained production deployments: Renault's IT helpdesk chatbot (deployed 2021, Teams integration Oct 2023) achieved 72% traffic increase and 94% comprehension with 137K visitors, demonstrating long-term viability of rule-based automation in employee support. However, market headwinds persisted: Zendesk shifted strategic messaging to AI agents (Oct-Nov 2024), positioning them as next-generation chatbots with 80%+ autonomous resolution; practitioners documented continued customer frustration barriers (redirected to knowledge bases without solving problems) and ROI measurement challenges limiting expansion. Critical insight: scripted bots' role shifted decisively from customer-facing growth engine to stable, well-scoped internal automation tool. The tier remained good-practice—proven, accessible, economically sound in narrow use cases—but growth ceiling was explicit as vendor innovation and market attention flowed toward LLM-powered alternatives capable of broader scope.

  • 2025-Q1: Scripted chatbots established a stable niche in 2025 with quantified economics and defined use-case boundaries. Market research confirmed rule-based systems holding ~35% of the chatbot solutions market due to cost-effectiveness and ease of deployment, handling 65% of routine inquiries in utilities and retail. Zendesk maintained support for legacy automated article response features with ongoing analytics dashboards. Industry benchmarks quantified the economic case: setup costs $3k–$20k, development $3k–$7k, deflation rates 20–40%, with operational savings of up to 40% AHT reduction. E-commerce adoption metrics showed sustained customer acceptance (31% for order management, 28% for returns). However, critical limitations persisted and were well-documented: ML6 consultancy analysis highlighted scripted systems' manual maintenance requirements, rigid flows, and inability to extract context from prior interactions, contrasting them with GenAI approaches. Independently, 75% of chatbot users reported failures on complex issues, with root causes including training data limits, classification failures, and lack of reasoning capability. The strategic picture remained clear: scripted chatbots delivered proven ROI in narrow, well-defined use cases (FAQ, order status, returns) but faced ceiling for broader adoption. Tier remained good-practice as the technology proved accessible, economically sound, and operationally viable in defined scopes, with clear understanding of both strengths (cost, predictability, compliance) and constraints (rigidity, context limitations, complexity handling).

  • 2025-Q2: Scripted chatbot market remained stable with sustained enterprise deployments alongside strategic vendor shifts toward LLM-powered alternatives. Zendesk officially migrated from legacy Answer Bot to AI agents as the platform standard (June 2025), signaling strategic deprecation of rule-based scripted approaches within major incumbents. However, real-world deployments continued: a major French hypermarket managing 25M+ annual calls achieved 18% reduction in handling time and 17% agent productivity gains through Zendesk conversational bots in production. Market context showed continued industry adoption (58% B2B, 42% B2C companies deploying chatbots) with broader market growth ($15.57B in 2024 projected to reach $46.6B by 2029), but customer-side barriers persisted—50% of customers reported frustration, and 70% of consumers signaled willingness to switch brands after bad chatbot experiences. Scripted bots remained economically viable in defined niches, but competitive pressure from GenAI approaches and persistent customer satisfaction gaps continued limiting expansion beyond narrow, high-volume transactional use cases. Tier remained good-practice as the value proposition (cost-effective, predictable automation in narrow scopes) held steady, yet the strategic landscape shifted decisively as vendors repositioned toward broader AI agents.

  • 2025-Q3: Scripted chatbot deployments remained operationally viable but faced mounting evidence of fundamental limitations and competitive displacement by LLM-powered alternatives. Zendesk continued supporting Answer Bot with updated metrics documentation (Sept 2025), including suggestion rate, resolution rate, and ticket-assist measures for production monitoring. However, peer-reviewed research affirmed persistent technical constraints: rule-based systems provide low cost and operational transparency but fall short in scalability and flexibility—core findings from Aug 2025 comparative analysis of chatbot development methods. Critical evidence emerged documenting real-world failure modes in production deployments: leading chatbots delivered inaccurate or misleading answers 27% of time on complex queries; 67% of users abandoned chatbots stuck in instruction loops or generating irrelevant responses; one major UK bank's scripted chatbot misinformed over 140,000 customers about overdraft policies, resulting in lost customers and regulatory scrutiny. Infrastructure failures imposed quantifiable costs: retailers documented $4.2M annual revenue loss from chatbot failures with no human escalation path. Hybrid deployment models gained traction as pragmatic mitigation: analysis of 2025 implementations showed organizations increasingly adopting human-bot collaboration (bots handling 24/7 routine inquiries, agents managing complex problem-solving) with implementation costs of $2k–$150k, though integration complexity remained a persistent adoption barrier. Market analytics revealed ROI measurement challenges: companies focusing solely on activity metrics (interactions, messages) achieved 60% lower ROI than those measuring business outcomes; a Fortune 500 retailer with 2M monthly chatbot interactions experienced 25% customer churn due to poor resolution rates. Tier remained good-practice as scripted bots delivered proven value in narrowly-scoped, high-volume transactional workflows (FAQ handling, order status, returns), but the empirical case for broader expansion had weakened substantially—critical limitations on accuracy, context awareness, and escalation transparency now documenting the ceiling of rule-based automation viability.

  • 2025-Q4: Scripted chatbot market approached closure of growth cycle with consolidating adoption evidence and explicit technical constraint documentation. Consumer adoption reached mainstream scale: Pew Research (Dec 2025) survey of 1,458 U.S. teens found 64% use chatbots, including 30% daily—demonstrating broad acceptance across demographics. Banking sector adoption reached 92% of North American banks with $2B+ market value, though satisfaction gaps persisted (29% satisfaction despite 70% return rate), revealing persistent implementation-execution challenges. Small business ROI remained proven (148-200% returns over 12-18 months) but adoption barriers persisted: 35% of projects failed due to implementation mistakes. Critical technical analysis surfaced in Q4: AIQ Labs documented fundamental rule-based limitations—requirement for hundreds of thousands of hand-tuned rules yet inability to learn or adapt, with maintenance costs scaling exponentially. NICE projection held at 70% of interactions managed by AI agents by 2025. Industry analysis emphasized that isolation, governance gaps, and accuracy failures prevented broader expansion. By year-end, the strategic picture remained stable: scripted bots maintained proven niche value in narrow, transactional use cases (FAQ, order status, returns) with clear ROI and compliance advantages, but technological ceiling was explicit and well-documented. Tier remained good-practice as adoption plateaued within defined scope boundaries; further growth awaited either organizational process maturity improvements or fundamental algorithmic advances beyond rule-based paradigm.

  • 2026-Feb: Scripted chatbot market consolidated around stable, narrowly-scoped deployment patterns with explicit competitive displacement signals from AI-powered alternatives. Vendor support remained: Dydu announced €6.3M funding in Feb 2026 to scale rule-based and conversational AI systems, with 160+ enterprise projects handling millions of monthly conversations across verticals (energy, finance, telecom, public sector). Production-scale deployments validated continued viability in defined use cases—banking sector achieved 88-92% Tier 1 automation with scripted systems, processing up to 90% of routine interactions. Market economics held steady: 2026 benchmarks confirmed $0.10-$0.50 per-interaction cost for scripted systems deflating 40-70% of Tier 1 volume, with ROI remaining strong ($8 return per $1 invested in some verticals). However, critical competitive pressure from LLM-powered alternatives intensified: industry analysis declared rule-based chatbots obsolete due to fundamental limitations (inability to handle off-script inputs, 30-40% resolution vs. 80%+ for conversational AI, rigid decision trees, high escalation rates of 78%). Customer satisfaction remained constrained (29% reported satisfaction despite high return rates), highlighting persistent implementation-execution gaps. The tier remained good-practice—scripted bots delivered proven, economically sound automation in narrow, high-volume transactional tasks—but the strategic position continued narrowing as vendor innovation and market attention flowed decisively toward LLM-powered autonomous and agentic systems capable of broader scope.

  • 2026-Apr: Scripted chatbot market faced explicit vendor deprecation and competitive obsolescence signals. Zendesk announced sunsetting of legacy scripted bot builder by August 2026, consolidating all bot functionality into advanced AI agents—signaling the major incumbent's formal exit from rule-based approaches. Industry research (Gartner, Forrester) confirmed: 70% of enterprises have deployed some form of automated conversational system, yet 54% of consumers report chatbot interactions as frustrating due to inability to handle complex semantics, rigid scripting, and context loss. Independent study of 50+ mid-market companies (travel, mobility, events) showed 50% adoption with mixed outcomes: 40% achieved response time improvements, but cost reductions and resolution rate gains remained limited, suggesting organizations are "still in early stages of learning curve." Technical analysis documented core limitations driving organizational abandonment: rigid linear conversation flows that crash on out-of-sequence inputs; poor NLU causing high escalation; and lack of learning capability preventing improvement over time. Real-world practitioner finding: organizations recognize scripted bots "work, but don't scale"—solving specific FAQ problems but failing to improve autonomously or adapt to new conditions. Banking sector maintained high automation rates (88-92%) in narrow Tier 1 tasks, validating continued economic viability in well-scoped, high-volume transactional workflows. However, the strategic ceiling was explicit: no major vendor invests in rule-based architectures as primary platform; all incumbent innovation and market messaging has shifted to agentic AI. The tier remained good-practice due to proven, accessible, economically sound operation in defined scopes, but further expansion of scripted systems was constrained by documented technical limitations and explicit vendor deprecation signals.

  • 2026-May: Late April and early May 2026 evidence consolidated the narrative of scripted chatbot obsolescence and implementation challenges. UC Berkeley research (California Management Review) documented five frustration sources specific to rule-based systems: understanding failures, inability to solve complex problems, poor handover integration, lack of humanization, and lack of personalization; Gartner survey showed 64% customer preference against AI chatbots entirely. TIMEWELL analysis of major 2026 implementations across Klarna, Intercom, and others concluded: "Almost no companies are getting results from a chatbot in isolation"—Klarna's 2.3M conversations/month shifted to hybrid model in 2025 due to quality erosion. Oxford peer-reviewed research found chatbots trained for warmth paradoxically made 10-30% more errors on critical topics (medical, false-belief correction), highlighting design constraints. Practitioner benchmarks remained narrow: ecommerce deployments cost $0-29/mo and achieve 5-15% conversion lift with 20-30% deflection, but only for stores under $500K annual revenue with stable catalog. Real-world deployment failures documented: automotive dealership case study across 41 franchises showed scripted decision-tree bots zero phone captures and lost sales-critical conversion window; SMS-first alternative achieved 92% new customer contact rate. Market data confirmed: $15.12B AI customer service market in 2026 with 80% adoption intent, but only 25% fully integrated; 79% customer preference for human agents persisted despite cost investments in automation. Mid-to-late May evidence quantified the performance ceiling of rule-based systems with greater precision: eCorpIT benchmarks documented 30-40% cost reduction and 45-65% deflection for scripted systems but identified a realistic ceiling of 50-70% and hallucination risks at boundary cases; Netguru framework analysis quantified rule-based containment ceiling at 40% on non-FAQ inquiries versus 55-70% for LLM-backed systems, and identified a 67% chatbot failure rate with recommendation that scripted flows be paired with AI reasoning for off-script queries; Qualimero analysis cited Gartner 2025 data showing rule-based bots resolve 20-35% versus 55-65% for AI-powered, explicitly defining the sub-20-FAQ domain where scripted bots remain viable. A hybrid architecture case study (IFDA educational institute) validated scripted funnels for high-volume lead capture with LLM fallback for open-ended questions — illustrating ongoing utility in cost-optimized hybrid designs. Robylon analysis documented rule-based chatbots achieving only 20-35% resolution versus 60-80% for AI-powered systems; BluIP's enterprise buyer guide quantified first-contact resolution at 40-60% for rule-based versus 80-95% for conversational AI; RetentionCheck's SaaS churn benchmark showed 43.9% annual churn for rule-based chatbot platforms, with 30% of cancellations citing insufficient deflection and 26% citing maintenance burden. Independent analysis (Customerland, Plain) confirmed the 20-30% deflection ceiling for rule-based B2B chatbots. Measurement framework analysis (Lorikeet) revealed legacy chatbots resolve only 10-30% of tickets versus 80-93% for action-taking agents, exposing deflection-vs-resolution metric traps. The tier remained good-practice — scripted bots delivered proven, economically sound automation in well-defined, high-volume transactional use cases — but the evidence base reinforced a narrowing ceiling: quantified performance gaps versus AI alternatives, platform churn driven by insufficient capabilities, customer frustration with failure modes, and vendor exit signals constrain any expansion beyond the current scope.

  • 2026-Jun: Competitive obsolescence signals sharpened with a confirmed hard shutdown date. Zendesk's legacy Answer Bot deprecation timeline finalized: development stops August 31, 2026 and full shutdown December 10, 2026 — the scripted article-recommendation system absorbed into generative AI agents, marking the major incumbent's formal exit from rule-based approaches. Zendesk's legacy Answer Bot had been documented at only 2-3% deflection ("insufficient for meaningful support automation"), while Sinch's survey of 2,500+ enterprise leaders found 74% of deployed enterprise chatbots have been pulled offline and relaunched, revealing high production failure rates across the broader chatbot category. Independent multi-source benchmarking (18 sources including Gartner, Zendesk, Forrester, Intercom) confirmed rules-based systems achieve only 28-38% containment versus AI-powered 52-65% — a 20-35 percentage-point performance gap — with industry-specific ceilings (healthcare 28-40%, B2B professional services 22-35%). Critical failure analysis documented four scripted-system failure modes: frustration cascade detection (missed 3-5 turns before escalation), 20-35% intent misclassification at entry, deflection trap (bots deflect but don't resolve), and post-bot escalation tax. Rule-based deployment economics remain viable in defined niches: a D2C brand achieved 60% cost reduction handling 70% of queries across 5 FAQ topics; Ozonetel case studies show WhatsApp scripted chatbots delivering 60-80% Tier 1 containment with documented business outcomes (Muthoot 2.5X order growth). The practice's role is now settled and the vendor timeline is explicit: scripted bots remain economically justified for stable, high-volume FAQ deflection through 2026, but organizations on Zendesk face a mandatory migration deadline and the structural case against new investment in rule-based architectures is vendor-confirmed.

TOOLS