Perly Consulting │ Beck Eco

The State of Play

A living index of AI adoption across industries — where established practice meets the bleeding edge
UPDATED DAILY

The AI landscape doesn't move in one direction — it lurches. Some techniques leap from experiment to table stakes in a single quarter; others stall against regulatory walls, technical ceilings, or organisational inertia that no amount of hype can dislodge. Knowing which is which is the hard part. The State of Play cuts through the noise with a rigorously maintained index of AI techniques across every major business domain — classified by maturity, evidenced by real-world adoption, and updated daily so you always know where you stand relative to the field. Stop guessing. Start knowing.

The Daily Dispatch

A daily newsletter distilling the past two weeks of movement in a domain or two — delivered to your inbox while the index updates in the background.

AI Maturity by Domain

Each dot marks the weighted maturity of practices within a domain — hover for a brief summary, click for more detail

DOMAIN
BLEEDING EDGEESTABLISHED

HR policy Q&A chatbot

GOOD PRACTICE

TRAJECTORY

Stalled

AI chatbot that answers employee questions about HR policies, benefits, and procedures from organisational documentation. Includes policy RAG and benefit eligibility checking; distinct from enterprise search which serves general rather than HR-specific knowledge needs.

OVERVIEW

HR policy Q&A chatbots have matured to production-scale deployment with proven ticket deflation but face hardened adoption barriers rooted in regulatory liability and organizational readiness, not product capability. GA platforms from ServiceNow EmployeeWorks, Moveworks, and Leena AI now handle benefits, leave, compensation, and procedural queries with documented deployments achieving 40-98% deflection rates on routine policy questions. However, legal liability has inverted the adoption equation: corporate liability frameworks (Air Canada v. Moffatt tribunal ruling, German OLG Hamm precedent, May 2026) establish that organizations own all chatbot statements regardless of model confidence or training data quality—meaning hallucinated policy guidance creates direct company liability for damages. Regulatory frameworks (EU AI Act Article 50 transparency by August 2, 2026; state disclosure laws) impose mandatory disclosure and audit obligations. The inflection point has shifted from "does the technology work?" (yes, at 70-85% Tier 1 deflection for transactional policy questions) to "is the organization ready to defend and govern the output?" Organizations with knowledge-base discipline, runtime governance, and bias-testing frameworks extract demonstrable value; organizations deploying without rigorous governance infrastructure are accepting six-figure litigation exposure per hallucinated policy statement.

CURRENT LANDSCAPE

Vendor Maturity and Deployment: The vendor ecosystem has consolidated around platform incumbents with production-scale deployments. ServiceNow's EmployeeWorks GA (February 2026, integrating Moveworks' conversational AI) achieved 5x YoY growth in Q1 2026 with 6 enterprise deals exceeding $1M ACV, signalling strong adoption momentum among large organizations. Documented deployments include Siemens Healthineers (74,000 employees, 5,000 hours monthly saved, 91% satisfaction), CVS Health (300,000 colleagues, 50% chat reduction), and City of Raleigh (98% initial touchpoint routing). Pebl's Alfie HR chatbot achieved 83.5% support ticket deflection on general queries, with 98.2% deflection specifically on global hiring/compliance policy questions and 42% reduction in compensation inquiries—demonstrating capability-level maturity on policy-focused use cases. SAP Joule (HR Path Group case study, 2,500 employees) reduced leave request handling from several minutes to 30 seconds, saving 20 hours monthly, and reduced job description creation from ~1 hour to minutes. Named production deployments at Morning Brew (Brew Bot in Slack for HR Q&A) confirm channel-native deployment models achieving high employee engagement. Moveworks (350+ customer base), Leena AI (400+ customers), and IBM watsonx Orchestrate demonstrate production viability at scale.

Adoption and Organizational Readiness: Bimodal adoption pattern has become entrenched. SHRM March 2026 data (500+ HR leaders) shows AI adoption doubled to 43% in one year, yet only 11% embedded AI into daily workflows—organizational readiness remains the binding constraint. CHRO Association survey (150 CHROs) found 91% prioritize AI but 47% lack productivity measurement frameworks. Fuel50 Q1 2026 survey (250+ HR leaders) shows 48% exploring/piloting AI in talent workflows while 25% have paused or discontinued initiatives in the past 24 months. The adoption-outcomes gap is severe: 88% of HR leaders report organizations have NOT realized significant business value from AI investments despite deployment (StealthAgents analysis, June 2026). Knowledge-base quality and governance discipline—not AI model capability—determine deployment success; failure modes include stale policy retrieval, wrong documentation pulls, and loss of organizational context in complex cases.

Regulatory Liability and Governance Imperatives: Liability frameworks have crystallized. Air Canada v. Moffatt tribunal ruling (February 2024) established that airlines cannot defend chatbot errors by claiming the chatbot is a separate legal entity—companies own all statements. May 2026 OLG Hamm (Germany's highest HR court) ruling extends this principle: companies are strictly liable for chatbot hallucinations regardless of training data quality, directly applicable to false HR policy statements. EU AI Act Article 50 transparency obligations take effect August 2, 2026 (disclosure that users are interacting with AI); high-risk system compliance (Annex III) postponed to December 2027 but already reshaping governance expectations. Organizations deploying HR policy Q&A systems must implement mandatory disclosure, knowledge-base source verification, runtime hallucination detection, and human-in-the-loop controls for employment-decision-affecting advice. Organizations without governance infrastructure face six-figure litigation exposure per hallucinated policy statement and regulatory fines reaching €35M or 7% global turnover. Implementation barriers (67% of chatbot deployments fail to meet expectations per Netguru) center on knowledge-base staleness, broken escalation paths, and channel adoption friction—technical solvable problems, but organizational change-management requirements impose months of design-phase work before deployment readiness.

TIER HISTORY

ResearchJan-2020 → Jan-2021
Bleeding EdgeJan-2021 → Jan-2023
Leading EdgeJan-2023 → Apr-2025
Good PracticeApr-2025 → present

EVIDENCE (135)

— Expert legal analysis establishing AI agents are legal agents of the deploying organization; cites Air Canada case and German court ruling on Google liability for AI errors, clarifying corporate accountability for chatbot output.

— Technical compliance guidance for Article 50 transparency obligations on HR chatbots with concrete examples (leave entitlement, expense reimbursement) and August 2, 2026 deadline for disclosure requirements.

— EmployeeWorks (ServiceNow/Moveworks combined product) adoption signal: 5x YoY growth in Q1 2026, 6 deals exceeding $1M net new annual contract value indicating enterprise-tier adoption in employee workflow platform.

— SHRM26 conference journalism: Morning Brew deployed 'Brew Bot' in Slack for HR Q&A hub; employees ask service/business questions for prompt answers; HR team uses backend data for training/education needs—production deployment signal.

— Legal liability framework via Air Canada v. Moffatt tribunal ruling: companies own all statements chatbots make and cannot defend errors as 'AI hallucinations,' establishing direct corporate liability for HR policy misstatements.

— Alfie HR chatbot deployment metrics: 83.5% support ticket deflection on general queries, 60% reduction in reporting requests, 42% reduction in compensation inquiries, 98.2% deflection on global hiring/compliance questions.

— SAP Joule deployment at HR Path Group (2,500 employees): leave requests reduced from several minutes to 30 seconds (saves 20 hours/month), job descriptions reduced from ~1 hour to minutes, demonstrating measurable HR policy Q&A impact.

— Implementation failure analysis: 67% of businesses report chatbot technology did not meet expectations; identifies six core failure areas (integration, data quality, handoff paths) directly applicable to HR policy Q&A system maturity barriers.

HISTORY

  • 2021: VC funding ($30M+ for Leena AI, Paradox, Aisera) validated HR chatbot market; ServiceNow reported 80% L1 support reduction across departments including HR; peer research identified intrinsic employee motivation as key adoption driver; identified critical failure modes (intent misunderstanding, loops, handoff failures) as barriers to broader adoption.
  • 2022-H1: Moveworks launched Moveworks for HR (GA) with deployment at Solidigm (1,000+ employees); Kimberly-Clark's HR chatbot received 2.5x more questions than traditional channels; research identified specific design factors (bidirectionality, social cues) driving successful interactions; limitations in intent classification and emotion handling remained common, suggesting technology still immature despite growing vendor investment.
  • 2022-H2: Leena AI recognized as market leader (G2 analyst report); practitioner deployments in professional services demonstrated production viability but required narrow scoping and organizational change; market growth reports indicated sustained momentum; critical assessments documented enduring limitations (NLP brittleness, organizational resistance, unrealistic expectations) as barriers to mainstream adoption.
  • 2023-H1: Moveworks published Forrester ROI study (256% three-year ROI, $2.2M HR cost savings) confirming financial viability; ServiceNow expanded Virtual Agent with 18 pre-configured HR topics; academic research and practitioner case studies reinforced implementation criticality; market consolidated around specialist vendors and platform incumbents, with viable deployments limited to narrow, well-scoped use cases.
  • 2024-Q1: ServiceNow released HR Virtual Agent updates (Washington DC release); Leena AI maintained niche adoption (99 customers, 0.1% market share); Air Canada tribunal ruling established corporate liability for chatbot errors; NYC government chatbot audit revealed dangerous accuracy failures in legal/compliance domain, raising stakes for HR policy applications; compliance experts warned of million-dollar exposure from HR chatbot errors.
  • 2024-Q2: ServiceNow released Q2 2024 platform enhancements (Virtual Agent topic migration, Sensitivity Detection) advancing product maturity; Gartner survey showed 38% of HR leaders piloting/implementing GenAI with HR chatbots as top use case; however, significant adoption friction emerged—Paychex survey found 41% of employees prefer less AI and 71% uncomfortable with AI-led HR; U.S. Department of Labor issued field bulletin warning AI cannot substitute for human oversight in wage/hour compliance, establishing acute regulatory liability for HR chatbots handling policy decisions; Czech corporate survey showed 54% deploying or planning AI in HR but confirmed employee preference for human contact and EU AI Act high-risk classification.
  • 2024-Q3: IBM's AskHR case study revealed critical organizational barriers—initial CSAT collapse (-35) from poor change management, but full recovery to 94% query handling and 40% HR budget reduction after strategic redesign; Moveworks achieved Forrester Leadership across 19 criteria; ServiceNow achieved 37% faster case resolution through internal policy knowledge search; employee adoption sentiment improved (TriNet: 66% use AI for HR, 1 in 3 prefer AI to humans) but remained polarized (Paychex: 41% prefer less AI); HR leadership endorsement strengthened (Forrester: 75% deem AI-powered HR essential for future investment, 66% report satisfaction); peer research confirmed persistent technical limitations alongside functional value, confirming market bifurcation between successful narrow deployments and broader automation blocked by accuracy, compliance, and trust barriers.
  • 2024-Q4: ServiceNow released November 2024 enhancements (LLM Topics, multi-turn Q&A, Sensitivity Detection, multilingual support) advancing platform maturity; market adoption reached 70%+ of large enterprises handling 80,000+ HR queries monthly with 40% onboarding time savings, while HR professional adoption climbed to 94% despite governance gaps (40% lack policies); Leena AI confirmed scale (400+ customers, 70% self-service); legal analysis surfaced persistent risks (bias, discrimination, privacy, unreliable advice), emphasizing that functional capability had decoupled from organizational and regulatory readiness.
  • 2025-Q1: Vendor consolidation accelerated—ServiceNow announced $2.85B acquisition of Moveworks (March 2025), signaling market concentration around platform incumbents; real-world deployments expanded with Inspire for Solutions Development (450 employees, IBM watsonx), e2open (4,000+ employees, 75% HR question reduction), and EverBank (40% task reduction), confirming production viability across scales. However, critical implementation barriers emerged: independent reviews showed Moveworks at 6.4/10 satisfaction with only 81% recommendation rate; ScreenMeet analysis revealed Virtual Agent deflection rates below 15% vs. promised 50% due to "Done Gap" in knowledge bases. EU AI Act compliance deadline (August 2026) began reshaping governance requirements with €35M/7% turnover penalties for non-compliance, signaling regulatory maturity of the practice but raising adoption friction for cautious organizations.
  • 2025-Q2: BambooHR and e2open deployments demonstrated production viability at scale (30% and 75% ticket reductions respectively); ServiceNow Yokohama release added GA-level KPI tracking for KB-driven HR case resolution; vendor platform maturity continued advancing (Moveworks Agent Studio claimed 95% production success, Forrester analyst validation). However, adoption expansion stalled at organizational level—Vlerick survey showed 59% of HR leaders with little/no AI use, only 25% using chatbots; HR Acuity benchmark showed 44% of organizations report no AI use in employee relations and <10% leverage AI for policy recommendations. Employee sentiment remained polarized (66% use AI per TriNet, but 41% prefer less AI per Paychex). Leadership recognized governance gaps (92% of HR leaders per G-P report) ahead of August 2026 EU AI Act compliance deadline. The inflection shifted from platform capability to knowledge-base completeness and organizational readiness barriers.
  • 2025-Q3: Moveworks reached 5 million platform users and 90% company-wide rollout; Globe Telecom deployment (Leena AI) achieved 75% HR ticket self-service resolution. Product enhancements matured (multi-turn Q&A, multilingual support); however, organizational adoption stalled—HR Acuity showed <10% of organizations leverage AI for policy recommendations. Critical risks surfaced: employment law analysis warned of hallucination-induced compliance liability (avg. $40K+ per claim); security researchers documented prompt injection vulnerabilities enabling sensitive data extraction. Regulatory pressure intensified with August 2026 EU AI Act compliance deadline reshaping investment priorities. The inflection clarified: the practice consolidated toward narrow-scope, high-confidence deployments (standardized benefits, routine policy questions) rather than mainstream adoption, with organizational readiness and knowledge-base completeness as primary adoption barriers.
  • 2025-Q4: Regulatory compliance became the primary inflection point. New state laws (California SB 243 effective Jan 2026, New York, Maine, Utah, Nevada, Illinois) introduced private rights of action and disclosure requirements for AI chatbots, raising litigation exposure. Analyst reports (Capstone DC) documented acute risks from Mobley v. Workday age discrimination case and EU AI Act high-risk classification (€35M or 7% turnover penalties). Ecosystem partnerships matured—Interact and Leena AI integrated agentic AI capabilities enabling policy Q&A with automatic action triggering. Real-world deployments continued: Moveworks secured multinational infrastructure company with 28,000+ employees across 53 countries. HarmonyHR analysis confirmed that successful implementations require process discipline first, with AI agents positioned as complementary to human oversight. Critical assessments (inFeedo) documented continued limitations: emotional context gaps, complex issue handling barriers, and need for human escalation. By end-Q4 2025, regulatory maturation and litigation risk had overtaken product capability as the primary adoption barrier; organizations pursued only highest-confidence narrow-scope deployments with rigorous governance and change management.
  • 2026-Jan: Regulatory framework stabilized with California SB 243 and companion state laws (New York, Maine, Utah, Nevada, Illinois) now in effect, establishing disclosure requirements and private right of action for AI chatbots. Vendor platforms advanced (Moveworks FedRAMP-authorized across regulated sectors including healthcare and financial services; ServiceNow HRSD with sentiment analysis and KB-driven metrics). Workplace AI adoption broadened (Gallup: 12% daily, ~6/10 of AI users rely on chatbots for admin tasks), though HR-specific adoption remained cautious. Governance frameworks matured (Pesync compliance matrix, ercel guidance for EU AI Act high-risk classification with August 2026 deadline). Security risks crystallized through case studies (Eurostar prompt injection); governance failure and regulatory liability emerged as equal barriers to product capability. Deployments remained narrow-scope and high-confidence with mandatory human-in-loop controls.
  • 2026-Feb: Regulatory framework operationalized with state chatbot laws now active (California SB 243 private right of action, New York/Maine/Utah/Nevada/Illinois disclosure mandates). Vendor consolidation advanced: Moveworks achieved FedRAMP Moderate Authorization for federal/healthcare deployments; ServiceNow launched EmployeeWorks integrating Moveworks' conversational AI; Leena AI released AOP Creator (GA) enabling automated workflow triggering from policy queries. Real-world accuracy failures reinforced adoption caution: NYC's MyCity chatbot shut down (Feb 4) for providing illegal advice (e.g., withholding tips), demonstrating deployment risks directly applicable to HR policy automation. Industry guidance (AI Tribune) positioned HR policy Q&A as viable "good first project" but noted 80%+ of companies gained no measurable productivity from AI yet. Organizational readiness remained weak: HR-specific adoption continued stalling with <10% leveraging AI for policy recommendations. The decisive inflection clarified: regulatory compliance and organizational readiness (not product capability) now gatekeep adoption; deployment remained confined to large enterprises with sophisticated governance disciplines.
  • 2026-Mar: ServiceNow EmployeeWorks reached general availability (March 2026) with documented customer deployments achieving measurable HR impact: Siemens Healthineers (74K employees, 5K hours/month saved, 91% satisfaction), CVS Health (300K colleagues, 50% chat reduction), City of Raleigh (98% initial touchpoint resolution). Moveworks reached 350+ total customer base with specific HR use case evidence; FedRAMP certification enabled healthcare/federal sector deployments. However, critical governance and security failures emerged: McKinsey's internal AI Q&A chatbot (40K employees, 500K+ monthly prompts) breached via SQL injection/unauthenticated APIs (46.5M messages exposed); People Central HR SaaS provider leaked 95K employee records through SQL injection vulnerabilities (salaries, bank accounts, emergency contacts exposed). These high-profile breaches demonstrated that governance maturity and security discipline—not product capability—remain the binding adoption constraint. SHRM conference data (500+ HR leaders) confirmed adoption bifurcation: AI usage doubled (26% to 43%) but embedding into workflows stalled (11%). CHRO Association survey (150 CHROs) revealed 91% prioritize AI but 47% lack productivity measurement frameworks. EU AI Act chatbot penalties (€35M or 7% global turnover, effective August 2026) sharpened the compliance urgency for organizations deploying policy Q&A systems. The inflection solidified: deployment success depends entirely on organizational and regulatory readiness (knowledge-base curation, auditability, bias testing, human-in-the-loop controls) rather than platform feature parity.
  • 2026-Apr: Enterprise-scale deployment evidence expanded: Johnson Controls' agentic HR assistant (100K+ global employees) achieved 30-40% call volume reduction on routine policy and onboarding queries, while an NHS Foundation Trust (5,000+ employees) completed a structured governance-first deployment using a 15-person HR/OD validation working group. Hallucination risk received renewed scrutiny—the Halluhard benchmark found Claude Opus 4.5 with web search hallucinated in ~33% of multi-turn cases, and Klarna's reversal of its 700-person HR/CS AI replacement highlighted agentic failures in complex handling. The EU AI Act August 2026 enforcement deadline (€35M or 7% turnover penalties) and SHRM's finding that 80% of HR professionals use genAI daily but governance and change management lag behind in high-judgment tasks confirmed that regulatory and organisational readiness remain the binding deployment constraints.
  • 2026-May: New production case studies confirmed deployment viability at scale—Coretus (85% resolution, 14 countries), Microsoft Eva multi-strategy RAG with live agent handoff, Capgemini Nortura multilingual deployment—while benchmarks from Deloitte and Gartner research validated 60-80% Tier 1 deflection rates (70-85% for PTO, payroll, and benefits use cases) and a consulting analysis ranked HR policy Q&A bots as the #2 enterprise HR AI use case with a ~4-month ROI window. Critical cost-benefit analysis (Crisp) debunked vendor ROI claims by documenting real-world failure rates and establishing three preconditions for chatbot ROI; sycophancy research (Stanford HAI testing 26 models: 22-94% hallucination rates under social pressure) and legal analysis confirming employer liability for AI output regardless of vendor reinforced the governance imperative—with EU AI Act high-risk classification (August 2026 enforcement) and Sophos data showing 71% of organisations suffered identity breaches adding compliance urgency to deployments processing sensitive HR data.
  • 2026-Jun: Legal precedent matured with OLG Hamm (Germany's highest court for HR/competition cases) ruling that companies are strictly liable for chatbot hallucinations regardless of training data quality (May 27, 2026)—a decision directly applicable to false HR policy statements and establishing that organizations cannot defend HR chatbot errors as "AI hallucinations." Expert legal analysis further confirmed that AI agents function as legal agents of the deploying organization, extending Air Canada v. Moffatt liability principles across HR policy contexts. Simultaneously, EU AI Act compliance framework crystallized: draft guidelines (May 19, 2026) distinguish logistical HR chatbots (answering procedural questions, low-risk) from selective chatbots (evaluating policy applicability, high-risk); Article 50 transparency obligations (August 2, 2026 deadline) require disclosure with concrete HR examples—leave entitlement queries, expense reimbursement responses—now documented for compliance teams. EmployeeWorks (ServiceNow/Moveworks combined product) achieved 5x YoY growth in Q1 2026 with 6 enterprise deals exceeding $1M ACV—the strongest commercial adoption signal yet for enterprise-tier HR policy chatbot consolidation. Named deployment metrics sharpened the capability picture: Pebl's Alfie achieved 83.5% support ticket deflection on general HR queries and 98.2% deflection on global hiring and compliance policy questions specifically—demonstrating that well-scoped compliance-focused deployments outperform general-purpose implementations. Morning Brew's SHRM26 presentation confirmed channel-native deployment (Brew Bot in Slack) as the production pattern that achieves high employee engagement. Meanwhile, enterprise AI adoption data revealed widening gaps: IBM study showed 61-point gap between AI access (85%) and actual use (25%); WRITER survey documented 79% of enterprises struggling with AI adoption despite >$1M investment, with five failure modes (strategy theater, trust-resistance, security gaps, productivity-to-ROI disconnect) directly applicable to HR chatbot rollouts. Knostic research documented LLM fabrication of HR data (70% of employers caught employees using AI for salary research; 63% report salary requests based on inaccurate AI information), demonstrating that hallucinated policy statements create organizational damage identical to data breaches. Deployment evidence from Bell Telecommunications (RAG for employee policy access) and India case study (600-employee company: portal failed at 18% usage, conversational AI in Slack/Teams succeeded) confirmed that channel-native architecture and knowledge-base quality are deployment determinants alongside legal liability. Deloitte trust data showed 33% collapse in employee trust toward employer-provided AI over three months, with agentic system trust falling 89%, yet organizations successfully rebuilding trust through reskilling, worker involvement, and transparency. Mid-June data confirmed bimodal adoption: Careertrainer (2026-06-14) surveyed 2,000+ enterprises finding 25% of large enterprises now using AI-powered chatbots for employee self-service HR queries with 49% of HR professionals reporting employees increasingly comfortable with AI HR chatbots, signaling mainstream adoption at scale. Conversely, Ringly's market aggregate showed $11.8B global chatbot market in 2026 (up 23% YoY) with 91% of 50+ employee organizations deployed, yet StealthAgents' critical analysis (2026-06-06) found 88% of HR leaders report their organizations have NOT realized significant business value from AI investments despite adoption—the fundamental adoption-outcomes gap. Real deployment evidence accelerated: Lumeris (AWS case study, 2026-06-08) deployed 'Ask P&C' chatbot across 1,000+ employees with 90%+ accuracy and immediate responses replacing 1-2 day email wait; Athletico (6,500 employees, 2026-06-09) achieved 80% AI resolution on benefits queries with $500K+ cost avoidance; Moveworks' Starburst customer case demonstrated 50% autonomous HR and IT issue resolution with 62% first-line-of-defense adoption within one month of deployment. Platform risk surfaced: Ookla's analysis (2026-06-10) of AI platform reliability showed Claude accounted for 39 of 51 disruption days in Q1 2026, indicating infrastructure volatility for HR chatbots built on public LLM platforms. Employee adoption friction persisted: TechBuzz (2026-06-09) reported 50%+ of US desk workers identify as AI skeptics citing accuracy and hallucination concerns as primary barriers, suggesting meaningful resistance despite overall enterprise adoption data. Governance infrastructure emerged as deployment differentiator: NHIMG analysis (2026-06-05) identified runtime governance gap in production chatbots (Chipotle, Air Canada, DPD examples) where policy existed but real-time enforcement did not, directly applicable to HR chatbot accuracy safeguards. Operational maturity frameworks crystallized: Netguru (2026-06-15) provided diagnostic KPI hierarchy (containment → cost → FCR → CSAT → escalation rate) for chatbot ROI measurement, with benchmarks of 70–85% containment for transactional HR use cases. The inflection sharpened: the practice had decoupled into bimodal adoption (25% mainstream, 88% realizing no value) and production readiness was no longer bottlenecked by vendor capability but by organizational readiness, knowledge-base discipline, runtime governance, platform reliability, and employee trust. Organizations with mature infrastructure, governance discipline, and change management infrastructure extract real value; those without it deploy chatbots that achieve adoption metrics but negligible business outcome.