Perly Consulting │ Beck Eco

The State of Play

A living index of AI adoption across industries — where established practice meets the bleeding edge
UPDATED DAILY

The AI landscape doesn't move in one direction — it lurches. Some techniques leap from experiment to table stakes in a single quarter; others stall against regulatory walls, technical ceilings, or organisational inertia that no amount of hype can dislodge. Knowing which is which is the hard part. The State of Play cuts through the noise with a rigorously maintained index of AI techniques across every major business domain — classified by maturity, evidenced by real-world adoption, and updated daily so you always know where you stand relative to the field. Stop guessing. Start knowing.

The Daily Dispatch

A daily newsletter distilling the past two weeks of movement in a domain or two — delivered to your inbox while the index updates in the background.

AI Maturity by Domain

Each dot marks the weighted maturity of practices within a domain — hover for a brief summary, click for more detail

DOMAIN
BLEEDING EDGEESTABLISHED

Content localisation & translation

LEADING EDGE

TRAJECTORY

Stalled

AI-powered translation and cultural adaptation of marketing content for international markets beyond literal translation. Includes transcreation and cultural sensitivity checking; distinct from personal translation tools which support individual communication rather than marketing campaigns.

OVERVIEW

AI-powered content localisation has proven its economics for volume translation — cost reductions of 60–80% and throughput gains measured in orders of magnitude — but cultural adaptation remains the hard ceiling that keeps the practice at leading-edge rather than mainstream. Forward-leaning enterprises now run AI translation as production infrastructure, not an experiment. Post-editing workflows are the baseline, and platform vendors have shipped brand-voice controls and RAG-enhanced quality layers. The speed and scale story is settled. May 2026 research confirms: best-in-class LLM achieves only 44.48% accuracy on culturally-grounded tasks across 14 languages and 51 regions, while hallucination rates spike 15–35% in non-English languages and 38 points in low-resource contexts due to pretraining data imbalance.

What is not settled is everything beyond literal translation. Transcreation — rewriting content to land culturally, not just linguistically — still defeats LLMs. Research consistently shows AI mishandling idioms, cultural references, and emotional register, with accuracy on culturally specific items topping out around 67% even in leading models. Governance compounds the problem: adoption surveys find most organisations cannot maintain brand voice consistency across languages, and roughly half report no clear ROI despite deploying AI translation at scale. A critical adoption paradox emerged: while AI accelerates content production (86% of enterprises report this), localization workflows actually slow down (65%), with rework overhead consuming 21% of localization budgets. Additionally, translator adoption barriers persist—AI perception triggers lower trust and over-editing even when machine quality is equivalent. The practice has split into two distinct problems: high-volume translation, where AI delivers clear value, and cultural adaptation, where human judgment remains irreplaceable. Most organisations are still navigating that divide.

CURRENT LANDSCAPE

May 2026 deployments confirm volume translation viability at enterprise scale: AWS/Smartling case study achieved 26% BLEU improvement with 30% reduction in human editing and 15x cost savings; Translated.com reports multiple independent deployments (Asana 70% automation/30% effort reduction, NordVPN 43% sales increase across 24 locales, Airbnb 31 new languages, Cricut 100+ minute video in 5 languages in 2 weeks); Smartling Fortune 100 deployment delivered $3.4M annual savings, 50% faster cycles, 99%+ quality. Crowdin case studies show breadth: Snov.io (3M+ users, 14 languages), Turo (90% faster, 98% cost reduction), Strava (150M users, 6-week rollout). Platform maturity solidified: Lokalise's May 2026 Spring release introduced AI Orchestration Layer with MCP Server integration achieving 80% first-pass publish-ready translations; Crowdin Enterprise now available on AWS Marketplace with AI Pipeline presets and custom context instructions, signaling vendor parity on agentic workflows. Enterprise-scale deployment at Lyft achieved 99% automation coverage with 30-min SLA and days-to-minutes turnaround with human-in-the-loop model. Adoption baseline: 65% of enterprises incorporate AI-assisted translation and 74% prioritize AI automation (TransPerfect 2026 Business Outlook). Scale is operationally viable.

Governance and adoption friction remain binding constraints. Nucleus Research quantified 80–90% cost reduction ROI but identified governance gaps: ungoverned tool sprawl creates compliance risk and quality variance. A survey of 400+ translation decision-makers found 79% had incorporated AI into core infrastructure, yet only 57% maintained consistent brand voice across languages—and the ROI split ran nearly even, with 48% reporting gains and 52% seeing none. A critical adoption contradiction: 86% of enterprises report AI accelerates content production, yet 65% report AI slows localization workflows due to 21% rework overhead. Named-enterprise challenges persist: Adobe JAPAC executive (20-year cross-cultural marketing career) documents real deployment problems of AI over-generalization, cultural nuance loss, governance risks, and regulatory variation across markets. Regulated industries face systematic risks with documented failures: healthcare AI translation governance gap resulted in FDA/HIPAA/Title VI compliance failures in 2024, with no clear ISO 18587 MTPE certification standard adoption. Peer-reviewed research from Frontiers in Psychology documented that translators' AI perception triggers lower trust and over-editing even when quality is equivalent. A GAO report on the National Weather Service documented the kind of failure that erodes trust: "rip current" rendered as "hangover current" in Spanish-language alerts. Peer-reviewed research confirms structural causes: non-English language pairs show lower COMET scores due to Token Activation Rate underrepresentation in training data; ICML 2026 proposes LiRA framework to address low-resource capability gap. Forrester warned that LLM safeguards degrade sharply in non-English and low-resource languages, a structural problem given that roughly half of LLM training data is English. Technical analysis confirms: hallucination rates jump 15–35% in non-English and spike to 38 percentage points in low-resource languages due to pretraining imbalance. The localization industry itself shows strain: Kobalt interview data identified 'gap between AI promise and reality' as the defining 2026 tension. Volume translation works. Governance, regulatory compliance, cultural appropriateness, adoption friction, and language-model disparity—everything else—remains the binding constraint.

TIER HISTORY

ResearchJan-2020 → Jan-2020
Bleeding EdgeJan-2020 → Jul-2024
Leading EdgeJul-2024 → present

EVIDENCE (120)

— Documented healthcare governance failure case (2024); identifies systematic FDA/HIPAA/Title VI risks—critical negative signal showing adoption barriers in regulated sectors despite volume translation maturity.

— Named Adobe executive documents real deployment challenges: AI over-generalization, cultural nuance loss, governance gaps, and regulatory variation—evidence of binding constraints blocking next-tier adoption.

— Peer-reviewed analysis of LLM translation failure across 22 language pairs reveals structural cause: non-English pairs show lower COMET scores due to Token Activation Rate underrepresentation in training data.

Latest Product Updates (May 2026)Product Launches

— Lokalise AI Orchestration Layer enables MCP-based agentic workflows with 80% first-pass publish-ready translations, marking infrastructure transition from manual localization to orchestrated multi-model parallel processing.

— Enterprise adoption baseline: 65% incorporate AI-assisted translation, 74% prioritize AI automation. Direct signal of infrastructure shift from specialty tool to production baseline.

— ICML 2026 research on LiRA framework addresses structural low-resource language gap through improved multilingual LLM adaptation, signaling technical progress on binding constraint.

— Lyft enterprise deployment achieved 99% automation coverage with 30-min SLA and days-to-minutes turnaround, demonstrating production-scale viability at Fortune 500 level with human-in-the-loop model.

New at Crowdin: April 2026Product Launches

— Crowdin Enterprise on AWS Marketplace with AI Pipeline presets and custom context instructions confirms vendor ecosystem maturity and agentic workflow parity across platforms.

HISTORY

  • 2020: AI-augmented translation moving into enterprise production; NMT quality improving but still unreliable for marketing; transcreation remains human-specialist work; vendors securing Fortune 500 customers.
  • 2021: Vendor platforms maturing with specialized features (Smartling+, Lilt Instant Translate for government); research documenting cultural adaptation complexity and data contamination issues; practitioner surveys showing quality and ROI barriers despite vendor claims.
  • 2022-H1: Weglot reaches 60K website deployments; consumer adoption of online translation tools exceeds 55% in LatAm, driving demand. SmileDirectClub case study validates human-in-the-loop ROI (58% cost reduction). Analyst reports and vendor examples document persistent barriers: transcreation still resists automation, brand failures from translation-only approaches, and cost viability challenges (Microsoft discontinues AI translation feature). Tier-defining tension: automation works for straightforward translation but not cultural adaptation.
  • 2022-H2: Vendor platforms accelerated feature rollouts and adoption metrics; Smartling expanded NMT Hub to all customers with multi-engine integration; Lilt reported 1M+ documents translated with 1000% MT volume increase and Gartner recognition. Marketing-specific quality studies emerged: Weglot/Nimdzi evaluation found 85% of MT translations acceptable for consumer content. However, peer-reviewed research documented persistent creativity and nuance barriers: academic study showed MT-generated translations unfit for publication, and clinical research found AI unable to accurately handle figurative language. Vendor adoption barriers persisted despite product maturity—quality suitable for website localization but not marketing transcreation or culturally sensitive campaigns.
  • 2023-H1: Vendor innovation accelerated with Smartling's generative AI integration claims and Lilt's new Lilt Create product for content creation and localization. Enterprise deployment continued: Farfetch adopted MT for high-volume localization. Industry consensus (CSA Research forum) reflected on market direction. Critical barriers persisted: documented limitations in AI handling of language nuance and cultural adaptation; marketing translation failures highlighted cost of insufficient localization. Automation continued to excel at volume translation but remained unsuitable for cultural adaptation and transcreation.
  • 2023-H2: Vendor innovation shifted toward multimodal localization and enterprise controls. Lilt launched Contextual AI Engine (Dec 2023) claiming GPT-4-parity performance. Video localization gained traction: Rosetta Stone achieved 5x ROAS with AI video translation; Lilt partnered with CaptionHub for multilingual subtitling. Smartling integrated with marketing platforms (Iterable) for end-to-end localization. However, critical limitations remained visible: Reuters documented systematic AI translation failures in U.S. asylum processing; brand case studies highlighted mistranslations from over-reliance on automation; peer-reviewed research confirmed AI gaps in cultural appropriateness and nuance. Tier-defining tension persisted: automation proven at scale for volume and video content, but cultural adaptation and creative localization remained unsuitable for full automation.
  • 2024-Q1: Smartling reported 40% translation business growth driven by AI Translation adoption, delivering 10x faster output at fraction of human cost—signaling strong enterprise deployment momentum. However, documented evidence of persistent limitations emerged: marketing agencies documented specific risks of AI in multicultural campaigns (accuracy loss, idiom failure, DEI risks) and sports domain specialists highlighted domain-specific failures despite vendor capability claims. Tier-defining tension sharpened: enterprise adoption at scale for high-volume translation, but cultural nuance and creative localization remained problematic for marketing-critical content.
  • 2024-Q2: Enterprise adoption momentum continued with Lightricks achieving 120% improvement in localization delivery rates through AI-assisted translation workflows. However, research from Aalto University and critical labor-market data surfaced persistent barriers: peer-reviewed evidence documented cultural bias in AI translation requiring additional training; Society of Authors survey found 36% of UK translators lost work to AI, with 43% experiencing income decline; healthcare professionals showed hesitation on AI validation (57% unsure/opposed). Market data showed 75% consumer preference for native-language content and 70% user dissatisfaction with cultural nuance handling, confirming tier-defining tension: automation proven for efficiency and high-volume content, but cultural appropriateness and creative localization remain problematic at scale.
  • 2024-Q3: Production-scale deployments advanced across major platforms and public institutions (Reddit 44% user growth attribution, Minnesota OpenAI rollout) alongside evidence of adoption friction and project abandonment. Gartner predicted 30% of GenAI projects abandoned by end-2025 due to ROI and data quality challenges; critical journalism documented AI failures in content-adjacent creative tasks. Academic research confirmed transcreation limitations in GPT-4 despite improving assistive capability. CSA Research synthesis identified strategic shift toward "Creative Language Intelligence" with elevated translator roles. Tier-defining tension persisted: volume translation proven viable at scale, but cultural adaptation and creative localization remained problematic, requiring strategic deployment rather than broad automation.
  • 2024-Q4: Production deployments continued (Personio 40% budget savings, Polhus 75% AI-ready rate across 1.6M words) but adoption momentum decelerated. CSA Research reported 2023 as "peak localization" with 40% of LSP CEOs reporting service decline and 39% of enterprises pausing spend—signal of strategic pause despite vendor feature releases. Forrester predicted traditional language support models becoming "old-fashioned." Practitioner survey (Middlebury, 450 respondents) recorded 5.69/10 mixed sentiment; consensus on AI-as-partner models rather than displacement. Academic and vendor analyses consistently flagged persistent limitations: cultural insensitivity from biased training data, inability to handle idioms and figurative language, ethical concerns around automation. Microsoft Translator discontinuation signaled ecosystem consolidation. Tier-defining tension unresolved: high-volume, lower-stakes content proven automatable, but cultural appropriateness and nuanced messaging remained problematic at scale.
  • 2025-Q1: Enterprise adoption of AI translation accelerated with new production deployments (European law enforcement using LILT for high-volume, time-sensitive translation) and evidence of rapid adoption growth (533% increase in AI translation adoption across 3,000 companies per Lokalise analysis). However, adoption friction remained pronounced: Slator's 2025 Localization Buyer Survey found 38% of localization buyers cited inefficient AI use as their top cost inefficiency, signaling continued integration challenges despite rising deployment volume. Peer-reviewed research documented persistent cultural sensitivity gaps: Macao Polytechnic study of Portuguese-Chinese translation via DeepL and ChatGPT confirmed AI tools fail to capture cultural nuances, idioms, and emotional depth. High-profile failure examples persisted: Amazon 'rape oil' translation scandal illustrated brand damage risks from over-reliance on automation without human review. Tier-defining tension crystallized: adoption and volume metrics accelerating, but quality and cultural appropriateness barriers remained unresolved for marketing-critical content; enterprises increasingly facing paradox of faster AI translation against need for human oversight on cultural adaptation.
  • 2025-Q2: Enterprise adoption accelerated with market baseline shift toward AI-assisted translation. Forrester reported 70% of translations now machine-assisted; Lokalise survey (500 leaders) found 55% using AI for localization, 81% planning hybrid models within a year. Machine translation post-editing adoption reached 46% among LSPs (up from 26% in 2022)—signaling AI as production baseline. Case study evidence documented ROI: Smartling customers achieved 60% cost reduction and 95% turnaround improvement; Orange County Superior Court deployed custom AI translation tool for high-stakes legal context. However, implementation barriers crystallized: vendor analysis identified six persistent risks (accuracy gaps, security constraints, brand voice loss, workflow bottlenecks, overhyped expectations, ethical compliance), and 63% of adopters acknowledged human review essential for quality. Adoption of AI for cultural adaptation and creative localization remained underdeveloped; Nimdzi buyer analysis showed significant interest but adoption beyond core translation still developing. Tier-defining tension sharpened: volume translation proven viable with cost/speed ROI, but cultural appropriateness, creative messaging, and nuanced content remained problematic at scale without human oversight.
  • 2025-Q3: Ecosystem matured with vendor entry into transcreation automation and empirical evidence on AI's role in cultural adaptation. MotionPoint launched AI-powered Transcreation platform (Sep 2025) for marketing with brand voice protection. Peer-reviewed research contradicted earlier transcreation-as-human-only assumptions: GPT-3 training improved cultural adaptation quality, with trained students surpassing professionals (Hassani et al.); ChatGPT demonstrated efficiency in health campaign adaptation but required human judgment for sensitive content (Gutiérrez-Artacho et al.). However, critical assessments documented persistent barriers: market analysis (174 sources) showed 85% of accuracy errors stem from AI misunderstanding local context; BLEND reported 60-85% AI accuracy vs. 95%+ professional, with ~40% cultural-phrase misinterpretation vs. <5% human error. Adoption metrics remained strong (70% machine-assisted, 55% of leaders using AI, 81% planning hybrid) but implementation focus shifted from cost-driven translation to quality-driven cultural adaptation. Tier-defining tension persisted: volume translation and initial transcreation capability proven, organizational readiness increasing, but persistent accuracy and context gaps kept practice at leading-edge with integration complexity and cultural appropriateness gaps unresolved.
  • 2025-Q4: Ecosystem reached critical juncture with advanced deployment patterns emerging and adoption barriers hardening. LILT deployed real-time fine-tuning via NVIDIA GPUs (Dec 2025) for government agencies with 30X throughput gains, demonstrating enterprise-scale production viability. Peer-reviewed research (Oct 2025) found Apple and Sunstech websites showed AI efficient for volume but lacking cultural nuance and creative adaptation. Industry expert analysis (Dec 2025) observed workflow evolution from "human-in-the-loop" toward "human-on-the-loop" in some contexts, with concern that internal AI adoption was bypassing localization teams. High-profile cultural failure: Apple removed culturally misinterpreted imagery from iPhone 17 Air Korea campaign, exemplifying transcreation automation inadequacy. Critical analyses intensified: LLM translation outperforms traditional NMT in research (WMT25) but production adoption lags due to technical debt; many hybrid systems create operational overhead without clear ROI. Metrics and errors: LARA AI 2.4 errors/1000 words vs. professional <0.5; Shopify data showed 30% of 2024 localization failures from AI over-reliance despite 10-15% conversion uplift from localization. Tier-defining tension crystallized: volume translation proven viable operationally, transcreation tooling entering market, but persistent gaps in cultural appropriateness and creative messaging kept practice at leading-edge with hard ceiling on full-spectrum mainstream adoption.
  • 2026-Jan: Enterprise adoption accelerated into infrastructure and government contexts with strong volume metrics but persistent governance and quality barriers. Smartling reported 218% YoY growth in AI translation volume, with Fortune 500 clients achieving 3x output and 60% cost reductions, signaling shift from experimentation to production deployment. However, Zogby survey (400+ leaders) found 79% adopted AI but only 57% maintained brand voice consistency, with ROI split 48%/52%, revealing adoption-outcome gap. Government deployment expanded: National Weather Service and other federal agencies deployed AI in production but GAO report documented cultural failures ("rip current" → "hangover current") requiring human review. Pronto Translations' assessment of thousands of real projects documented hallucinations, cultural misalignment, and stylistic flattening, advocating human-led hybrid model as necessary safeguard. RWS documented AI dubbing maturity for video localization (90% cost reduction, months-to-days cycles), showing multimodal expansion. XTM webinar (80% of leaders prioritizing practical implementation) noted LLM language imbalance (half training data English) limiting non-English quality. Tier-defining tension persisted: volume translation operationally viable at scale, but governance, cultural appropriateness, and brand consistency remained binding constraints on broader adoption.
  • 2026-Feb: Volume translation infrastructure solidified with new deployment patterns (3.9M words in 4 days via parallel agents, Lokalise Custom AI Profiles GA with RAG brand adaptation), but critical governance and cultural barriers hardened into permanent constraints. Peer-reviewed study (Appen, Feb 2026) confirmed LLMs fail on cultural idioms/puns in marketing (GPT-5 ~67% on cultural items), and acoustic failures (Korean 72% truncation in large-scale deployment). Kobalt interview-based report revealed adoption paradox: MTPE at ~46% but 90% of localization leaders 'exhausted' by change, identifying 'gap between AI promise and reality' as defining 2026 tension. Forrester warned LLM safeguards collapse in non-English and low-resource languages with governance failures. Deployment patterns emerging: orchestrated parallel translation now viable for volume, RAG-enhanced brand voice protection showing viability, but organizational exhaustion and quality inconsistency remained structural barriers to next-tier adoption.
  • 2026-Q2 (Mar-Apr): Enterprise production deployment validated with IBM achieving 50% time reduction, 40% quality improvement, and 99.5% automation at scale (170+ countries). Workforce bifurcation accelerated: CIOL data showed 88% of freelancers using MTPE with 70% work volume decline; $71.7B market growing 6-9% annually with clear winner/loser pattern (volume translation commoditizes, specialized domains—gaming $5.14B, legal, medical—thrive). Adoption breadth confirmed at 95% enterprise level, yet 1-in-5 report quality incidents. Critical barriers persisted across independent assessments: Slator documented 33-60% hallucination rates, 30-50% COMET degradation in low-resource languages; Meta NLLB candidly reported low-resource translation 'significantly below standard' and transcreation 'firmly in human territory'; Global Voices documented systematic language bias affecting 79% of low-resource speakers. Low-resource research advanced: LLM fine-tuning approaches demonstrated synthetic dataset viability (7,995 pairs achieving CHRF++ 24.38→32.02). Tier-defining tension crystallized: volume translation operationally mature at enterprise scale, but low-resource language capability, transcreation automation, and governance readiness remain binding constraints blocking full mainstream transition.
  • 2026-May: Volume translation ROI at enterprise scale is now extensively benchmarked: AWS/Smartling achieved 26% BLEU improvement, 30% editing reduction, and 15x cost savings; Smartling Fortune 100 deployment delivered $3.4M annual savings and 50% faster cycles; Crowdin case studies show Turo at 98% cost reduction and Strava at 6-week rollout for 150M users. Lokalise's Spring 2026 release introduced an AI Orchestration Layer with MCP Server integration achieving 80% first-pass publish-ready translations, signaling agentic workflow maturity across major platforms. Governance barriers sharpen with new evidence: regulated-industry AI translation failures (documented 2024 healthcare FDA/HIPAA/Title VI compliance gaps) and Adobe JAPAC deployment challenges (AI over-generalization, cultural nuance loss, regulatory variation) confirm adoption ceilings beyond commodity translation. Peer-reviewed structural analysis (22 language pairs) confirms non-English COMET score degradation is caused by Token Activation Rate underrepresentation in LLM pretraining data. Cultural performance ceiling remains hard: the best-performing LLM achieves only 44.48% accuracy on culturally-grounded tasks across 14 languages and 51 regions (CulturALL benchmark); hallucination rates run 15–35% higher in non-English and spike 38 points in low-resource languages. A critical adoption contradiction surfaces: 86% of enterprises report AI accelerates content production, yet 65% report AI slows localization workflows due to rework overhead consuming 21% of total localization budgets.