The AI landscape doesn't move in one direction — it lurches. Some techniques leap from experiment to table stakes in a single quarter; others stall against regulatory walls, technical ceilings, or organisational inertia that no amount of hype can dislodge. Knowing which is which is the hard part. The State of Play cuts through the noise with a rigorously maintained index of AI techniques across every major business domain — classified by maturity, evidenced by real-world adoption, and updated daily so you always know where you stand relative to the field. Stop guessing. Start knowing.
A daily newsletter distilling the past two weeks of movement in a domain or two — delivered to your inbox while the index updates in the background.
Each dot marks the weighted maturity of practices within a domain — hover for a brief summary, click for more detail
AI-powered search and retrieval-augmented generation across internal documentation and enterprise systems. Includes cross-system search federation and context-aware answer generation; distinct from domain-specific RAG which targets specialised corpora rather than general enterprise knowledge.
Enterprise search and RAG is a proven practice with mature tooling, documented ROI, and broad adoption -- yet one where execution discipline, not technology, now determines success or failure. The pattern combines keyword and semantic retrieval with LLM-powered answer generation over proprietary data, grounding generative AI in internal knowledge rather than public corpora. Hybrid retrieval (vector plus BM25) settled as the production standard after vector-only approaches proved unreliable for exact matches, structured data, and multi-hop reasoning. The architecture has begun to evolve: agentic RAG (where agents decompose queries and refine retrieval iteratively) is emerging as the 2026 foundational pattern for complex enterprise questions, and research from Amazon Science shows agentic keyword-based retrieval can match pure-vector RAG performance without dedicated vector databases—signaling architectural diversification away from vector-centric assumptions. The technology works. The harder problem is everything around it: chunking strategy, document quality, governance, cost control, evaluation frameworks, and the persistent demo-to-production gap. Forty-five percent of enterprise AI deployments now incorporate RAG, and adopters report strong economics, but independent analysis shows 80% of enterprises fail critically at RAG implementation—with 73% of failures originating at retrieval layer and knowledge base quality as the binding constraint, not generative capability. Cost sustainability emerged as acute: five documented enterprise failures totalling $23M+ in losses stemmed from RAG abandonment or cost explosion, and 72% of implementations report failing within first year due to uncontrolled infrastructure expenses. The practice's defining tension is this gap between technological maturity and organisational readiness -- a gap that has stalled further tier advancement despite a market at $1.94B (2025) and projecting $9.86B by 2030.
The infrastructure layer is production-grade and still improving. Elasticsearch 9.3 shipped bfloat16 vector compression (halving storage) and GPU-accelerated indexing with 12x throughput gains. Azure AI Search added agentic retrieval (GA April 2026) with expanded knowledge sources including OneLake and SharePoint, and demonstrated cost-transparent multi-step RAG with transparent pricing models ($4.32 per complex query). Databricks rebranded Vector Search to AI Search (June 2026) with full-text search capabilities, signalling platform evolution beyond pure vector retrieval. Vector database adoption surged 377% year-over-year, with RAG now the primary use case driving adoption in 2026. These are not early-adopter tools -- they are GA platform features embedded in mainstream enterprise stacks. Gartner analyst forecasts show 60% of enterprises will deploy 6+ enterprise search platforms by 2028, with 60% embedding AI search into applications (3x increase). Architectural evolution is underway: agentic RAG (multi-step query decomposition with iterative retrieval refinement) is emerging as the 2026 production baseline for complex enterprise queries, with research demonstrating 62% hallucination reduction compared to naive RAG; Amazon Science and industry benchmarks show agentic keyword-based retrieval achieves >90% of traditional RAG performance without dedicated vector databases, suggesting future diversity in retrieval architectures beyond vector-centric designs.
Yet adoption metrics obscure a critical execution gap. The surface story looks confident: 92% of RAG adopters report ROI within 12 months, averaging 3.2x return. But analysis reveals 80% of enterprises fail critically at RAG implementation, with 73% of failures originating at the retrieval layer, not generation. Of roughly 1,000 enterprises that attempted RAG deployments through 2025, only about 200 succeeded. The pattern that emerges is not technology risk but operational neglect and knowledge base quality: 70% of deployments lack systematic evaluation frameworks, 30-40% of infrastructure budgets are wasted on poorly observed pipelines, and knowledge base quality (document freshness, authority clarity, structural preservation) has emerged as the binding constraint determining success or failure—not retrieval algorithms or embedding quality.
Document quality remains the most underestimated barrier. Standard chunking destroys the logical structure of technical documents -- tables, cross-references, embedded images -- producing hallucinations even when retrieval is technically correct. Five documented enterprise RAG abandonment cases totalled $23M+ in losses, with root causes including stale regulatory data ($12.2M trading loss), policy versioning failures (30% diagnostic error spike in healthcare), and cost explosion (87% of enterprise RAG systems report failing within first year due to uncontrolled infrastructure expenses). Recent production analysis identifies five specific knowledge quality failure modes: recency decay (superseded documents retrieved as current), authority ambiguity (no version markers), structural loss (tables flattened, labels detached), relationship fragmentation, and versioning confusion. The emerging discipline of RAGOps attempts to address this by treating retrieval pipelines as production systems requiring monitoring, governance, and lifecycle management rather than one-time integrations, with risk-controlled data flywheel architectures that integrate OCR, semantic chunking, verification layers, and governance from day one. Evaluation frameworks (six-layer maturity models evaluating corpus quality, retrieval accuracy, groundedness, task success, latency/cost, and escalation design) are now standard in mature deployments. For organisations willing to invest in that operational discipline, enterprise RAG delivers 12-18% precision improvement through hybrid search, 69% error reduction from contextual compression, and 75% accuracy gains on complex regulatory documents via agentic reasoning. For those expecting turnkey results, the failure rate remains punishing.
— Comprehensive synthesis of enterprise RAG failures Nov 2025–May 2026: 72–80% implementation failure rates, 51% of enterprise AI failures are RAG-related, and critical finding that retrieval quality (not model size) drives hallucination.
— Named enterprise deployment (Myntra e-commerce): RAG optimizations reduced latency from 8.5ms to 0.8ms, scaled to 500K personalization operations/second, demonstrating real-world production RAG maturity.
— Practitioner failure patterns with empirical backing: McKinsey (20% workday lost to search), Gartner (50% GenAI project abandonment by Jan 2026), MIT (95% pilots zero P&L impact). Prescribes 6-month adoption metrics over demo success.
— Peer-reviewed research on real Wyoming DoT corpus: accuracy collapsed from 75% to 40% scaling from 54 to 1,128 documents. Domain-scoped retrieval recovered P@10 from 0.77 to 0.86 with quantified solution.
— Production deployment: Independent enterprise AI company achieved 84% accuracy using Bedrock+Claude 3 in <5 months, handling 60–80% of customer service queries autonomously with 10.5% higher accuracy than competitors.
— Major vendor GA: Gemini Enterprise agentic RAG with iterative retrieval achieved 90.1% accuracy on FramesQA (34% improvement vs standard RAG) with Sufficient Context Agent addressing multi-hop reasoning gaps.
— Santiago & Company released Enterprise RAG Gold Standard benchmark showing retrieval (not model capability) determines success. Cites Gartner: 50% of GenAI projects abandoned post-POC by Jan 2026.
— Academic framework identifying RAG's architectural mismatch with hierarchical legal structure: mereological blindness (part-whole relationships), diachronic blindness (temporal dynamics), causal opacity. References real court failures with fabricated citations.
2023-H1: RAG emerged as standard enterprise AI pattern; major vendors announced production tooling (Databricks, Elastic, Azure) addressing deployment challenges. Academic analysis documented limitations and scenarios requiring RAG. Practitioner critique identified tunnel vision toward vector search; hybrid retrieval approaches gaining attention.
2023-H2: Cloud providers shipped production RAG infrastructure; Azure rebranded to Azure AI Search with vector search GA. Elasticsearch deployed RAG in production (Support Hub). Evaluation frameworks (Ragas) and quality benchmarks proliferated, addressing production measurement gaps. Prototype-to-production gap identified as primary adoption barrier.
2024-Q1: RAG stabilized as production standard with documented deployment scale (10TB+ docs/day, 500M+ embeddings, 100k+ users). Market reached $1.35B with 40%+ projected CAGR. Five critical barriers crystallized: retrieval method selection (hybrid > vector-only), prompt engineering, data quality/chunking, evaluation frameworks (RAGAs now peer-reviewed), and performance scaling. Vector-only approaches widely recognized as insufficient.
2024-Q2: Enterprise RAG deployment accelerated with proven adoption at scale: Azure achieved 88% cost-per-vector reduction; KPMG and AT&T deployed to 40k+ and 80k+ users respectively. Simultaneously, independent research revealed production risks: Stanford study found 17-34% hallucination rates in legal RAG tools, while peer-reviewed industry deployments confirmed RAG effectiveness with proper architecture. Hybrid retrieval and data quality emerged as enforced operational requirements, not options.
2024-Q4: Enterprise search & RAG transitioned to mainstream adoption: Menlo Ventures survey reported 28% adoption across 600 enterprise leaders, with RAG now used by 73% of production LLM systems (McKinsey). Platform maturation accelerated—Azure AI Search launched agentic retrieval GA and enterprise security features; Elastic's internal deployment achieved 75% relevance improvement. Research emphasis shifted from architecture to content design discipline; academic analysis identified data governance and security integration as primary adoption barriers. Market attention moved toward agentic search capabilities as evolution beyond basic RAG.
2025-Q1: Mainstream enterprise RAG adoption revealed implementation headwinds: WRITER survey showed only ~33% ROI despite $1M+ annual investment, with 68% of C-suite reporting organizational friction from AI adoption. Technical barriers persisted: Salesforce's HERB benchmark revealed enterprise RAG struggles with multi-hop reasoning over heterogeneous data (documents, transcripts, messages, code), with best agentic methods achieving only 33% performance and retrieval identified as bottleneck. Security gaps remained critical: 13% of enterprises reported AI breaches with 97% lacking proper access controls, demonstrating unresolved governed RAG implementation despite frameworks existing. Product evolution continued with Azure AI Search expanding agentic capabilities and Elasticsearch maturing observability for RAG deployments. Bifurcation emerging between highly-committed Fortune 500 deployments scaling to 40k+ users and mainstream enterprises struggling with proof-of-value and operational adoption.
2025-Q2: Product maturity accelerated with AWS Bedrock custom metrics GA and Azure production deployments (Japan Digital Design's operational case study). However, real-world deployment studies quantified critical quality failures: RAG systems in banking and insurance achieved only 71% citation accuracy and 23% incorrect answers despite perfect retrieval; domain-specific embedding fine-tuning remained mandatory. Infrastructure failures cited in 87% of failed implementations (Gartner), with vector database misconfigurations, monitoring gaps, and backup procedures unresolved. Market consensus shifted decisively: enterprise RAG's barrier was no longer algorithmic but organizational—governance, security integration, operational discipline, and change management remained blocking factors. Bifurcation deepened between Fortune 500 scaling toward governed agentic RAG and mainstream enterprises stuck in proof-of-concept.
2025-Q3: Market growth continued with RAG market at USD 1.92B and 39.66% CAGR projected to 2030 (Mordor Intelligence); Gen AI adoption among enterprises accelerated to 30% scaling (5x growth from 2023, Capgemini). However, critical execution gaps persisted: Salesforce HERB benchmark revealed enterprise RAG quality failures with agentic RAG achieving only 32.96/100 on heterogeneous data, with retrieval as core bottleneck. Cost sustainability emerged as acute problem—72% of enterprise RAG implementations reported failing within first year due to uncontrolled infrastructure expenses; governance and formal policies remained critically underdeveloped (46% of organizations). Quality assurance challenges documented: deployment failures attributed to poor document quality, lack of evaluation loops, and absence of reranking strategies. Market bifurcation endured: Fortune 500 organizations advanced toward governed agentic retrieval with operational discipline, while mainstream enterprises remained blocked by execution complexity, cost overruns, and ROI realization barriers.
2025-Q4: Enterprise RAG market continued growth ($2.33B in 2025, projected 42.7% CAGR to 2035) amid persistent execution challenges. GenAI enterprise spending accelerated sharply to $37B in 2025 (3.2x from 2024), with 76% of AI use cases purchased rather than built internally. RAG became consolidated as the primary production use case for internal enterprise AI—most companies building internal AI systems used RAG pipelines—yet quality and sustainability barriers mounted. Independent failure analysis documented high attrition: 42% of enterprise AI use cases failed in 2025, with 51% of those failures being RAG implementations (S&P Global), only 200 of 1,000 enterprises successfully deploying RAG. Document quality emerged as primary execution barrier: 40% of RAG implementations failed due to poor OCR, inconsistent formatting, and absence of domain-specific fine-tuning; semantic search failed 15-20% of the time in specialized domains (banking, insurance, legal) despite hype around vector embeddings. Practitioner adoption accelerated (36% of developers learning RAG per Stack Overflow 2025 survey) while quality concerns persisted (75% of developers wanted human validation of AI outputs). By year-end, enterprise RAG had consolidated as established practice with proven technology but unresolved organizational, cost, and operational implementation barriers.
2026-Jan: Enterprise RAG adoption plateaued with infrastructure maturation but persistent execution barriers. GenAI usage reached 71% organizational penetration (up from 65% in 2024), with vector database adoption surging 377% year-over-year supporting RAG workloads; hybrid retrieval (vector + BM25) became industry standard achieving 20-40% better quality vs vector-only. Production platforms matured: Elasticsearch 9.2 shipped AI Agent Builder and DiskBBQ optimization; AWS and Azure continued expanding observability tooling. However, 70% of RAG deployments lacked systematic evaluation frameworks leading to silent degradation, and critical barriers persisted: 30-40% of infrastructure budgets wasted due to cost visibility gaps, data engineering challenges (governance, document quality, fragmentation) overshadowing technology maturity, only 17% of organizations realizing 5%+ earnings from GenAI despite widespread deployment. Bifurcation sharpened between Fortune 500 organizations optimizing adaptive retrieval and agentic capabilities vs mainstream enterprises stuck with POC execution gaps and ROI realization challenges.
2026-Feb: Platform maturity continued: Elasticsearch 9.3 GA introduced bfloat16 vector compression (50% storage reduction) and GPU acceleration (12x vector indexing throughput); Azure AI Search expanded agentic retrieval capabilities with portal support for new knowledge sources and reasoning effort tuning. Market consolidation deepened: 45% of enterprise AI deployments incorporated RAG (up from 15% in 2023), with 92% of adopters reporting ROI within 12 months (3.2x average return), confirming mainstream adoption trajectory. However, measurement blind spots emerged as critical risk: 87% of enterprises focused on answer quality metrics while neglecting infrastructure health (data freshness, governance, pipeline reliability), creating silent failure modes in production systems. Document processing challenges persisted: standard chunking strategies destroyed logical structure in technical documents (tables, images, captions), necessitating semantic chunking and multimodal approaches for reliable enterprise deployments. By month-end, consensus solidified around RAGOps as operational discipline, addressing production reliability gaps and establishing enterprise RAG as technology with proven economics but unresolved execution complexity.
2026-Mar: Enterprise RAG consolidated as production infrastructure at significant scale: enterprise search market valued at $7.76B growing to $16.41B (11.3% CAGR), with RAG now at 30-60% of enterprise AI use cases and 87% of enterprises with AI in production (up from 31% in 2020). Capacity's Azure AI Search deployment achieved 97% accuracy with 4.2x cost reduction; Ruhrkohle AG deployment achieved 40% search time reduction. Slack AI native enterprise search reached GA, connecting 55+ data sources with permission-aware results and federated architecture. A critical structural tension crystallized: industry research found enterprises feel they "cannot live without RAG, yet remain unsatisfied"—architecture proven, execution barriers unresolved, with EU AI Act imposing 15-30% performance reduction risk for BFSI (26% of the market). The core execution challenge remains unchanged: governance, document quality, and evaluation discipline continue to determine whether deployments sustain or degrade.
2026-Apr: Production RAG maturity shifted focus to evaluation infrastructure and failure-mode taxonomy. A six-layer enterprise evaluation framework (corpus quality, retrieval accuracy, groundedness, task success, latency/cost, escalation design) emerged as the practical standard for mature deployments; hybrid search with contextual compression reported 69% error reduction and 12-18% retrieval precision gains over vector-only approaches. Agentic knowledge graph architectures achieved 75% accuracy improvement on complex regulatory corpora (Code of Federal Regulations), signalling a structural split between standard RAG for general enterprise knowledge and graph-augmented RAG for multi-hop reasoning over regulated domains. Vector database benchmarks (April 2026) across Pinecone, Weaviate, Qdrant, and Milvus now provide standardised latency, recall, and cost comparisons for enterprise selection decisions; Meta published a peer-reviewed HUMBR technique reducing hallucinations in enterprise RAG workflows; and regulated pharma enterprises are increasingly building internal RAG pipelines rather than purchasing commercial solutions to meet governance requirements.
2026-May: Infrastructure expansion and execution failures advanced in parallel. Databricks announced GA of Mosaic AI compound system capabilities including Agent Bricks, MLflow evaluation, Vector Search, and governance tooling; Atlassian opened the Teamwork Graph to push AI agents into enterprise workflows, with Mercedes-Benz reporting 10x faster software delivery in production (across 75% of Fortune 500 and 90% of enterprise cloud customers). RAGAS became the de facto quality framework with AWS, Microsoft, Databricks, and Moody's running 5M+ monthly evaluations collectively; Uber deployed OpenSearch at 1.5B-item scale. Market sizing confirmed mainstream status: $1.94B in 2025, projected $9.86B by 2030; Gartner forecasts 60% of enterprises will deploy 6+ enterprise search platforms and 60% of applications will embed AI search by 2028. Named deployment from Ontop showed 130 hours/month saved and legal response time reduced from 20 minutes to 20 seconds via RAG. Despite this, independent analysis confirmed 80% of enterprises fail critically at RAG implementation, with 73% of failures originating at the retrieval layer rather than the generation layer—a persistent structural gap that infrastructure maturity alone has not closed.
2026-Jun: Architectural alternatives, failure-rate data, and knowledge base quality emerged as the month's defining signals. A comprehensive synthesis of enterprise RAG deployments (Nov 2025–May 2026) documented 72–80% implementation failure rates, with 51% of enterprise AI failures traced to RAG—and confirmed that retrieval quality, not model size, drives hallucination. Gartner data reinforced this: 50% of GenAI projects were abandoned post-POC by January 2026, with McKinsey finding 20% of the workday still lost to search despite widespread RAG deployment. Amazon Science published research demonstrating agentic keyword-based retrieval achieves >90% of traditional RAG performance without vector databases; Alibaba Cloud's production billion-scale Elasticsearch hybrid RAG achieved 95% cost reduction with 7–8x performance gains via BBQ quantization—together signalling diversification away from vector-centric assumptions. On the success side, Myntra's production RAG deployment reduced personalization latency from 8.5ms to 0.8ms and scaled to 500K operations/second; Google Gemini's Sufficient Context Agent achieved 90.1% accuracy on FramesQA (34% improvement over standard RAG) at GA. Peer-reviewed research (TechScience) formalised the risk-controlled data flywheel architecture as the production standard for regulated domains. Critical failure-mode analysis reinforced that knowledge base quality—not retrieval algorithms—remains the binding constraint: five enterprise abandonment cases traced to recency decay, authority ambiguity, and structural loss in document processing, with 73% of RAG failures originating at the retrieval layer.