The AI landscape doesn't move in one direction — it lurches. Some techniques leap from experiment to table stakes in a single quarter; others stall against regulatory walls, technical ceilings, or organisational inertia that no amount of hype can dislodge. Knowing which is which is the hard part. The State of Play cuts through the noise with a rigorously maintained index of AI techniques across every major business domain — classified by maturity, evidenced by real-world adoption, and updated daily so you always know where you stand relative to the field. Stop guessing. Start knowing.
A daily newsletter distilling the past two weeks of movement in a domain or two — delivered to your inbox while the index updates in the background.
Each dot marks the weighted maturity of practices within a domain — hover for a brief summary, click for more detail
AI-powered semantic search and question answering across large codebases, going beyond keyword matching. Includes tools that answer questions about architecture, dependencies, and usage patterns; distinct from documentation generation which produces static artefacts.
Code search and codebase Q&A has reached inflection: the practice is now table-stakes infrastructure for enterprise AI development, yet fundamental architectural limitations persist. Semantic code search—asking natural language questions about architecture, dependencies, and usage patterns—addresses a real bottleneck (developers spend ~15% of time on discovery), and deployments at Palo Alto Networks (2,000 developers, 25% productivity gain), Sourcegraph customers (200+ enterprises, 54B LOC indexed), Qualtrics, Coinbase, and Altisource demonstrate production value. GitHub and Sourcegraph reached GA milestones in Q1 2026: Copilot's semantic indexing achieved seconds-fast retrieval and now extends to issue search (May 2026); Cody shifted enterprise-only (July 2025). Yet vendors are moving beyond RAG architectures precisely because semantic-only retrieval has hit limits. Retrieval precision drops sharply with corpus scale (87% degradation at 50K+ documents); retrieval consistency varies >50% across prompt formulations; context window constraints cause accuracy degradation on large codebases (50% on >10K LOC). Semantic collapse—where embedding drift silently degrades relevance in production without visible error signals—causes 28% increase in hallucinations. The tier-defining tension has inverted: maturity is now about moving beyond semantic search toward hybrid keyword-semantic-structural approaches. Peer-reviewed research (PwC, ISSTA 2026) reveals deployment barriers: grep outperforms embeddings on evidence-location problems; concept-alignment approaches achieve 15x improvement on out-of-distribution tasks. Adoption and trust remain misaligned: 52% of developers use leading tools but 96% distrust AI output. The practice remains leading-edge but constrained by architectural fragility and unresolved reliability barriers.
GitHub and Sourcegraph control enterprise deployments, with Q1 2026 and May 2026 milestones confirming market consolidation and feature maturation. Copilot's semantic code search reached GA (March 2026, sub-second indexing); semantic issue search launched (May 2026) extending infrastructure beyond code; Sourcegraph shipped Smart hover summaries (April 2026) grounding Q&A in precise code intelligence rather than embeddings alone. Cody operates at enterprise-only tier ($19-$49/seat, July 2025), signaling market bifurcation. Real-world deployments demonstrate measurable ROI: Palo Alto Networks onboarded 2,000 developers via Cody + Claude in 3 months with 25% average productivity gain and peak gains to 40%; Sourcegraph serves 200+ customers (Stripe, Reddit, BlackRock, Nutanix) with 54B LOC indexed, delivering 4-day Log4j vulnerability responses and 80% time reduction for cross-repository changes; Qualtrics' 1,000-developer rollout reduced IDE navigation 28% and code Q&A time 25%; Altisource modernized 350K lines with 25% productivity gain and 54% vulnerability reduction. Scale Labs' 2026 benchmark (124 Codebase Q&A tasks) revealed 30% frontier capability ceiling for architecture and root-cause analysis, quantifying limits of semantic retrieval alone.
Adoption breadth masks critical limitations. Market surveys show 52% developer adoption of leading tools (Claude Code, Cursor) but 96% distrust AI output; code review time now exceeds writing time. Production failure modes documented: semantic collapse causes 28% hallucination increase where embeddings drift silently in static indexes; embedding fine-tuning for precision degrades broad retrieval 40%; grep outperforms vector search on evidence-location problems (PwC research); Sourcegraph abandoned RAG embeddings at 100K+ repository scale, switching to BM25; EA's internal study found off-the-shelf semantic search provided minimal productivity uplift on 10M LOC codebases. Technical assessments document scaling failures: context window constraints cause 50% accuracy degradation on >10K LOC; XSearch research (ISSTA 2026) shows concept-alignment approaches achieve 15x improvement on out-of-distribution benchmarks, addressing poor semantic search generalization. Wikimedia Foundation deployed semantic search at scale (1.1M snippets, 83K files) but leading vendors pivot toward hybrid strategies (keyword-embedding-AST, language servers) recognizing semantic-only approaches hit hard limits. Third-party tools remain niche; duopoly consolidation driven by adoption barriers—secret leakage (6.4% for Copilot users, 40% above baseline), vulnerability generation, and vendor lock-in concerns restraining broader ecosystem. The practice achieved mainstream integration but remains constrained by unresolved reliability and architectural boundaries.
— Technical guide on three code search modalities (lexical, structural, graph) for agents; documents Sourcegraph Cody removing embeddings in favor of BM25F+graph, evidencing shift away from pure semantic RAG.
— Large-scale empirical study (35,361 GitHub code comments, Dec 2022–Mar 2026) showing longitudinal shift from direct code generation toward greater emphasis on knowledge and conceptual support via AI-assisted codebase Q&A.
— Turbopuffer benchmark (50 tasks, ContextBench) showing semantic search reduces wasted file reads from 1-in-3 to 1-in-8, with file precision improving from 65% to 87%.
— Official Microsoft documentation of GA semantic code search capabilities including automatic indexing, multi-tool search orchestration, and scale handling from 5 to 500K files.
— Deep Search GA extends code search beyond retrieval into quantitative analysis (count, rank, aggregate across codebases) with architectural summaries, advancing from find-only to analytical code Q&A.
— Production code retrieval models (70% win rate vs grep, 56% fewer search operations, 60k token savings per query) showing semantic code search reaching SOTA performance and operational efficiency.
— Practitioner guidance on semantic issue search (GA May 20, 2026) showing feature-complete maturity for sprint planning, bug triage, and cross-semantic issue discovery workflows.
— 72+ actively maintained code search projects (Rust, updated May–June 2026) with hybrid semantic+BM25, AST indexing, MCP integration, demonstrating ecosystem maturity and commodity-level code search infrastructure.