The AI landscape doesn't move in one direction — it lurches. Some techniques leap from experiment to table stakes in a single quarter; others stall against regulatory walls, technical ceilings, or organisational inertia that no amount of hype can dislodge. Knowing which is which is the hard part. The State of Play cuts through the noise with a rigorously maintained index of AI techniques across every major business domain — classified by maturity, evidenced by real-world adoption, and updated daily so you always know where you stand relative to the field. Stop guessing. Start knowing.
A daily newsletter distilling the past two weeks of movement in a domain or two — delivered to your inbox while the index updates in the background.
Each dot marks the weighted maturity of practices within a domain — hover for a brief summary, click for more detail
AI that identifies application performance bottlenecks, optimises database queries, and recommends resource improvements. Includes query plan analysis and runtime profiling; distinct from code refactoring which targets code quality rather than runtime performance.
AI-assisted performance optimisation and query tuning is a mature, proven practice — the tooling works, the ROI is documented, and the rollout question is operational, not existential. A deep vendor ecosystem (Dynatrace, Datadog, New Relic, IBM Db2) delivers GA products for execution-plan analysis, runtime profiling, and recommendation engines, backed by analyst recognition and quantified returns (Forrester reports 267% ROI from observability deployments). The defining tension is no longer whether AI adds value but how far automation can safely extend. Research demonstrates that learned optimisers outperform traditional cost models in controlled settings, yet production deployments universally keep humans in the loop. Persistent reliability gaps in features like SQL Server's Query Store — plan-forcing failures, silent regressions under concurrency — explain why. The practice occupies a stable plateau: broadly adopted as an assisted-tool discipline, with autonomous optimisation remaining aspirational despite sustained vendor investment.
By June 2026, query optimization has fractured into three parallel tracks: learning-based vendor platforms shipping autonomous optimization with measured outcomes; LLM-assisted suggestion layers providing decision support; and foundational research validating new capabilities while exposing production reliability gaps. Snowflake leads with learning-based optimization in production: Optima Planning automatically improved recurring query performance (180x on 44-join query, 8.5x on cardinality issues) and Cortex AI Functions optimization (19x overhead reduction via learned filter ordering, 3.2x cost reduction via claim verification) demonstrate end-to-end autonomous tuning. PostgreSQL community published core optimization roadmap (pgconfdev2026): focus on cardinality hints and join statistics collection as pragmatic alternatives to speculative runtime adaptation, reflecting mature consensus that feedback mechanisms beat stateful learning for open-source adoption. Research widened LLM's proven capability scope: agentic query execution (EnumGRPO) achieves 317x cost reduction at $0.011 per-query, GPU kernel synthesis (DataKernelBench) delivers 2.11x speedups via LLM-generated CUDA, and join order optimization improved 80% of test cases via semantic reasoning. Critical negative signals emerged: BIRD benchmark evaluation on 95 real production databases showed GPT-4 at 54.89% accuracy with curated hints (34.88% baseline), revealing a 38pp gap versus human performance (92.96%) and documenting failure modes on dirty data and complex schemas. Practitioner framework (TroubleshootingSQL ONE CARD) articulated production evaluation standards: clarity (DBA override trust), ask (p95-specific improvements), risk (graceful degradation), and data (improvement attribution), exposing that operational validation remains unsolved for most AI tools. Observability vendor momentum continues: Dynatrace $1.972B ARR (20% YoY), database performance monitoring market approaching $2.6B, with FreedomDev documenting real outcomes (10x-50x improvements, 92% CPU reduction, $35K-$75K savings per engagement). Yet foundational reliability constraints block autonomous adoption: Query Store plan-forcing failures, OPTIMIZATION_REPLAY_FAILED errors, SQL Server upgrade regressions (3+ hours on compat 170), and parameter-visibility gaps remain endemic. Practitioner consensus solidified around assisted workflows: AI rewrites validated against EXPLAIN and Query Store, paired with comprehensive observability (48.5% OpenTelemetry adoption) for independent verification. The field shows clear capability expansion (LLMs, RL agents, GPU kernels) but divergent adoption: learning-based platforms deliver measurable autonomous optimization in specific contexts, while broader enterprise deployments remain in assisted-tool mode. Convergence toward unsupervised optimization is not visible—operating assumption remains human oversight mandatory.
— BIRD benchmark evaluation on 95 real databases: GPT-4 achieves 54.89% accuracy with curated hints (34.88% without), vs 92.96% human performance. Documents critical limitations of AI-assisted SQL generation on production databases with dirty data and complex schemas.
— Uber data infrastructure engineer documents Java-to-C++ Presto/Velox migration: query latency improved significantly for compute-heavy workloads; memory usage became predictable. Represents real production optimization addressing execution model bottlenecks.
— Peer-reviewed research on EnumGRPO, a self-improving optimizer for LLM-backed query execution. Achieves 35.4% execution accuracy at $0.011 per-query cost (317x reduction vs hybrid), demonstrating quantified efficiency for agentic query optimization at scale.
— SQL Server query optimization case study: execution plan analysis identified 62% cost bottleneck; YEAR function removal optimization achieved 7.5x speedup (30s → 4s). Demonstrates practical query tuning via plan cost analysis and targeted rewrites.
— Framework for evaluating AI query planners: clarity (DBA override trust), ask (specific p95 improvements), risk (graceful degradation), data (improvement attribution). Critical perspective on production readiness of AI-assisted optimization tools.
— Comprehensive technical guide framing query optimization as senior-engineer skill: EXPLAIN plan literacy, index type selection, join algorithm prediction, empirical SARGable rewrites. Positions query optimization literacy as critical discipline.
— Snowflake Optima Planning GA: learning-based optimization showing 180x improvement on 44-join query (1+ hour to 20 seconds) and 8.5x speedup on join cardinality issues. Demonstrates autonomous learning-based optimization shipping at scale with measured outcomes.
— PostgreSQL core committers (Robert Haas, Tomas Vondra, Alexandra Wang) discuss query optimization improvements via feedback mechanisms: cardinality hints, join statistics, runtime adaptation strategies, reflecting active community work on core optimizer maturity.
2018: Google Cloud launched Stackdriver APM as a developer-focused alternative to AppDynamics and Splunk. Microsoft released SQL Server 2017 with Query Store as a production feature for query plan history and automatic tuning recommendations. Dynatrace reached 14,700+ live website deployments. Critical assessments emerged showing Query Store default settings caused performance overhead on some workloads.
2019: Dynatrace confirmed market leadership (9th Gartner Magic Quadrant leader designation). Datadog APM achieved production maturity with enterprise adopters (Airbnb, Zendesk, Square, Peloton). SQL Server 2019 released with enhanced Query Store defaults attempting to reduce overhead. Academic research accelerated: CIDR 2019 and SIGMOD 2019 papers on deep learning for query optimization signaled progress toward automation, but production gap remained large. Industry assessments highlighted persistent challenges: tuning still required manual expertise in execution plans and workload analysis; many practitioners struggled with configuration complexity.
2020: APM market consolidation accelerated with Big Five vendors controlling $5B in spending; aggressive pricing and complexity raised adoption barriers for mid-market organizations. Academic research advanced on automated tuning: SBBD 2020 presented empirical results from self-tuning frameworks, and aiDM 2020 explored deep RL for join optimization. Critical finding: ESEC/FSE 2020 discovered 159 bugs in DBMS query optimizers (including 51 optimization bugs in PostgreSQL, SQLite, CockroachDB), exposing fragility in optimizer foundations. SQL Server Query Store adoption matured in Microsoft ecosystem with growing tutorials and practitioner advocacy. Community-driven open-source APM alternatives (rails_performance gem, perf-tools) emerged in response to commercial tool cost barriers. Autonomous optimization remained research and roadmap territory; production deployments still relied on manual guidance and vendor monitoring dashboards.
2021: Academic research continued advancing ML approaches to query optimization: CIDR 2021 introduced DBEst++ for learned approximate query processing, and arXiv published a comprehensive survey of cardinality estimation and cost model improvements. Industry deployments of APM tools expanded (HashiCorp, practitioner case studies with New Relic showing 99%+ improvement in specific queries), yet autonomous tuning remained largely vendor-roadmap territory. Query Store adoption persisted in Microsoft ecosystem but with recognized overhead (3-4% typical, critical for high-volume workloads), limiting enterprise rollout. Conference presentations (FOSDEM 2021) highlighted ongoing work on adaptive query optimization and neural network-based improvements for PostgreSQL, signaling continued academic-to-open-source momentum but slow progress toward production autonomous systems.
2022-H1: Deep divergence emerged between research and production. SIGMOD 2022 published Balsa, a RL-based query optimizer achieving 2.8x improvement over expert optimizers, representing a major AI breakthrough. However, concurrent research systematically revealed limitations: traditional cost models often outperform learned models in practice, signaling maturity gaps despite accuracy gains. Microsoft SQL Server 2022 public preview shipped major Intelligent Query Processing enhancements (parameter-sensitive plans, DOP feedback), reflecting vendor investment in database-native automation. Production incidents (Datadog APM tracer memory leaks) exposed reliability concerns in major vendor tooling. Analyst recognition (Gartner MQ 2022 Leader for Datadog) confirmed ecosystem maturity, but deployment remained cautious with most tuning still manual and expertise-driven.
2022-H2: Production deployments widened: Glovo deployed Datadog Database Monitoring to optimize queries and reduce computational load; Alpiq deployed Datadog Mule integration across entire application portfolio. UC Berkeley research advanced ML-driven optimization (Naru, NeuroCard, Balsa RL agents outperforming commercial engines). Platform vendors accelerated automation: New Relic released NRQL productivity features (aparse, conditional, regex multi-capture); Dynatrace maintained analyst leadership (12th MQ designation). Critical limitations persisted: MySQL optimizer failures caused 12-minute delays on queries that should run in 0.1s; underlying databases exposed 159+ optimizer bugs (PostgreSQL, SQLite, CockroachDB); Query Store overhead remained endemic (3-4% typical). Production tuning still relied on vendor dashboards and manual expertise rather than autonomous optimization. Research-to-production gap widened despite academic breakthroughs.
2023-H1: Vendor platform momentum continued with incremental improvements: New Relic GA'd advanced NRQL query features for performance correlation across systems (June); Microsoft Azure released Query Performance Insight public preview for PostgreSQL (April); Datadog and Dynatrace maintained integrated APM and database monitoring. However, reliability emerged as a critical blocker: Brent Ozar documented multiple production-affecting bugs in SQL Server 2022 CU4 affecting Query Store, including incorrect query results and memory dumps, signaling tool fragility despite vendor investment in database-native automation. The ecosystem showed no evidence of convergence toward fully autonomous optimization—tools remained dependent on human expertise and vendor dashboards for effective tuning. Database engines (MySQL, PostgreSQL, CockroachDB) continued to harbor dozens of optimizer bugs, and Query Store overhead remained endemic (3-4% typical).
2023-H2: Platform vendors consolidated analyst recognition: Dynatrace ranked #1 across all six Gartner Critical Capabilities for APM and observability use cases (July), maintaining market leadership; SQL Server 2022 expanded Query Store capabilities to secondary replicas in Availability Groups (November), reducing primary impact for performance monitoring. Academic research continued scrutiny of ML-based optimization: arXiv paper (September) examined robustness of learned query optimizers, raising concerns about ML model behavior and reliability in production systems. The field remained in a holding pattern—no major production deployments of fully autonomous AI-driven query tuning emerged in H2, and the research-to-production gap showed no convergence despite vendor investment and academic advances. Performance optimization remained an assisted-tool ecosystem dependent on human interpretation of vendor dashboards rather than autonomous systems.
2024-Q1: Major platform vendor GA releases matured the ecosystem. Dynatrace released its Databases app with AI-powered anomaly detection and statement performance analysis (January); Microsoft GA'd Azure SQL Database Query Performance Insight (January) following its successful preview; both products demonstrated continued investment in productized query optimization tools. Real-world deployment evidence: MAMPU (Malaysian Administrative Modernisation unit) reported a case study showing 99% response time reduction (18.2s→60ms) and 413% APDEX improvement through Dynatrace-assisted optimization, alongside 68% user adoption gains—indicating strong deployment momentum despite continued reliance on tool-assisted tuning rather than autonomous optimization.
2024-Q2: Vendor consolidation and enterprise adoption accelerated. Dynatrace's Q1 FY2025 earnings (published May 2024) disclosed $1.5B ARR with major platform consolidation wins including a top-20 global financial institution and Fortune 50 company moving to cloud, signaling large-scale enterprise reliance on observability-driven optimization. IBM publicly described evolution of Db2 query optimizer toward AI-based automation using ML models trained on customer workloads, signaling persistent vendor investment in automated tuning despite research-production gaps. Independent survey data: Brent Ozar's May 2024 SQL Server population report showed 49% of monitored production servers running SQL Server 2019 (highest 3-year adoption rate), indicating widespread deployment of platform-native query performance capabilities. Practitioner adoption continued in modern stacks: hands-on integration of Datadog APM in Rails/GraphQL applications; real-world testing of AI-driven tools like Releem for MySQL configuration optimization. However, the field showed no evidence of convergence toward fully autonomous tuning—optimization remained dependent on tool-assisted analysis and human expertise.
2024-Q3: Vendor AI features and infrastructure challenges emerged. IBM shipped Db2 12.1 AI Query Optimizer (August) using neural networks for cardinality estimation, advancing database-native automation; New Relic released cardinality management UI (September) for metric optimization. Forrester TEI study (August) reported 267% ROI and 40% IT time savings from observability deployments, validating enterprise adoption. However, Query Store deployment friction persisted: real-world case showing SQL Server 2022 on 254-core infrastructure with 166 databases automatically switching to READ_ONLY mode due to memory limits (August), losing performance visibility; Kendra Little documented two plan-forcing bugs causing compilation time to jump from 28s to 60+ minutes (August). APM market growth continued with projections to $8.665B by 2025 (22.69% CAGR). Field remained dependent on tool-assisted analysis and human interpretation rather than autonomous optimization.
2024-Q4: Multi-vendor AI acceleration with persistent reliability concerns. Dynatrace extended AIOps capabilities for Oracle and SQL Server databases (December) enabling automated query issue resolution; Azure SQL Database previewed ABORT_QUERY_EXECUTION hint (December) for automated problematic query blocking via Query Store; IBM Db2 12.1 AI Query Optimizer GA (December) used neural networks for zero-input cardinality estimation and plan selection. Research frontier advanced with LLM-based approaches: arXiv preprint (November) showed LLM embeddings enable simple classifiers to outperform heuristic query optimizers. New Relic observability study (October) reported quantified ROI: organizations with full-stack observability experienced 79% less downtime and 48% lower outage costs, validating ecosystem value. However, production fragility persisted: Query Store plan-forcing bugs continued causing GENERAL_FAILURE states and 60+ minute compilation times; parameter value visibility limitations hindered troubleshooting complex queries (November). The ecosystem showed continued vendor investment in AI-driven optimization but remained dependent on human oversight and tool-assisted analysis rather than autonomous decision-making.
2025-Q1: Vendor momentum in AI-enhanced observability with organizational constraint barriers emerging. New Relic announced enhanced Database Performance Monitoring with agentic AI integrations at New Relic Now+ 2025 conference (February), positioning AI-driven database optimization as uptime-critical. Datadog deepened enterprise adoption through workflow embedding: Go1 (enterprise learning platform) integrated Datadog APM into engineering culture where teams proactively detect issues before escalation (January). However, platform maturity constraints surfaced: Kendra Little documented Microsoft's declining SQL Server investment (March), noting promised features like Query Store on secondary replicas remaining in preview years after 2022 announcement, with accumulating unfixed bugs undermining confidence in automated tuning infrastructure. Enterprise adoption continued with stable Query Store deployment base (49% SQL Server 2019 market share), but optimization remained dependent on human interpretation of observability signals rather than autonomous systems. LLM-based optimization research remained exploratory with no major production deployments.
2025-Q2: Vendor AI-feature announcements revealed by production reliability constraints. SQL Server 2025 and Oracle AI Database 26ai shipped AI-driven optimization capabilities (vector indexing, in-database agents); Dynatrace expanded custom database monitoring across major engines (Oracle, SQL Server, MySQL, PostgreSQL). However, Query Store reliability gaps persisted: CU10 introduced intermittent plan regression bugs where forced plans silently failed under high concurrency; OPTIMIZATION_REPLAY_FAILED errors continued in SQL Server 2022+; autotuning algorithms documented limitations (minimum execution thresholds, ML model detection gaps). Observability tooling overhead emerged as deployment challenge: New Relic Agent data collection caused performance degradation (NR-1018 error) requiring sampling adjustments. Production optimization remained dependent on human interpretation and manual remediation, not autonomous systems—the field showed no convergence toward fully autonomous AI-driven tuning despite vendor capability announcements.
2025-Q3: Incremental vendor AI feature releases with persistent reliability constraints blocking autonomous adoption. New Relic released Predictive Analytics (NRQL predictions, predictive alerting via Holt-Winters) and Compute Optimizer (efficiency identification, CCU savings estimation) in July; Datadog announced unified APM/RUM data integration enabling end-to-end performance correlation. SQL Server 2025 RC0 (September) advanced Intelligent Query Processing with automatic cardinality estimation feedback for expressions and parameter sensitive plan optimization. Dynatrace maintained enterprise momentum ($1.822B ARR, 18% YoY growth, 12 seven-figure deals with 50% Log Management adoption). However, Query Store reliability gaps persisted (plan regression bugs, OPTIMIZATION_REPLAY_FAILED errors, autotuning thresholds); practitioner analysis (September) confirmed tools remained assistant-focused requiring human interpretation rather than autonomous. No convergence toward fully autonomous optimization by Q3 2025—organizational barriers stemmed from reliability concerns and missing autonomous capabilities.
2025-Q4: Vendor ecosystem diversification with incremental AI feature consolidation. IBM Db2 v12.1.3 released neural network AI join cardinality prediction showing 100x individual query improvements and 17% TPC-DS gains (December); Dynatrace extended Oracle monitoring with AI root-cause analysis (October); Redgate enhanced aborted-query tracking (October); DBmarlin reached GA with SQL co-pilot for multi-database tuning (November). Market validation continued with Database Performance Monitoring sector projected to $2.6B by 2026. However, structural adoption barriers persisted: Query Store plan-forcing failures, autotuning minimum thresholds, and tool-overhead issues remained endemic. By end-2025, the field showed no convergence toward autonomous optimization—all major deployments remain in assisted-tool mode requiring human interpretation and decision-making authority. The research-to-production gap endures despite sustained vendor investment.
2026-Jan: Vendor ecosystem maturation with persistent operational constraints. Dynatrace released AI-native Database Monitoring GA (January); SQL Server 2022 adoption reached 29% (January, Brent Ozar); Dynatrace Perform 2026 conference demonstrated end-to-end workflow from observability to developer remediation via GitHub Copilot. Practitioner consensus shifted toward balanced AI+native-tools approach for optimization rather than autonomous systems, reflecting organizational trust constraints. Query Store reliability gaps (plan forcing failures, OPTIMIZATION_REPLAY_FAILED errors) remained endemic; all major deployments continued in assisted-tool mode.
2026-Feb: Vendor agentic feature announcements continued with persistent tool maturity constraints. Datadog released APM Recommendations (February 18) offering AI-generated performance and reliability guidance from unified telemetry; Dynatrace launched Database Operations Agent (February 5, preview) providing agentic workflows for query remediation using execution plan data; New Relic announced SRE Agent (February 24) for autonomous full-stack diagnostics. Academic review (February) confirmed AI-assisted optimizers outperform traditional approaches but face interpretability, runtime overhead, and variability challenges. Real-world deployments revealed tuning complexity: SQL Server 2016→2022 upgrade case (February) showed 10% performance degradation requiring Query Store, compatibility mode, and plan-forcing mitigation; expert analysis highlighted Query Store limitations (table hints unsupported, plan guide constraints). All major vendor AI features remained in preview or early adoption—the field continued showing no convergence toward fully autonomous optimization, maintaining dependency on assisted-tool analysis and human decision authority.
2026-Mar: Vendor automation waves with real-world deployment acceleration. SQL Server 2025 released Intelligent Query Processing enhancements including AI-driven automatic plan optimization, neural-network cardinality estimation, and Adaptive Parameter Optimization (APPO) eliminating parameter-sniffing failures (March 27). Cast AI (unicorn) released Database Optimizer GA with ML-powered transparent caching achieving 80-90% cache hit rates in production (Flowcore, Akamai deployments); Virtana announced AI-native system-aware observability correlating performance failures from code through infrastructure; CubeAPM achieved GA with AI-based intelligent trace sampling enabling 60-80% observability cost savings (Delhivery 75% reduction case study). Cross-database adoption signals emerged: DBtune (Optimizer-as-a-Service) deployed iterative ML tuning across AWS RDS, Azure, Google Cloud, Patroni HA clusters with claimed 50% cost reduction potential; OpenTelemetry reached 48.5% production adoption with 57% reporting cost reduction (STCLab achieved 72% cost reduction via full-coverage tracing). Independent consulting firm FreedomDev documented real-world SQL/PostgreSQL/Oracle/MySQL optimization outcomes: 10x-50x query improvements, 92% average database CPU reduction, $35K-$75K annual productivity savings per engagement. Research frontier continued advancing: arxiv 2603.15970 demonstrated AI query approximation achieving >100x cost/latency reduction for semantic database operations using lightweight proxy models. Field remained dependent on assisted-tool workflows—no evidence of autonomous optimization deployment without human approval despite sustained vendor feature velocity.
2026-Apr: LLM research breakthrough on cardinality estimation shifts field understanding; multi-vendor GA announcements and agentic integration reach scale. Together AI + Stanford + Wisconsin–Madison published research (April 3) showing Large Language Models correct query optimizer cardinality misestimates via semantic reasoning, achieving 4.78x speedups on complex multi-join queries where traditional optimizers fail due to column independence assumptions; represents foundational breakthrough validating LLMs as solvers for 20-year DBMS limitation. Empirical study (Liu et al., April 3) demonstrated LLMs with fine-tuning and selective invocation significantly outperform ML cardinality estimators. Research advances on RL-based optimization: RELOAD paper (April 16) directly addresses production barriers (robustness, convergence), demonstrating 2.4x higher robustness and 3.1x greater efficiency vs state-of-the-art RL methods, with explicit focus on eliminating query-level performance regressions. Databricks research collaboration with UPenn (April 22) showed LLM agents improved join order optimization in 80% of cases with 1.3x latency gains, validating agentic reasoning over traditional estimators. Vendor GA consolidation: SolarWinds AI Query Assist (April 2) for SQL Server and Oracle with automated query rewrites; Microsoft SQL Server 2025 GA (April 2) with Cardinality Estimation Feedback and OPPO; Google Cloud Database Center GA (April 24) with Gemini-driven fleet insights and MCP support enabling Claude/ChatGPT/Gemini agentic integration for autonomous database analysis; ClickHouse deployed on Google Axion with 30-55% performance gains plus Antigravity AI IDE for natural-language query composition. IBM Db2 v12.1.3 documented 3x improvements (local predicates) and 20% join gains with realistic deployment guidance. Oracle 26ai retrospective documented 15-year evolution toward AI-informed optimization. Dynatrace Q3 FY'26 showed $1.972B ARR (20% YoY) confirming enterprise reliance. Practitioner ecosystem documented: five AI SQL assistants (Snowflake, dbForge, SQLAI.ai, Bytebase, AskYourDatabase) with text-to-SQL and automatic rewriting; bibliometric review confirmed 2024 as peak publication year for AI/ML query optimization research. Critical limitations persist: practitioners document cardinality estimation failures (prorated density on stale statistics causing nested-loop vs hash-join misselection); Query Store plan-forcing failures and OPTIMIZATION_REPLAY_FAILED errors endemic; autotuning minimum thresholds limit autonomous effectiveness. All major vendor AI features remained dependent on human oversight and validation—the field continued showing no convergence toward unsupervised optimization despite LLM research breakthroughs and agentic vendor integration reaching GA maturity.
2026-May: Observability vendors reach mainstream AI integration with practitioner adoption signals shifting field baseline, while agentic research and production deployments extend the accuracy frontier. Grafana released AI-powered query troubleshooting as GA (May 6), correlating live metrics, wait events, and execution plans to diagnose slow queries; Brent Ozar publicly documented adoption of AI-assisted query tuning workflows, signaling mainstream recognition threshold crossed. Research advances: PerfEvolve (arXiv, May 19) demonstrated LLM-agent dynamic PostgreSQL parameter tuning outperforming documentation-based baselines by up to 35.2% on TPC-C/TPC-H; Google Cloud GA'd proxy models achieving 30-100x LLM-powered SQL query latency reduction while maintaining 90-116% accuracy; Datadog DBM team improved AI optimization agent precision from 54% to 86% using Karpathy's autoresearch framework; agentic query planning with ML cost prediction achieved 23% latency reduction with 94% resource constraint satisfaction. Production deployments confirmed measurable impact: AI-powered Snowflake query intelligence reduced monthly credits 26%, idle costs 58%, and query latency 35%. Practitioner analysis drew clear role boundaries: AI excels at suggestion-layer optimization (pattern recognition on JOINs, subqueries) while humans retain decision authority for actual tuning requiring EXPLAIN analysis and storage engine knowledge. Production complexity persisted: SQL 2025 compat 170 caused >3-hour slowdowns versus 2.5-3 minutes on SQL 2019, illustrating that optimizer enhancements do not universally improve all workloads. Field showed no convergence toward autonomous optimization—all major GA features remained in assisted-tool mode requiring human validation despite sustained vendor investment and research breakthroughs.
2026-Jun: Learning-based vendor platforms delivered measurable production outcomes while BIRD benchmark evidence exposed persistent LLM accuracy gaps on real-world databases. Snowflake Optima Planning GA demonstrated autonomous query optimization at scale—180x improvement on a 44-join query and 8.5x on cardinality issues—alongside Cortex AI Functions achieving 19x overhead reduction and 3.2x cost savings via learned filter ordering on hundreds-of-millions-of-rows pipelines. PostgreSQL core committers (Haas, Vondra, Wang) published pgconfdev2026 roadmap prioritising cardinality hints and join statistics feedback as pragmatic optimizer improvements, signaling community consensus around incremental feedback mechanisms over stateful learning. Research advanced agentic execution: EnumGRPO achieved 35.4% accuracy at $0.011 per query (317x cost reduction vs hybrid), and practical case studies confirmed targeted rewrites remain high-value (SQL Server YEAR-function removal: 30s→4s, 7.5x speedup via execution plan cost analysis). Critical negative signal: BIRD benchmark evaluation on 95 production databases showed GPT-4 at 54.89% accuracy with curated hints (34.88% without) versus 92.96% human performance, quantifying a 38-point accuracy gap on dirty data and complex schemas and constraining unsupervised deployment confidence. Practitioner frameworks continued emphasizing human decision authority for production tuning; no convergence toward autonomous optimization emerged.