The AI landscape doesn't move in one direction — it lurches. Some techniques leap from experiment to table stakes in a single quarter; others stall against regulatory walls, technical ceilings, or organisational inertia that no amount of hype can dislodge. Knowing which is which is the hard part. The State of Play cuts through the noise with a rigorously maintained index of AI techniques across every major business domain — classified by maturity, evidenced by real-world adoption, and updated daily so you always know where you stand relative to the field. Stop guessing. Start knowing.
A daily newsletter distilling the past two weeks of movement in a domain or two — delivered to your inbox while the index updates in the background.
AI controlling physical systems across manufacturing, agriculture, construction, healthcare, and service. The largest domain at 27 practices, with industrial robotics and pick-and-place at good-practice. Most practices cluster at leading-edge — sim-to-real transfer, humanoid robotics, and surgical automation are progressing rapidly. Nearly half the practices are advancing, making this one of the most dynamic domains by momentum.
Physical AI in mid-2026 is defined by a widening gap between what works and what is claimed to work. Across twenty-eight practices spanning manufacturing, agriculture, construction, logistics, healthcare, and consumer environments, a consistent pattern holds: narrow, structured-task robots have crossed into production with documented returns, while general-purpose and humanoid platforms remain overwhelmingly pre-commercial despite billions in venture funding. The domain's centre of gravity sits firmly at the adoption frontier -- most practices are deployed by forward-leaning organizations but not yet standard operating procedure across industries.
The strongest signal of maturity comes from factory floors and controlled logistics environments. Collaborative robots have reached 64,500 annual unit sales with 12% market share of all industrial robots; autonomous cleaning systems hit $4.2 billion in revenue growing 38% year-on-year; warehouse goods-to-person systems deliver 20-50% cost reductions at enterprises including Amazon (1 million-plus robots), Alza, and THG. Quality inspection, predictive maintenance sensing, and precision agriculture monitoring have all reached the point where the technology itself is no longer the constraint -- organizational readiness, data quality, and integration costs are. Autonomous haulage in mining is perhaps the domain's clearest success story: Komatsu's FrontRunner fleet crossed 1,000 vehicles in April 2026, XCMG deployed 100 all-electric autonomous trucks at a single Chinese mine, and payback periods are measured in months rather than years.
Yet this maturity coexists with persistent stalls and outright failures. Humanoid robots fail 88% of household tasks in Stanford's assessment. Tesla admitted Optimus Gen 3 production delays, declining to provide targets. A detailed industry analysis documented $18 billion-plus in humanoid and advanced robotics investment from 2022-2025 with "vanishingly small" productive impact -- Unitree's IPO prospectus revealed only 9% of revenue from industrial deployment. The gap between demo-stage capability and production deployment remains the domain's defining tension, and it is widening as hardware vendors pour resources into general-purpose platforms that have not yet found repeatable commercial use cases outside tightly scoped demonstrations.
The practices that are advancing tend to share three characteristics: task specificity, environmental control, and a clear labor-economics case. Autonomous construction haulage works because mines are controlled environments with high-value, repetitive routes. Collaborative assembly works because cobots handle 3-25 kg payloads in structured cells with 18-24 month payback. Digital twins deliver 18-25% efficiency gains at BMW, PepsiCo, and Siemens because those companies have the capital and data infrastructure to sustain them. The practices that stall lack one or more of these conditions: delivery robots on public sidewalks face unstructured environments and hostile regulation; smart home orchestration fails because residential networks lack professional-grade infrastructure; autonomous harvesting confronts the hardest biological manipulation problem in robotics.
This scan cycle added 227 new evidence items across all twenty-eight practices, reflecting broad research windows. No practices changed tier or trend -- the domain's structural shape held steady. The most significant new evidence clarified the operational boundaries of humanoid and advanced robotics rather than announcing breakthroughs.
The sharpest finding was an operations-level report from a major logistics operator comparing humanoid robots against 1,200-plus human workers across five U.S. facilities. The conclusion was unambiguous: humanoids dominate repetitive material movement (70-80% of human throughput with zero injuries) and structured picking with computer vision, but fail at multi-component kitting, variable SKU handling, and exception resolution. Humans remain faster and more accurate on roughly 80% of warehouse tasks. Meanwhile, Japan Airlines announced a three-year Unitree humanoid pilot for ground handling at Haneda Airport -- the first infrastructure-compatible humanoid deployment in a high-stakes regulated environment, notable because it requires no facility redesign. Figure announced its 03 model deployed in fully autonomous 24/7 operation with wireless inductive charging at seven sites, having scaled manufacturing 24-fold (one unit per day to one per hour) in 120 days. These are real deployments, but still measured in single digits of sites and narrow task scopes.
On the negative side, Tesla confirmed Optimus Gen 3 production delays for the first time during Q1 2026 earnings, admitting a four-month factory conversion timeline and 10,000 unique components with no prior mass-production experience. Stanford's peer-reviewed assessment documented humanoids failing 88% of household tasks, with battery life under 8 hours and hand dexterity variance as binding constraints. A broader survey found 79% of organizations exploring physical AI but fewer than 5% deploying at scale, with reliability (60%), dexterity (80%), and training data scarcity (80%) cited as primary barriers. Cleaning robots advanced to mainstream status with a $4.2 billion market, but the residential segment effectively collapsed following iRobot's bankruptcy and ongoing security vulnerabilities. Construction additive manufacturing hit a cost-parity milestone -- Sika Brasil printed a full concrete house in 60 hours at 30% lower cost than conventional methods -- while Heidelberg Materials scaled autonomous haulage to 30 trucks across 6 global sites, pushing retrofit autonomy beyond mining.
Narrow-task success versus general-purpose aspiration. The practices delivering measurable ROI -- warehouse goods-to-person, autonomous haulage, quality inspection, commercial cleaning -- all solve tightly scoped problems in controlled environments. Every attempt to generalize (humanoid household assistants, urban sidewalk delivery, smart home orchestration) has produced either stalled adoption or commercial failure. Stanford's 88% household task failure rate and the logistics operator's finding that humans beat robots on 80% of warehouse tasks define a hard boundary that foundation models have not yet moved. The $18 billion-plus invested in humanoid platforms from 2022-2025 has produced fewer than a dozen production-scale deployments.
Software orchestration has become the binding constraint, not hardware. Across warehouse robotics, digital twins, and autonomous construction, the pattern repeats: the machines work, but coordinating fleets of them at scale does not. Amazon caps warehouse robot utilization at 65% to avoid congestion deadlocks. Sixty-four percent of digital twin projects never leave pilot, with data governance -- not simulation technology -- as the root cause. LG CNS launched PhysicalWorks as a standalone multi-vendor orchestration platform delivering 15% productivity gains, validating that orchestration itself is now a commercial product category. The MODEX 2026 survey of 400 operators identified integration complexity, not cost, as the leading adoption barrier. The organizations that solve orchestration first will extract disproportionate value from the hardware investments already made.
China is setting the production pace and may set the standards. XCMG's 100-unit autonomous electric truck fleet, Agibot's 10,000-unit humanoid production run, and China's MIIT publishing the first national humanoid robot standard with a safety framework across physical, behavioral, and ethical dimensions all point in the same direction. Chinese manufacturers hold 58% unit share in commercial cleaning robots at 40-60% cost advantage. Asia-Pacific captures 70% of incremental collaborative robot deployment growth. The question for Western operators is not whether Chinese hardware will be competitive -- it already is -- but whether Chinese standards will become the default international framework, particularly given that neither the EU AI Act nor U.S. regulation has produced equivalent specificity for physical AI systems.
Regulatory hardening is outpacing deployment in consumer-facing applications. Chicago banned commercial sidewalk delivery robots in March 2026 after collision incidents. China's three ministries suspended all new L4 autonomous vehicle permits after 100-plus Baidu robotaxis froze on Wuhan streets. California's DMV closed an enforcement loophole allowing law enforcement to cite AV companies directly. The FDA documented surgical robot stapler malfunctions resulting in serious injuries. In every consumer-facing category -- sidewalk delivery, robotaxis, smart home devices, surgical autonomy -- regulatory responses are now faster and more severe than deployment can adapt to, creating a structural drag on scaling that controlled-environment applications (factories, mines, warehouses) do not face.
The labor economics case is strengthening even as deployment stalls. German manufacturing wages of 55,000-75,000 euros per year make collaborative robots with $150,000 capital costs profitable within 2-3 years. The U.S. construction industry faces 499,000-plus unfilled positions annually with 41% of the workforce retiring by 2031. Japanese nursing homes documented that companion robots reduce care-worker quit rates and enable one therapist to supervise 3-4 robotic units simultaneously. The demographic pressure is real, quantified, and intensifying -- yet only a handful of practices (cobots, autonomous haulage, commercial cleaning) have converted that pressure into repeatable, scaled deployment. The gap between labor-shortage urgency and actual automation deployment defines the domain's central strategic question: organizations that wait for general-purpose solutions may find the narrow solutions already have sufficient market share to dictate terms.
The Robotics Bubble Isn't Demos, It's the Deployment Nightmare (market analysis) — The single clearest quantification of the hype-reality gap: $18B+ invested in humanoids 2022-2025 with "vanishingly small" productive impact, Unitree IPO data showing only 9% industrial revenue, and Tesla Optimus producing zero units — this is the anchor evidence for every "what's new" claim about stalled deployment. https://robohorizon.uk/en-gb/magazine/2026/05/the-robotics-bubble-and-the-deployment-nightmare/
World's First 100 Autonomous All-Electric Mining Trucks at Huaneng Yimin Mine (deployment) — XCMG's fleet documents the opposite of the hype failure: controlled environment, repetitive routes, clear labor economics, and now 100-unit scale — the ideal-conditions success story that defines what narrow-task AI robotics can achieve when the three conditions (task specificity, environmental control, labor economics) align. https://www.xcmgglobal.com/news/news-detail-725.htm
Warehouse Robots vs. Human Operators in March 2026 — Productiv (operational assessment) — Operations data from 1,200+ workers across 5 US facilities is the most granular evidence in this scan for exactly where robots win and where humans dominate; the finding that humans remain faster on roughly 80% of tasks gives the "hard boundary" claim a specific number rather than a vague assertion. https://getproductiv.com/blog/man-vs-machine
Japan Airlines Humanoid Robot Trial for Cargo and Baggage Handling (deployment) — The first humanoid deployment in a high-stakes regulated transport environment is significant precisely because Haneda requires no facility redesign — if this works, it becomes the template for regulated-environment humanoid deployment that bypasses the "unstructured environment" failure mode documented everywhere else. https://betabriefing.ai/channels/the-robot-beat/briefings/2026-05-01/
Starship Technologies Achieves 10 Million Autonomous Deliveries (deployment milestone) — 10M deliveries across 300+ locations in 8 countries and 22M kilometers is the strongest counter-evidence to the regulatory-hardening narrative: sidewalk delivery can reach production scale when the operating environment is campus-controlled rather than open urban sidewalk. https://retailtechinnovationhub.com/home/2026/4/27/starship-technologies-hits-ten-million-autonomous-deliveries-across-europe-and-the-usa
Glendale City Council Proposes Moratorium on Delivery Robots (regulatory action) — The 25x geographic expansion that triggered a municipal moratorium illustrates why regulatory hardening is outpacing deployment in consumer-facing applications: the faster operators scale, the faster municipalities respond, creating a deployment ceiling that controlled-environment robotics does not face. https://abc7.com/post/glendale-takes-steps-regulate-delivery-robots-serve-robotics-fleet-expands-los-angeles-area/19048747/
Chinese Robot Cleaners Sweep Korean Homes as Data Worries Linger (market data) — Roborock crossing 50% South Korea market share with 17.7% global unit share and 40-60% cost advantage over Western competitors puts concrete numbers on the "China is setting the production pace" tension — this is not a future concern, it is a present market reality. https://m.ajupress.com/amp/20260508145303993
Philadelphia Delivery Robot Collision Reveals Regulatory Gap (failure/safety) — A single pedestrian collision exposing that no federal agency tracks delivery robot complaints — requiring 10 phone calls to find the responsible regulator — is the uncomfortable truth that scales: incident-response infrastructure has not kept pace with deployment velocity. https://scoop.upworthy.com/woman-reported-uber-bot-for-hitting-her-it-led-to-a-bigger-problem
Why Do Digital Twins So Often Stall in Oil & Gas? (failure analysis) — Capgemini's finding that fewer than 25% of digital twin pilots advance to operations, with organizational barriers (not technical ones) as the binding constraint, directly supports the summary's argument that software orchestration has become the bottleneck — and generalizes beyond oil & gas to the 64% failure rate cited for all sectors. https://www.capgemini.com/gb-en/insights/expert-perspectives/why-do-digital-twins-so-often-stall-in-oil-gas/
LG CNS Launches PhysicalWorks to Speed Robot Rollout (product launch) — A standalone multi-vendor orchestration platform generating 15% productivity gains on mixed robot fleets validates that orchestration is now its own commercial product category, not just a deployment problem — the organization that solves fleet coordination extracts disproportionate value from hardware already purchased by customers. https://biz.chosun.com/en/en-it/2026/05/07/YWAEDNACJVANHJHUGRGBWNNBEA/