The AI landscape doesn't move in one direction — it lurches. Some techniques leap from experiment to table stakes in a single quarter; others stall against regulatory walls, technical ceilings, or organisational inertia that no amount of hype can dislodge. Knowing which is which is the hard part. The State of Play cuts through the noise with a rigorously maintained index of AI techniques across every major business domain — classified by maturity, evidenced by real-world adoption, and updated daily so you always know where you stand relative to the field. Stop guessing. Start knowing.
A daily newsletter distilling the past two weeks of movement in a domain or two — delivered to your inbox while the index updates in the background.
AI for generating and editing images, video, audio, 3D assets, and cross-media content. Mostly leading-edge with rapid advancement — image generation, music composition, and voice synthesis are approaching good practice. Video generation and 3D asset creation are progressing fast but quality and controllability gaps persist. The most active domain by momentum: over half the practices are advancing.
This is the domain where the technology has most clearly won and most clearly stalled at the same time. Across images, video, voice, music, and 3D, the underlying models have crossed the production threshold: text-to-image has reached photorealism parity on simple prompts, voice clones are perceptually indistinguishable from the people they imitate, 3D asset generators now "drop directly into games," and consumer video and music platforms have scaled to tens of millions of users and hundreds of millions of dollars in revenue. Suno passed 100 million users and $300M in annual recurring revenue. ElevenLabs reached roughly $500M ARR with two-thirds of the Fortune 500 deploying its voices. Adobe's Firefly approached $300M ARR with its AI-first business tripling to over $500M. By the narrow measure of "does the model work," most of this domain is done.
And yet almost nothing here is advancing on the dimension that would signal true arrival — mainstream professional adoption at organisational scale. The binding constraints have migrated, decisively, off the technology and onto trust, law, fraud, and labour. Consumer acceptance of AI-generated content has collapsed from roughly 60% in 2023 to the mid-20s in 2026; 78% of marketers now prioritise real user-generated content over AI, and AI-influencer partnerships fell 30% in a year. Copyright has hardened from uncertainty into live precedent: Disney, Universal, and Warner Bros. are suing Midjourney; US courts have refused to dismiss the developer cases; the UK reversed its training exemption under artist pressure; and a German court's GEMA v. Suno decision lands on 31 July. Fraud built on these same tools has industrialised — voice-cloning vishing surged more than twelvefold year over year, deepfake fraud cost US victims $1.1B in 2025, and the EU AI Act's Article 50 labelling mandate becomes enforceable on 2 August with fines up to 3% of global turnover. The capability frontier and the deployment frontier have detached, and the gap between them is now the whole story.
What distinguishes Creative & Generative Media from quieter domains is that its structural tensions are unusually legible because they are playing out in public — in courtrooms, in consumer sentiment surveys, in game studios that publicly refuse to ship AI assets, and in the unit economics of the products themselves. OpenAI shut down the Sora consumer video app, which burned around $1M a day against barely $1.4M in lifetime revenue, retreating from general-purpose video generation. That is the shape of the domain in one fact: the model was extraordinary, and the business around it did not work.
The dominant movement this cycle is the hardening of adoption ceilings even as capability keeps climbing — and the clearest signal is a downgrade. Voice cloning, previously advancing, is now assessed as stalled. The technology is finished: clones reach 97% fidelity, commercial APIs need only three to thirty seconds of source audio, and red-team testing finds users cannot distinguish clones over mobile networks. But that same maturity is now the problem. The NO FAKES Act cleared the Senate Judiciary Committee 14–0 on 18 June, establishing a federal likeness right with penalties up to $750,000; India's CERT-In issued a formal advisory on industrialised voice-cloning attacks against bank verification; deepfake vishing grew 1,265% year over year. Vanguard deployments (ElevenLabs at $500M ARR, a UK government MoU for public services) keep expanding, but mainstream rollout is now blocked by consent liability, fraud exposure, and production-reliability gaps rather than anything the model can't do.
The same pattern repeats across the domain. In content authenticity, the bifurcation between detection (an unwinnable arms race) and provenance (production-deployed) sharpened: UC Berkeley forensics leader Hany Farid publicly declared visual deepfake detection broken after failing his own tests, while TikTok demonstrated provenance at scale with 1.3 billion C2PA-labelled videos — yet researchers also stripped 91% of Google's SynthID watermarks via spectral analysis and disclosed two CVEs in Adobe's C2PA reference code, showing the provenance layer's durability is far from settled. In games, Take-Two confirmed GTA 6 ships with zero generative AI and its former AI chief said the hype is "poisoning the well"; AI-disclosed Steam titles drew 53% fewer reviews. Music's paradox deepened — Suno raised $400M at a $5.4B valuation while a study found 93% of AI tracks get under 1,000 plays and listener interest actively declined. The bright spot was bounded, structured automation: agentic video editing (Adobe Firefly's Quick Cut, analyst recognition of an "agentic video" category), real-time sports highlights at the World Cup (Media Distillery, WSC Sports, Pixellot), and 3D generation hitting commodity status (Meshy at 55% US market share, Microsoft's Trellis 3D). Where the use case is narrow and the stakes are low, the technology scales reliably. Everywhere else, it waits on trust and law.
Capability is solved; trust is the new ceiling. Across every modality, the model now works and the audience increasingly doesn't want it. Consumer acceptance of AI content fell from ~60% (2023) to the mid-20s; 83% of consumers say they can spot AI and trust the brand less when they do; H&M's AI-model product pages converted 22% lower than human ones. The decisive variable for deployment has shifted from "can the AI do it" to "will anyone accept that it did."
Copyright moved from uncertainty to liability. What was legal ambiguity is now precedent and exposure. Disney, Universal, and Warner Bros. are suing Midjourney; US courts denied AI developers' motions to dismiss; forensic audits showed Suno trained on 61,026 copyrighted works without consent; the UK reversed its training exemption; and pure-AI outputs remain uncopyrightable under US law, forcing human-in-the-loop workflows that erode the speed and cost advantage. The GEMA v. Suno ruling on 31 July is the next inflection point.
The same tools power both the product and the fraud. Voice cloning, deepfakes, and face-swap synthesis are simultaneously the domain's flagship capabilities and an industrialised crime wave — deepfake fraud cost US victims $1.1B in 2025, vishing rose 1,265% year over year, and one bank logged 8,065 deepfake-bypass attempts in a single quarter. Detection has hit an irreversible ceiling (real-world accuracy of 50–65%, humans near chance), pushing the industry's entire defence onto provenance — which is itself defeatable by commodity watermark-stripping tools.
Unit economics, not capability, decide which products survive. The Sora shutdown ($1M/day burn against $1.4M lifetime revenue) and Runway's inverted margins ($155M EBITDA loss on $44M revenue) show that an extraordinary model does not guarantee a viable business. The products that work are those with bounded, repeatable use cases — sports highlights (80% of NFL/NBA clips now automated), enterprise training video ($8 versus $3,000–5,000 traditional), e-commerce 3D assets (Lowe's at under $1 each) — not open-ended consumer generation.
Bounded automation scales; creative judgment does not. The reliable wins all share a structural property: the task is constrained and editorial judgment is minimal. AI video editing automates sports highlights and silence removal but cannot replicate narrative pacing; long-form video generation produces coherent clips but fails on character and causal consistency past a few minutes; game studios use AI for brainstorming (81%) but only 5% in player-facing features. The frontier separating "scaled deployment" from "still experimental" runs precisely along the line where human taste becomes load-bearing.
NO FAKES Act Clears Senate Judiciary Committee 14–0 (news-coverage) — The unanimous vote on 18 June — establishing a federal likeness right with penalties up to $750K and 70+ years of post-mortem protection — marks the moment voice-cloning and deepfake deployment moved from legal ambiguity into enforceable liability, directly capping mainstream rollout for the domain's most commercially mature modality. https://ground.news/article/a-landmark-bill-targeting-ai-deepfakes-faces-a-us-senate-judiciary-committee-vote-on-june-18-five-things-to-know-about-the-no-fakes-act
Hany Farid Says AI Deepfakes Have Defeated His Own Eyes (opinion) — The UC Berkeley forensics leader who spent 20 years advising governments and law enforcement on deepfake detection now fails his own tests; his public declaration that visual detection is broken closes off the detection branch of the content-authenticity strategy and leaves provenance as the only surviving defence — itself still fragile. https://aiweekly.co/alerts/hany-farid-says-ai-deepfakes-have-defeated-his-own-eyes
The Fraud Files — June 2026 (case-study) — Industrial-scale deepfake misuse in concrete numbers: synthetic identity fraud projected at $3.1B for 2026, one bank logging 8,065 deepfake-bypass attempts in a single quarter, deepfake-as-a-service at $10–50 per image — the same capabilities powering the domain's flagship products have become a commodity crime-wave infrastructure. https://www.proof.com/blog/the-fraud-files-stolen-credentials-fake-biometrics-and-the-synthetic-identity-wave-june-2026
An Empirical Analysis of AI Slop in Music Streaming (research-paper) — Peer-reviewed data showing 93% of AI tracks get under 1,000 plays versus 64% of human tracks, AI content now comprising 40% of new releases in a "spray and pray" pattern; this is the demand-side ceiling the summary describes — commercial scale without commercial engagement. https://arxiv.org/abs/2606.18052
Suno Raises $400M at $5.4B Valuation Despite Ongoing Legal Battles (adoption-metric) — $400M Series D at a $5.4B valuation (2x growth from November 2025), 2M paid subscribers, $300M ARR growing 404% year-over-year — investor conviction running directly against active UMG/Sony litigation and the listener-engagement collapse documented in the arxiv paper above. The coexistence of those two facts is the domain's central paradox. https://imusician.pro/en/resources/blog/suno-raises-over-400-million
Take-Two's Ex-AI Boss Says Generative AI Hype Is "Poisoning the Well" (opinion) — Take-Two shut its entire AI R&D division in April 2026 and GTA 6 ships with zero generative AI; the former AI research head's critique that hype is eroding credibility for proven procedural tools is the clearest single statement of the trust ceiling at AAA scale. https://www.eurogamer.net/take-two-former-ai-head-generative-ai-hype-poisoning-the-well
Nearly 20% of Steam Next Fest Games Carry a Generative AI Warning (adoption-metric) — 1,715 of 8,700 games (19.71%) disclosed AI use — a platform-level measure of how broadly indie developers have adopted the tools — set against the 53% fewer reviews (sales proxy) and 3.7-point score penalty for disclosed titles, showing the adoption-versus-acceptance gap in a single clean dataset. https://www.pcguide.com/news/nearly-20-of-steams-new-next-fest-games-come-with-a-generative-ai-warning-stats-reveal/
Adobe Firefly Introduces Agentic Video Capabilities (product-ga) — Quick Cut auto-assembly, storyboard-to-video, and product video creation deployed natively across Creative Cloud represents the domain's largest established player committing its mainstream product line to agentic automation — the bounded, workflow-constrained category the summary identifies as the reliable scaling path. https://blog.adobe.com/en/publish/2026/06/18/adobe-firefly-introduces-new-agentic-capabilities-and-an-upgraded-creative-ai-studio-built-for-the-way-you-work
Media Distillery and Personal Elevate World Cup Fan Engagement with Automated Highlights (case-study) — Auto-generated 3–8 minute match highlights within minutes of final whistle for 2M users across 104 World Cup matches is the clearest live illustration of the summary's "bounded automation scales" thesis: constrained task, no creative judgment required, production scale achieved. https://tva.onscreenasia.com/2026/06/argentinas-personal-and-media-distillery-elevate-world-cup-fan-engagement-with-automated-match-highlights/
Video as a Competitive Advantage: How Enterprise Teams Are Using AI-Generated Video at Scale (case-study) — $8 per video versus $3,000–5,000 traditional (a 99%+ cost differential), alongside a documented 10–15 point quality reduction in blind evaluation, makes the unit-economics argument concrete: AI video works where cost is the primary constraint and quality thresholds are modest, which is exactly the bounded-use-case model the summary argues is the only category currently scaling. https://aijourn.com/video-as-a-competitive-advantage-how-enterprise-teams-are-using-ai-generated-video-at-scale/