The AI landscape doesn't move in one direction — it lurches. Some techniques leap from experiment to table stakes in a single quarter; others stall against regulatory walls, technical ceilings, or organisational inertia that no amount of hype can dislodge. Knowing which is which is the hard part. The State of Play cuts through the noise with a rigorously maintained index of AI techniques across every major business domain — classified by maturity, evidenced by real-world adoption, and updated daily so you always know where you stand relative to the field. Stop guessing. Start knowing.
A daily newsletter distilling the past two weeks of movement in a domain or two — delivered to your inbox while the index updates in the background.
Each dot marks the weighted maturity of practices within a domain — hover for a brief summary, click for more detail
AI that monitors and moderates user-generated or AI-generated content to ensure brand safety and policy compliance. Includes automated content filtering and brand safety scoring; distinct from content safety in AI governance which governs AI outputs rather than published content.
Content moderation and brand safety is standard infrastructure for digital advertising and platform governance. Every major advertiser deploys automated content classification, and not doing so requires justification to stakeholders, regulators, and brand partners alike. The practice is established -- but it is also stalled. The core tension that defined this field a decade ago persists: automated tools handle categorical content (copyright, CSAM) reliably, yet consistently fail on contextual judgment -- sarcasm, cultural nuance, therapeutic necessity. Vendors like DoubleVerify and Integral Ad Science have built multi-hundred-million-dollar businesses on classification at scale, and the market continues to grow. But repeated investigations have exposed systemic accuracy gaps, and the industry is shifting from rigid blocklists toward contextual AI and brand suitability frameworks. May 2026 marked a maturity inflection: platforms (YouTube, TikTok, Meta) deployed automatic AI content detection and synthetic media labeling at scale, moving beyond voluntary creator disclosure. Yet this operationalization masks persistent limitations. Research demonstrates 57x labeling inconsistency across frontier LLMs even with detailed definitions; production moderation systems inappropriately flag therapeutic conversations discussing self-harm as undesirable; regulatory enforcement failures persist (Singapore: CSAM and terrorism detection remains inadequate; EU: 62% minority-language accuracy triggering fines). Moderation at scale now relies on automatic detection, multimodal analysis, and vendor ecosystem partnerships. But effectiveness ceilings remain hard: systematic language coverage gaps (98% of African languages), adversarial synthetic media tactics, contextual judgment failures in sensitive domains. Moderation works. It also demonstrably does not work well enough -- and that paradox now defines the field.
Deployment metrics confirm operational maturity at unprecedented scale. April 2026 platform enforcement data documented 2.0-2.5M moderation actions/day across 8 Very Large Online Platforms (VLOPs) with regulatory coordination driven by EU DSA compliance. TikTok removed 538,000+ AI-generated unauthorized videos in April 2026 alone, demonstrating platform-scale detection of synthetic content threats. Q4 2025 data showed 175M videos removed globally with 99.1% proactive detection. DoubleVerify achieved MRC accreditation for TikTok viewability and SIVT detection in April 2026—the first independent third-party validation for platform-specific brand safety measurement—signaling vendor ecosystem maturity. DoubleVerify's 2025 revenue of $748.3M (14% YoY growth) and Novacap's $1.9B acquisition of Integral Ad Science in September 2025 demonstrate sustained investor confidence. The brand safety verification market is consolidated and mandatory—IAS and DoubleVerify now measure across Meta Threads, TikTok Pangle, LinkedIn CTV, and all major social and streaming platforms.
June 2026 marked a critical inflection: Meta achieved ~50% automation of content moderation via LLMs, planning >90% for specific categories by year-end. Platform metrics claim 13% fewer enforcement errors and 10% more violations caught compared to human review. However, Meta's independent Oversight Board concurrently documented systematic dual-enforcement flaws—simultaneous over-moderation (wrongly shadow-banning legitimate speech) and under-moderation—alongside bias amplification from historical human decision logs, signaling that scale and accuracy remain in tension. Concurrent vendor ecosystem expansion (DoubleVerify's DV Neura showing 300x increase in content classification output; IAS extending Total Media Quality to YouTube Audio and Meta Threads; DV AdVantage deployment to Meta/TikTok with pilot metrics of 98% reach improvement and 59% suitability incident reduction) demonstrates sustained market momentum and multi-platform coverage maturity. Yet critical assessment research reinforces known limitations: UPenn study of seven production AI moderation systems revealed 50%+ variance in hate speech scoring across vendors, with systematic bias against marginalized communities and documented failures on reclaimed language and implicit hate speech detection. These paired signals—operational scale combined with documented inconsistency—define the field's current state: moderation infrastructure is mandatory and deployed at billions of daily decisions, yet bias, vendor disagreement, and contextual judgment failures remain hardened system properties unresolved by technical innovation alone.
Regulatory enforcement is reshaping the landscape at unprecedented speed. The U.S. TAKE IT DOWN Act (May 19, 2026 deadline) mandates platforms deploy AI-driven detection and removal systems for nonconsensual AI-generated intimate images with 48-hour removal requirements, creating a structural compliance gap between major platforms with existing infrastructure and thousands of smaller platforms lacking technical capability. The EU DSA moved from policy to enforcement: Meta faced its first major DSA fine for election disinformation, with specific findings showing 40% higher organic reach for unverified false claims versus corrections and only 62% accuracy in minority-language moderation—directly triggering mandates for algorithmic auditing and real-time moderation transparency. Yet credibility pressures intensify while systemic gaps widen. An FTC investigation alleges IAS engaged in advertiser-driven platform boycotts. A shareholder lawsuit accuses DoubleVerify of overbilling for bot impressions and misrepresenting tool capabilities. Critical assessments now dominate: Singapore's regulator (IMDA) documented that platforms fail to proactively detect CSAM and terrorism content despite policy commitments. A Global Voices investigation revealed that only 42 of 2000+ African languages appear meaningfully in LLM training—approximately 98% of African languages are "essentially invisible to moderation systems," while TikTok's removal of content from Kenya climbed from 450K (Q1 2025) to 592K (Q2 2025). Meta's platform-scale AI cleanup deleted millions of accounts for bot/spam activity in May 2026, with documented false positives indicating system limitations.
Generative AI and platform policy shifts pose an unresolved systemic challenge. Meta/Instagram rolled out mandatory AI-content labeling on Reels (April 30, 2026) closing loopholes in synthetic content detection. DoubleVerify launched "AI SlopStopper" in April 2026 to detect low-quality AI-generated content across social platforms, showing vendor innovation in response to emerging threat landscape. Yet real-time detection and enforcement remains unproven at scale, and regulatory fragmentation (EU DSA, US TAKE IT DOWN Act, China ex-ante content mandates) creates compliance uncertainty. The field's paradox now sharpens: moderation is operationalized at billions of daily decisions with measurable fraud reduction and vendor scale, yet credibility erodes amid evidence of political bias in LLM systems, systematic under-coverage of non-Western languages, documented regulatory enforcement failures against leading platforms, and continued failures against adversarial synthetic media tactics.
— Meta's independent Oversight Board documented dual enforcement flaws (over/under-moderation) and bias amplification risks in LLM-based moderation despite positive metrics, providing critical counterbalance to deployment claims.
— Meta replaced ~50% of human content review with LLMs, targeting >90% for specific categories by year-end; claims 13% fewer enforcement errors and 10% more violations caught, signaling production-scale AI moderation shift.
— Peer-reviewed ACL 2026 study of 16 AI text detection systems found systematic representational and allocational harms clustered by demographic group, directly applicable to content moderation fairness.
— Integral Ad Science expanded Total Media Quality to YouTube Audio Ads (1B+ monthly podcast users), completing multi-format brand safety ecosystem coverage across video, audio, and streaming platforms.
— DoubleVerify's integrated brand safety and optimization platform expanded to Meta and TikTok with reported 98% reach improvement and 59% reduction in brand-suitability incidents in pilot deployments.
— UPenn study of seven AI moderation systems revealed 50%+ variance in hate speech scoring and systematic bias against marginalized communities, documenting fundamental inconsistency in production moderation systems.
— DV Neura shows ~300x increase in content classification output and 500M+ impressions monitored/blocked since start of 2026, demonstrating scale maturity and vendor shift toward agentic autonomous moderation.
— TikTok removed 204.5M videos (0.7% of uploads) with 186.6M via automated detection (91%); 99.3% proactive removal, 94.8% within 24 hours. Demonstrates large-scale production deployment of automated AI moderation at platform scale.