Perly Consulting │ Beck Eco

The State of Play

A living index of AI adoption across industries — where established practice meets the bleeding edge
UPDATED DAILY

The AI landscape doesn't move in one direction — it lurches. Some techniques leap from experiment to table stakes in a single quarter; others stall against regulatory walls, technical ceilings, or organisational inertia that no amount of hype can dislodge. Knowing which is which is the hard part. The State of Play cuts through the noise with a rigorously maintained index of AI techniques across every major business domain — classified by maturity, evidenced by real-world adoption, and updated daily so you always know where you stand relative to the field. Stop guessing. Start knowing.

The Daily Dispatch

A daily newsletter distilling the past two weeks of movement in a domain or two — delivered to your inbox while the index updates in the background.

AI Maturity by Domain

Each dot marks the weighted maturity of practices within a domain — hover for a brief summary, click for more detail

DOMAIN
BLEEDING EDGEESTABLISHED

Image generation — photorealistic & illustrative

GOOD PRACTICE

TRAJECTORY

Advancing

AI that generates photorealistic images and illustrations from text prompts or reference images. Includes diffusion-based generation for photography, concept art, and illustration styles; distinct from product visualisation which targets commercial product imagery.

OVERVIEW

Text-to-image generation has matured into a production-scale practice with validated business models and enterprise deployment, but remains constrained by copyright liability, persistent technical gaps, and human-in-the-loop requirements that prevent mainstream classification. By mid-May 2026, the competitive landscape had crystallized into distinct capability-openness trade-offs: Midjourney v7 and GPT Image 2 prioritized aesthetic quality and token-based reasoning (with Midjourney achieving 9.5/10 photorealism vs. DALL-E 3's 8.8/10 prompt accuracy); Adobe Firefly 3 uniquely trained on licensed data with 250M+ ARR and 45% Fortune 500 creative cloud penetration; Stable Diffusion 4 Ultra prioritized openness with 87% hand accuracy and native 4K resolution. Market consolidation strengthened: GPT Image 2 (April 2026) shifted from diffusion to token-based reasoning (Transfusion), achieving 1512 Elo rating with 80% win rate on day-1 integration across Figma, Canva, Adobe, and fal. Firefly adoption metrics now concrete: 24B+ assets generated, 45% Creative Cloud user penetration, 72% Fortune 500 design team integration. However, structural barriers to mainstream remain unresolved: copyright liability (U.S. federal court in April 2026 denied Stability AI's motion to dismiss Getty Images claims, establishing material exposure), persistent technical gaps (multi-object compositional failures, gradient degradation across regeneration cycles), human-in-the-loop requirement for copyright protection, and quality precision constraints (portraits 65-75%, products 80-85%, 80% of deployments requiring human touch-ups). Research efforts continue toward technical mitigation: Ambient Diffusion demonstrates training on 90% corrupted data to prevent memorization while maintaining generation quality. The practice remains in good-practice tier: market size, vendor diversification, and documented enterprise ROI validated, but persistent production quality gaps, copyright liability exposure, structural human-in-the-loop requirements, and real-world reliability constraints prevent mainstream advancement.

CURRENT LANDSCAPE

By mid-May 2026, text-to-image generation had achieved full enterprise maturity with crystallized vendor positions, quantified adoption metrics, and persistent legal/technical barriers. Market scale remained robust: image generation market grew to $15.18B (2026, +30.3% YoY); daily creation volume reached 34 million AI images; stock photography industry collapsed 77% ($14.3B → $3.2B, 2019–2026) as designers migrated to generative workflows. Vendor positioning stabilized with documented capability differentiation: Midjourney v7 achieved 9.5/10 photorealism rating with 500M revenue (2025), 19.83M active users (26.8% market share), and $5M revenue-per-employee efficiency; GPT Image 2 (April 2026) shifted architectural paradigm from diffusion to token-based reasoning (Transfusion), achieving 1512 Elo rating with 242-point lead over competitors and immediate integration into Figma, Canva, Adobe, and fal platforms; Adobe Firefly achieved 250M+ ARR with 3x YoY growth, 24B+ assets generated, 45% Creative Cloud user penetration, and 72% Fortune 500 design team integration; Stable Diffusion 4 Ultra reached GA with 87% hand accuracy and native 4K resolution, powering 80% of open-source deployment. Real-world deployment metrics confirmed production viability: e-commerce providers scaled 10,000+ images monthly with 75% cost reduction; performance marketers documented concrete CPA improvements and tool-by-tool performance data; Google Ads Asset Studio deployed Imagen 4 with 68% production time reduction and 41% ROAS lift. However, persistent barriers constrained mainstream advancement: copyright liability escalated with U.S. federal court (April 24, 2026) denying Stability AI's motion to dismiss Getty Images claims, establishing material liability exposure; human-authorship requirements for copyright protection codified across jurisdictions (U.S., China, UK); technical gaps persisted (multi-object compositional failures, 6-8% improvement ceiling from mitigation techniques, gradient degradation, unreliable text rendering); production precision constraints remained (portraits 65-75% accurate, products 80-85%, 80% requiring human touch-ups); and research gaps shifted focus from capability advancement to risk mitigation (Ambient Diffusion training on 90% corrupted data to prevent copyright memorization). Consumer trust remained stable at 68% deception concerns. The practice remained in good-practice: market scale, vendor diversification, and quantified enterprise ROI validated; however, copyright liability exposure, structural human-in-the-loop requirements, persistent production quality gaps, and platform-driven capability restrictions prevented mainstream advancement.

TIER HISTORY

ResearchJan-2022 → Jan-2022
Bleeding EdgeJan-2022 → Jul-2024
Leading EdgeJul-2024 → Feb-2026
Good PracticeFeb-2026 → present

EVIDENCE (106)

— Product GA announcement (public beta April 27, 2026) documents agentic AI orchestrating 60+ pro-grade tools across 6 Creative Cloud apps; marks transition from scattered features to cross-app workflow automation with planned Claude integration.

— High-impact technical analysis: GPT Image 2 (April 2026) shifts from diffusion to token-based reasoning (Transfusion). Benchmarks: 1512 Elo at launch, 242-point lead over next model (80% win rate). Day-1 integration: Figma, Canva, Adobe, fal.

— Independent business journalism documenting Firefly adoption metrics (24B+ assets generated, 45% Creative Cloud user penetration, 72% Fortune 500 design team integration) alongside competitive and market pressures; balanced critical assessment of adoption expansion despite stock decline.

— Independent analyst reports Firefly-specific ARR crossed $250M with 3x YoY growth; 45% QoQ generative credit growth and 8x YoY video generative actions signal volume-driven production scaling.

— Hands-on comparative testing of Midjourney V7, DALL·E 4, and Stable Diffusion 4 across 600+ generated images. Specific quality findings: Midjourney wins photorealism, DALL·E wins typography/infographics, SD4 + LoRA matches Midjourney for ~50% of prompts.

— Google announces Imagen 4 deployed commercially in Google Ads Asset Studio (May 2026), with 68% creative production time reduction and 41% ROAS lift from AI-generated ad variants. Signals product-GA for Imagen 4 and large-scale enterprise adoption.

— University news coverage of Ambient Diffusion research showing diffusion models can be trained on heavily corrupted data (up to 90% pixel masking) to prevent copyright memorization while maintaining generation quality—addressing core IP concerns.

— Performance marketer's detailed analysis of image generation tool deployment with concrete CPA metrics, tool-by-tool performance data, and documented both successes and critical failures in production use.

HISTORY

  • 2022-H1: OpenAI released DALL-E 2 to preview access in April 2022; Midjourney launched competing product and claimed profitability. Ethical concerns (bias, harmful content, copyright) emerged as primary blockers. No evidence of production adoption.
  • 2022-H2: Stable Diffusion public release (August) catalyzed rapid adoption: 1.5M+ DALL-E users, 10M+ Stable Diffusion users by October. Enterprise integration via Azure OpenAI. Deployment remained constrained by unresolved copyright risks (1.88% training-data copying), licensing disputes, bias amplification (99% white developer images), and syntactic comprehension gaps. Stable Diffusion 2.0 (December) imposed artist-name restrictions, provoking user backlash over restrictions versus legal risk mitigation.
  • 2023-H1: Enterprise adoption accelerated through cloud platform support (AWS SageMaker, Azure OpenAI). Named organizations deployed Midjourney for commercial workflows. However, class-action copyright litigation filed against Stability AI, Midjourney, and DeviantArt (February 2023) and unresolved IP risks (1.88% training-data copying) emerged as primary deployment barriers. Developer feedback documented API quality issues and overly restrictive safety filters as usability constraints.
  • 2023-H2: Edge deployment matured: Apple Core ML implementation enables efficient Stable Diffusion on iPhone/iPad (September). DALL-E 3 released with ChatGPT integration and artist opt-out provisions, signaling ethical investment. Enterprise investment reached $2.5B annually but stalled on unproven ROI. Hardware ecosystem benchmarking (45+ GPU compatibility, A100 optimization) confirmed production readiness. Critical research finding: "Model Autophagy Disorder" study shows models degrade when trained only on synthetic data, identifying a fundamental scaling constraint. Copyright lawsuit remained unresolved.
  • 2024-Q1: Midjourney V6 released with marked photorealistic improvements; a major publisher reported relying entirely on AI-generated images for website content, signaling production-ready capability. Stable Diffusion 3 entered early preview with multi-subject and text-rendering enhancements. Copyright litigation escalated: 30+ lawsuits tracked across the industry; LA Times investigation documented evidence that models trained on copyrighted material with ability to reproduce it. Getty and artist class actions continued to pose material liability risks. Photorealistic generation capability had matured to commercial viability but remained constrained by unresolved copyright liability.
  • 2024-Q2: Photorealistic capability matured to consumer and enterprise scale. Large-scale human perception study confirmed near-parity: 50-60% accuracy distinguishing AI from real images. Consumer adoption accelerated: 53% of US adults used generative AI, 6.5B Firefly images generated since launch. Enterprise deployment advanced: 65% of organizations adopted GenAI (double from 2023), with 40% deploying across multiple business functions. Midjourney's sustainable profitability ($300M revenue, $5M per employee) demonstrated stable business model. However, Stable Diffusion 3 faced licensing fragmentation (shift from open-source to commercial-restricted tiers) and quality concerns (poor anatomy, flattened images), signaling ecosystem divergence and tradeoff between openness and capability. Copyright litigation remained unresolved, creating persistent liability barriers despite deployment momentum.
  • 2024-Q3: Cloud infrastructure maturity accelerated: Amazon Bedrock integrated Stable Diffusion 3 Large and other models (September), enabling enterprise-grade scalable deployment. Firefly adoption sustained: 9+ billion cumulative images with customers upgrading to premium tiers. However, copyright litigation precedent tightened: August 2024 Andersen v. Stability AI ruling found Stable Diffusion was "created to facilitate infringement by design," rejecting VCR analogy and confirming both vendors and end users face legal liability. Comparative research (DALL-E 3, SD XL, Stable Cascade) confirmed domain-specific limitations: consistent anatomical errors in human figures across all models, constraining medical/scientific deployment. Social media analysis documented photorealistic output with misinformation risks (celebrities/politicians depicted with surrealism). Ecosystem fragmentation deepened as Stable Diffusion 3 shifted to commercial-restricted licensing. The practice remained in bleeding-edge: deployment momentum was undeniable, but copyright liability and ecosystem divergence prevented mainstream classification.
  • 2024-Q4: Photorealistic capability confirmed by independent measurement: consumer discrimination accuracy dropped to 10% (October 2024) identifying 70%+ of images correctly, down from 25% in June 2023. Stable Diffusion 3.5 released (October) with 8.1B/2.5B parameter models and permissive licensing, advancing accessible deployment. However, fundamental limitations surfaced: AI researchers documented DALL-E 3's persistent compositionality failures (3/17 correct on part-relation tasks); representation bias research revealed systematic portrayal bias toward disabled individuals; and detection method fragility increased with SD version updates. Trust barriers rose sharply: Deloitte survey found 68% of GenAI users concerned about synthetic content deception and 59% unable to distinguish AI from human media. Midjourney reached 21+ million Discord members (by window end), confirming sustained consumer adoption. The practice remained in leading-edge: technical capabilities proven, business models sustainable, but copyright liability, trust concerns, and compositional/reasoning limitations blocked mainstream advancement.
  • 2025-Q1: Regulatory environment shifted from litigation to policy: U.S. Copyright Office (January 2025) formally clarified that copyright protection requires substantial human authorship—pure AI-generated images receive no legal copyright protection, creating explicit constraints on automated deployment models. Stable Diffusion 3.5 Large became available on Amazon Bedrock (February) enabling enterprise cloud accessibility. Research focus moved toward technical mitigation: peer-reviewed studies proposed genericization methods to reduce copyright fingerprinting in outputs. Trust barriers remained high (68% of users concerned about synthetic content deception per Deloitte). Midjourney sustained 21M+ Discord members with continued tier monetization. The practice remained in leading-edge: infrastructure deployment and commercial viability proven, but regulatory guidance on copyright authorship requirements, persistent technical limitations (compositionality, anatomical accuracy), and trust barriers blocked mainstream adoption.
  • 2025-Q2: Deployment momentum accelerated with named enterprise success: Mercado Libre scaled Stable Diffusion for product ads across 7 countries (25% CTR improvement, 90K+ ads); Adobe Firefly Services reached major enterprises (Accenture, Dentsu, PepsiCo, Estée Lauder) with Forrester-quantified 70-80% asset scaling ROI. Midjourney v7 released (April) with sustained market leadership; FLUX emerged as fastest alternative. However, real-world quality gaps surfaced: Microsoft rolled back Bing Image Creator due to DALL-E 3 quality degradation (less detail, poor prompt adherence); independent testing revealed persistent anatomical errors, prompt fidelity gaps, and safety vulnerabilities. Ecosystem fragmented by capability-openness trade-offs: Midjourney prioritized aesthetics, FLUX speed/photorealism, Stable Diffusion 3.5 openness. Copyright regulatory environment consolidated: AI-only generation remains unprotected, forcing "human-in-the-loop" deployment models. Trust barriers persistent (68% fear deception, 59% cannot distinguish AI from real). The practice remained in leading-edge: profitable production deployments proven, but copyright constraints on autonomy, real-world quality/reliability gaps, and consumer trust barriers block mainstream advancement.
  • 2025-Q3: Enterprise deployment momentum sustained: Adobe reported 99% Fortune 100 adoption of AI in Adobe apps, 90% of top 50 accounts adopted Firefly/GenStudio; IBM achieved 80% content cost reduction with 2-day ideation cycle. Stability AI launched Image Services on Amazon Bedrock (September) with nine editing tools and named enterprise customers (Mercado Lille, HubSpot). Named deployments confirmed: NFL and Stride Learning scaling Stable Diffusion 3.5 (1,000+ images/minute). However, real-world precision assessment revealed persistent limitations: Midjourney portraits 65-75% accurate, products 80-85%; 80% of deployments still require human touch-ups for final output. Copyright litigation landscape shifted: June 2025 court rulings (Bartz, Kadrey) found training "transformative" but using pirated data exposed vendors to billions in damages; over 50 lawsuits tracked. The practice remained in leading-edge: enterprise cloud-native deployment (Bedrock, Azure) proven profitable with strong ROI metrics, but persistent quality gaps, ongoing copyright liability exposure, and real-world reliability constraints block mainstream advancement.
  • 2025-Q4: Enterprise scaling momentum accelerated: HubSpot scaled image generation 150% with Stable Diffusion 3.5 Large in Amazon Bedrock, generating 300K images in 4 months; Adobe Q4 FY2025 reported record revenue ($6.2B, +10% YoY) with strong Firefly adoption across enterprise. However, reliability and quality constraints remained: peer-reviewed medical imaging study of 1,500 AI-generated images found all generators significantly underperformed real images in anatomical accuracy and detail, signaling persistent domain-specific limitations. Copyright liability environment consolidated: legal analysis tracking 50+ lawsuits; fair-use precedents (Bartz, Kadrey) confirmed training "transformative" but using pirated data exposes vendors to billions in damages. Adobe addressed trust concerns with transparent design mechanisms for Firefly including creator content protection and disclosure attachments. The practice remained in leading-edge: production-scale cloud-native deployment (Bedrock, Azure) with proven enterprise ROI, but persistent medical/specialized domain quality gaps, copyright liability exposure, and ongoing precision constraints (portraits 65-75%, products 80-85%, 80% requiring human touch-ups) prevent mainstream classification.
  • 2026-Jan: Ecosystem consolidation and regulatory maturity accelerated: Stable Diffusion commanded 80% market share with 2M daily image generations and $150M+ annual revenue; Adobe Firefly achieved 2-3x performance improvements on AWS with 14K developers and 12M monthly Acrobat users; Midjourney and FLUX maintained market positions despite ecosystem fragmentation. However, adoption barriers tightened: copyright litigation landscape resolved with $1.5B Bartz settlement (pirate-data training deemed unfair despite transformative fair-use finding), exposing all vendors to billion-dollar liability; 70+ active lawsuits tracked. Real-world quality gaps persisted: DALL-E 3 forum documented persistent anatomical errors, face generation failures, and content moderation frustrations; independent ecosystem analysis (Republic Labs) confirmed chronic issues (bias, deepfakes, quality inconsistencies) across all models. Regulatory pressure accelerated platform rollbacks: Grok restricted image generation behind paywall following deepfake/NCII controversies and DSA/Ofcom scrutiny, signaling industry-wide shift toward ethical governance. The practice remained in leading-edge: infrastructure, business models, and market adoption proven at enterprise scale, but copyright liability now consolidated with financial precedent, persistent real-world quality gaps, and regulatory-driven capability restrictions collectively prevented mainstream advancement.
  • 2026-Feb: Enterprise cloud-native deployment reached full production maturity: Adobe Firefly claimed 68% penetration among enterprise design teams; DALL-E 3 confirmed GA on Azure; Stable Diffusion 3.5 Large available on Amazon Bedrock with 19% speed improvements and $0.08/image pricing. Adoption markers strengthened: 86% of creatives use generative AI daily with doubled prompt complexity; Firefly integration reduced project turnaround 40%. However, production barriers persisted: practitioners documented inconsistent style, sampling artifacts, unpredictable runtimes, and multi-pass workflow requirements. International legal fragmentation tightened: Chinese courts ruled Midjourney prompts uncopyrightable; U.S. DOJ and Copyright Office maintained human-authorship requirement for copyright protection. Licensing split adoption paths: Adobe offered IP indemnification for paid customers, Stable Diffusion provided open-source Apache 2.0 licensing. The practice remained in leading-edge: enterprise-scale deployment with proven ROI and cloud infrastructure support confirmed, but persistent production quality gaps, copyright liability exposure codified across jurisdictions, and licensing fragmentation collectively maintained barriers to mainstream advancement.
  • 2026-Apr: Ecosystem maturity accelerated with benchmarking, market consolidation, and technical insight into the photorealism mechanism. Independent API benchmark (AI Playbook, 500 requests per model) measured 11+ vendors; systematic testing across 6 tools and 5 categories rated Midjourney v7 at 9.5/10 for artistic quality (with photorealistic skin texture and fabric rendering) and DALL-E 3 at 8.8/10 for prompt accuracy, with Adobe Firefly 3 notable as the sole licensed-data trained model. Midjourney's market position confirmed at $500M revenue (2025, 900% growth since 2022), 19.83M active users, and 26.8% global market share at $5M revenue per employee. GitHub ecosystem metrics confirmed sustained developer adoption (ComfyUI 86.77k stars +518/month, AUTOMATIC1111 154.52k stars). Technical breakthrough clarified the photorealism mechanism: models achieved realism by mimicking smartphone camera imperfections — contrast issues, sharpening artifacts, perspective compression — rather than pursuing technical perfection, reframing the quality bar for deployment. Stable Diffusion 4 Ultra reached GA with 87% hand accuracy and native 4096×4096 resolution. UK High Court ruling in Getty v. Stability AI significantly reduced IP liability by rejecting secondary copyright infringement claims, clarifying that trained model weights are not infringing copies and enabling broader deployment. However, critical practitioner assessments documented persistent production limitations: gradient degradation across regeneration cycles, fine-detail loss, and unreliable text rendering continued to require hybrid human-AI workflows. The practice remained in leading-edge: market size, vendor diversification, and profitable enterprise deployment confirmed at scale; real-world quality gaps and human-in-the-loop requirements remained structural barriers to mainstream advancement.
  • 2026-May: GPT Image 2's architectural shift (diffusion to token-based Transfusion reasoning) solidified as the period's defining development: 1,512 Elo with a 242-point lead over competitors and 80% win rate on day-one integration across Figma, Canva, Adobe, and fal. Firefly adoption metrics strengthened further — 24B+ assets, 45% Creative Cloud user penetration, 72% Fortune 500 design team integration, $250M ARR tripled YoY — while Google Imagen 4 reached commercial deployment in Ads Asset Studio with documented 68% production time reduction and 41% ROAS lift. Research addressed the persistent copyright memorisation problem: Ambient Diffusion demonstrated that training on 90% pixel-masked data maintains generation quality while preventing training-data reproduction, offering a technical path around IP liability. Vendor differentiation stabilised: Midjourney leads photorealism (9.5/10), DALL-E leads typography and infographics (8.8/10 prompt accuracy), SD4 + LoRA competitive for ~50% of prompts at lower cost.

TOOLS