Perly Consulting │ Beck Eco

The State of Play

A living index of AI adoption across industries — where established practice meets the bleeding edge
UPDATED DAILY

The AI landscape doesn't move in one direction — it lurches. Some techniques leap from experiment to table stakes in a single quarter; others stall against regulatory walls, technical ceilings, or organisational inertia that no amount of hype can dislodge. Knowing which is which is the hard part. The State of Play cuts through the noise with a rigorously maintained index of AI techniques across every major business domain — classified by maturity, evidenced by real-world adoption, and updated daily so you always know where you stand relative to the field. Stop guessing. Start knowing.

The Daily Dispatch

A daily newsletter distilling the past two weeks of movement in a domain or two — delivered to your inbox while the index updates in the background.

AI Maturity by Domain

Each dot marks the weighted maturity of practices within a domain — hover for a brief summary, click for more detail

DOMAIN
BLEEDING EDGEESTABLISHED

Curriculum design & content generation

LEADING EDGE

TRAJECTORY

Stalled

AI that designs curricula, generates learning paths, creates course content, and produces lesson plans aligned to learning objectives. Includes standards-aligned content creation and prerequisite mapping; distinct from question generation which creates assessment items rather than instructional content.

OVERVIEW

AI-powered curriculum design has matured into a leading-edge but structurally constrained practice. By June 2026, the evidence is unambiguous: deployment at scale is real, time savings are quantified, yet learning outcomes depend entirely on how curricula are pedagogically designed—not on tool sophistication. UK-wide evidence (May 2026) documents 54% of teachers use AI for lesson planning, 50% for quizzes, 45% for test materials, with EEF/NFER controlled trials proving 31% reduction in prep time (~25 min/lesson) with no quality loss by blind expert review. US adoption mirrors scale (68-72% weekly use), and Stanford's analysis of 150,000+ teacher prompts confirms curriculum design is the dominant substantive AI use case in education. Yet the learning-outcomes picture is sobering. The OECD Digital Education Outlook 2026 establishes a critical threshold: "GenAI without clear teaching principles enhances performance with no real learning gains." A meta-analysis of 22 studies (2022–2025) shows positive effects only when pedagogical intent is explicit (g=0.572), while 90% of generic AI lesson materials constrain thinking to recall levels. Large-scale institutional deployments (Tec de Monterrey with 64K+ students, Copenhagen Business Academy with full course integration, JPMorgan's 300K-person curriculum) show the practice is production-ready. But they also reveal the hard requirement: every AI-generated lesson plan requires expert human review and pedagogical validation before deployment. The defining tension persists unchanged: mainstream adoption and time savings coexist with universal "supervision debt" (mandatory review), training gaps (29% of teachers lack formal AI instruction), governance deficits (87% of schools have no formal AI policy), and structural barriers preventing classroom-ready outputs without expert curation. The practice has crossed into leading-edge through deployment maturity and institutional infrastructure—not through solving the underlying design problem.

CURRENT LANDSCAPE

Adoption has reached true mainstream scale with quantified productivity gains and emerging agentic-system deployment. June 2026 evidence shows: UK nationally representative survey (8,000-10,000 teachers) documents 54% use AI for lesson planning, 50% for quizzes, 45% for test materials, 39% for school reports; EEF/NFER controlled trial (68 schools, 259 science teachers) measured ChatGPT-assisted lesson planning reducing prep time to 31% of control baseline (~25 minutes per lesson) with blind expert review finding no pedagogical quality loss. US adoption mirrors this pattern: 68-72% of K-12 teachers use AI weekly; Stanford's analysis of 150,000+ teacher prompts confirms ~50% relate to curriculum design (standards unpacking, example generation, materials alignment, accessibility adaptation). The vendor ecosystem is consolidating and innovating: MagicSchool operates at $44.9M ARR across 13,000+ schools and 6M+ users; Khanmigo reaches 700k users across 380+ districts with Harvard/Stanford RCT validation; next-generation tools (Atomic Jolt, PepperMill, Kuraplan) automate standards alignment and gap analysis; emerging agentic approaches (27% of L&D organizations active, 39% cautious but interested) shift from reactive drafting to proactive multi-step workflows. Policy institutionalization accelerates globally: 134 US state bills, Georgia/Mississippi mandate AI in graduation standards; India launches "AI Literacy for Teachers" (targeting 1M teachers by 2027); EU and UK government initiatives (DfE/DSIT) develop curriculum-aligned safety standards and content infrastructure. Large-scale institutional deployments (Tec de Monterrey: 64K+ students with 20–48 point score gains; Copenhagen Business Academy: no-code RAG for course-specific assistants; JPMorgan: 300K-person "AI Made Easy" curriculum) demonstrate production-stage maturity.

Yet adoption vastly outpaces implementation capacity. Only 29% of US teachers received formal AI training (May 2026); 87% of schools operate without formal AI governance. Quality outcomes remain inconsistent: practitioner reviews find basic worksheets acceptable (71%) but assessment items requiring higher-order thinking only 38% usable, IEP recommendations 36% good. The OECD establishes a design threshold: generic tools cause "cognitive offloading" with 17% worse exam performance despite higher practice scores—validating critical signal that curriculum design must prioritize "slow AI" (iterative, scaffolded) over "fast AI" (fluent one-shot generation). Critical research documents that sequential project design with high element interactivity outperforms open-ended approaches when AI is present. The universal "supervision debt" persists—every output requires expert validation. Field-building is underway (Digital Promise/TNTP partnership, $23M National Academy for AI Instruction targeting 400k teachers, NSF AmplifyGAIN research centers) but faces systemic barriers: insufficient PD infrastructure, weak learning science grounding in most tools, and widespread institutional inertia. Critical assessment from 50+ stakeholder interviews documents tools functioning as "sophisticated photocopiers" rather than pedagogical partners. The result: 68% adoption coexists with structural constraints—training gaps, governance deficits, and lack of pedagogical frameworks—that prevent scaling beyond early-adopter contexts. Emerging signals show the field is shifting from generative drafting to agentic systems and pedagogically-grounded design, but maturity remains concentrated in institutions with dedicated design capacity.

TIER HISTORY

ResearchJun-2023 → Jun-2023
Bleeding EdgeJun-2023 → Oct-2024
Leading EdgeOct-2024 → present

EVIDENCE (132)

The 24-Month WindowOpinion

— Identifies fundamental curriculum design gaps for AI era: unit of work shifted from typing to specifying; six core AI-native competencies missing from traditional curricula; frames 24-month window for AI-native training advantage.

— 2026 comprehensive tool comparison documenting market maturity: standards alignment, adaptive sequencing, auto-generated assessments, bias guardrails; validates practice as institutional/scaled deployment across K-12, higher ed, and corporate contexts.

— Empirical evidence of critical limitation: AI-assisted work improves output quality (M=7.62) without strengthening actual learning mastery (M=5.55), producing average AI-Learning Gap of 2.07; signals fundamental effectiveness constraint for AI-generated curriculum.

— Domain-specific curriculum framework with identified barriers (88% student recognition of AI importance but minimal formal instruction, 92% trainee inadequacy); addresses educator readiness and infrastructure gaps limiting integration.

— Deep critical assessment of AI's fundamental limitations: context blindness, lack of empathy, bias propagation; proposes hybrid human-AI model as maturity path; documents adoption barriers rooted in capability constraints.

— India's CBSE mandatory Computational Thinking & AI curriculum redesign (Classes 3-8: 50-100 hrs/yr) affecting millions of students; composite skill labs required by Aug 2027; documents institutional curriculum design at national scale.

— Italy national curriculum guidelines (effective Sept 2026) mandate AI-supported 6-step pedagogical workflow; emphasizes skill-based pathways and teacher-led instructional leadership; represents policy-level adoption of AI-supported curriculum generation.

— Direct evidence of educator upskilling for curriculum design: 60+ educators trained on AI instructional design, assessment creation, differentiation using Gemini and NotebookLM; addresses rural equity gap in access to AI curriculum training.

HISTORY

  • 2023-H1: MagicSchool and Twinkl introduce AI-powered lesson planning and content creation tools; K-12 adoption reaches ~40% of surveyed teachers; significant faculty skepticism persists in higher education regarding AI's educational impact.
  • 2023-H2: Khan Academy launches Khanmigo teacher tools in production (lesson planning, rubrics, discussion prompts); district-level pilots expand (Newark, Gwinnett); individual practitioner adoption accelerates (teachers building full curriculum maps with ChatGPT); standards bodies begin addressing AI-generated content and plagiarism detection challenges.
  • 2024-Q1: Vendor platforms mature to scale (MagicSchool 4M+ users, Curriculum Genie 300+ LEAs); industry consortia formalize K-12 integration frameworks (CoSN/CGCS maturity tool); business schools report 60% planning curriculum transformation; critical assessments emerge highlighting quality gaps and high pilot failure rates (95% of enterprise AI pilots deliver zero ROI).
  • 2024-Q2: Khanmigo expands free access to all US teachers via Microsoft partnership; Indiana statewide AI pilot reaches 112 schools with 53% positive impact on student outcomes; international platforms scale (NovaEscola in Brazil reaches 15,000+ users); higher education lags in curriculum review despite interest (only 14% of institutions reviewed curricula). Critical barriers remain: institutional adoption slow, quality concerns persistent, teacher training gaps evident.
  • 2024-Q3: Khanmigo extends globally to 49 countries via Microsoft partnership; LAUSD's custom curriculum chatbot shuts down after 5 months, revealing implementation risks; peer-reviewed research documents teachers' AI-driven curriculum adaptation patterns; practitioner and institutional frameworks emphasize educator control and caution. Evidence converges on simultaneous expansion and consolidation: tools scaling internationally while deployment failures expose adoption barriers and quality concerns, requiring heightened institutional oversight.
  • 2024-Q4: Peer-reviewed research confirms teacher adoption of MagicSchool in real classrooms; universities demonstrate rapid course generation (full courses via ChatGPT in under 24 hours with expert approval); instructional design adoption broadens (84% of practitioners use AI) but shows diminishing returns and platform stagnation. However, quality gaps persist: survey of 104 teachers finds only 40% of AI-generated lesson plans classroom-ready; instructional designers report 2024 as continuity-not-change year due to generic models. Adoption plateau evident: tools prove technical viability but encounter institutional and pedagogical limits requiring expert oversight.
  • 2025-Q1: MagicSchool scales to 5M+ educators across 160+ countries with 13,000+ schools; Enid High School (Oklahoma) reports positive outcomes in geometry curriculum deployment via Khanmigo with improved student engagement; practitioner research emphasizes need for educator quality control; industry analysis positions curriculum development as high-benefit, low-maturity use case. Tension persists between vendor scalability claims and institutional implementation reality: adoption broadens among early adopters while broader institutional scaling remains constrained by quality assurance and teacher training requirements.
  • 2025-Q2: Adoption metrics confirm 63% of K12 teachers and 42% of HED instructors use GenAI for lesson planning, yet peer-reviewed research (Penn GSE, UMich) documents systematic pedagogical limitations in AI-generated content. Vendors respond with pedagogically-grounded products (Curipod, others); practitioners develop frameworks emphasizing educator control. Administrator support reaches 55% but teacher adoption in classrooms remains low (25% report AI-assisted instruction). Quality concerns emerge across K12, higher education, and early childhood, converging on universal requirement: AI-generated lesson plans need expert review before classroom deployment.
  • 2025-Q3: Institutional curriculum AI adoption accelerates: Georgia University System deploys AI mapping across 26 institutions (344K+ students); Immaculata University integrates MagicSchool into teacher preparation programs; new product categories (Atomic Jolt, PepperMill) automate gap analysis and standards alignment. Research demonstrates quantified effectiveness (89.72% completion, 91.44% retention) in controlled deployments. Yet persistent pedagogical limitations documented: educators report AI generates generic, low-engagement content lacking critical thinking activities. Michigan Virtual survey (554 educators, September) confirms continued adoption growth. Practice achieves technical maturity and broad early-adopter reach, but remains constrained by universal requirement for expert curriculum review and quality assurance systems before classroom deployment.
  • 2025-Q4: Global policy frameworks institutionalize AI literacy into national curricula (UNESCO, Colombia, India, UAE initiatives); Khanmigo expands to Vietnam with native localization; peer-reviewed evidence documents critical limitations—90% of AI-generated civics lessons constrain thinking to basic levels, AI accuracy fails on subjective assessment tasks, Estonia survey (15,631 students) reveals implementation gaps where adoption outpaces pedagogical readiness. India reports 57% institutional AI policy adoption, signaling strategic institutional integration despite persistent classroom implementation gaps. Adoption plateaus: continued 63% K-12 and 42% HE tool adoption but only 25% classroom deployment; practice achieves operational maturity at scale but remains fundamentally constrained by unresolved quality, bias, and pedagogical limitations.
  • 2026-Jan: Major vendor innovations accelerate (Microsoft Copilot Teach, Google Gemini integration with Khan Academy); institutional deployments expand (Palm Springs Unified, Bloomington Junior High); practitioner critiques intensify, warning against proliferation of low-value AI curriculum tools and emphasizing gap between hype and classroom-ready solutions. Adoption remains steady at ~60% teacher usage for lesson planning; tools prove mature for early adopters while quality and pedagogical constraints continue limiting mainstream classroom implementation.
  • 2026-Feb: Global scale confirmed: Ciklum's AI platform serving 85,000 students across 160+ countries with 70% parent adoption increase; 30,000+ teachers driving 115,000+ AI-generated lesson plans. UK efficiency data shows 70-80% planning time reduction. However, Bend La-Pine Schools removes MagicSchool's student-facing Raina after parent protests, highlighting safety barriers. Peer-reviewed research (February) reiterates quality limitations: 90% of AI civics lessons constrain student thinking to basic levels. EdTech expert analysis identifies "supervision debt"—mandatory human validation across all curriculum AI workflows. Practice reaches scale but deployment failures and quality constraints confirm hard limits on autonomous systems and mainstream classroom reach.
  • 2026-Mar: Khanmigo growth accelerates to 700,000 users across 380+ districts with Harvard/Stanford RCT validation; MagicSchool adoption survey (3,600+ educators) documents 71% lesson planning use and 600+ district-customized tools deployed. Major institutional investment: $23M National Academy for AI Instruction partnership (Anthropic, Microsoft, OpenAI) commits to training 400k teachers on agentic curriculum workflows, signaling shift from template-based to reasoning-agent approaches. Veteran practitioner accounts (American Federation of Teachers) confirm AI used for lesson planning, differentiation, and rubrics but emphasise critical revision before classroom deployment. Critical countervailing evidence: OECD Digital Education Outlook 2026 shows pedagogy-grounded tools outperform generic LLMs; meta-analysis of 11 RCTs finds time savings (25 min/week average) but no improvement in lesson quality (45% stay at Bloom's "remember" level); Alpha School investigation documents AI-generated lesson failures at scale; 3-year deployment across 15 schools shows measurable standards-coverage gains (67→99%) but requires structured AI sequencing engines. Adoption momentum sustained but quality barriers persist—deployment continues to require expert human oversight and instructional design expertise.
  • 2026-Apr: Adoption reaches mainstream scale: 68-72% K-12 teachers use AI weekly for lesson planning; RAND survey (4,200 teachers) documents 72% create plans, 65% worksheets, 54% differentiation, but quality varies sharply (71% basic tasks good, 38% higher-order thinking, 36% IEP recommendations). Stanford SCALE Initiative analysis of 150,000+ teacher prompts confirms 50%+ relate to curriculum design. Policy-level institutionalization: 134 bills across 31 states, Georgia/Mississippi mandate AI in graduation standards. Independent practitioner review reveals MagicSchool excels at text leveling but lesson plans remain "educationally generic" requiring substantial revision; MagicSchool April 2026 updates add curriculum-aligned song generation as a new content format. UCL academic critique challenges whether AI should substitute for lesson planning at all, warning that proliferation of convenience tools risks reducing teachers to editors of generic outputs. Critical framework emerges: tools without learning science foundation (cognitive load theory, Bloom's progression, retrieval practice) function as "sophisticated photocopiers" not teaching partners; OECD Digital Education Outlook 2026 confirms AI benefits depend on curriculum design quality, not mere tool access. Field-building initiatives signal maturation (Digital Promise/TNTP, NSF AmplifyGAIN research center, Massachusetts PEA²K cohort) but widespread adoption constrained by training gaps (71% received no formal instruction), governance deficits (87% schools lack formal AI policy), and unresolved quality thresholds. Practice achieves scale but remains fundamentally dependent on institutional capacity for curriculum vetting and learning sciences integration.
  • 2026-May: A UK nationally representative survey (8,000–10,000 teachers) confirms curriculum design as the dominant AI use case: 54% for lesson planning, 50% for quizzes, 45% for test materials. The EEF/NFER controlled trial (68 schools, 259 science teachers) quantified the efficiency gain: ChatGPT-assisted planning reduced prep time to 31% of control baseline with no pedagogical quality loss in blind expert review. Google pre-registered RCTs (Sierra Leone 1,800 students, Italy 9,000 students) document Gemini used for content creation and scaffolding in production, delivering +0.26–0.38 SD math gains and 70% admin time reduction. Counterweights remain: OECD warns that "outsourcing tasks to GenAI simply enhances performance with no real learning gains"; EU policy (May 11) formalises teacher as co-designer rather than executor; CRPE's 50+ stakeholder audit characterises most tools as "sophisticated photocopiers." Student-demand pressure confirmed separately: 88% of UK students use AI in assessments but only 36% received institutional training, while a 120-institution US study documents five systemic adoption barriers — the adoption-infrastructure gap persists at both teacher and learner layers.
  • 2026-Jun: The agentic shift in L&D solidified: 27% of organizations are active agentic AI users for curriculum workflows (39% more interested), with multi-step autonomous content pipelines replacing reactive drafting. Large-scale deployments confirm production maturity — Tec de Monterrey's 64,000+ students and JPMorgan's 300,000-person "AI Made Easy" curriculum demonstrate institutional commitment — while a peer-reviewed study of 375 preservice teachers finds AI readiness explains 70.8% of variance in lesson design quality, making educator capability the decisive variable. The OECD "slow AI" principle gained empirical weight: generic AI tools caused 17% worse exam performance despite higher practice scores, reinforcing that curriculum design requires iterative pedagogical scaffolding rather than one-shot generation. Institutional mandates expanded globally: India's CBSE rolled out mandatory Computational Thinking & AI curriculum for Classes 3-8 (affecting millions of students, with composite skill labs required by Aug 2027); Italy formalized national guidelines (effective Sept 2026) for AI-supported curriculum design with explicit 6-step pedagogical workflow. However, June evidence documents critical barriers: empirical research (N=1,498 Vietnamese students) revealed AI-assisted work improves output quality but produces an average 2.07-point learning gap between assignment quality and actual knowledge mastery—validating OECD's "supervision debt" thesis. Instructional design experts identified fundamental AI limitations (context blindness, bias propagation, inability to extract tacit knowledge from domain experts) that prevent full automation, positioning human-AI teaming as the maturity path. The practice remains: deployment-ready at scale, with quantified time savings and institutional adoption, yet fundamentally constrained by unresolved learning effectiveness gaps and universal requirement for expert human oversight of all AI-generated curriculum outputs.