You Can Use AI All You Want. If It Lives in IT, Results Won't Come.
A four-stage operating lens for leaders who want AI to show up in the numbers, not just the narrative.
Executive Summary
Most organisations are doing AI. Almost none are running AI businesses. The gap between the two is not a technology problem — it is a management problem.
This article gives you the operating lens you will use before you build your control system. It answers four questions that every CEO, COO, and CFO should be able to answer clearly: What stage are we at? What does progress actually look like in numbers? What is specifically blocking the next stage? And who in the leadership team is accountable for closing that gap?
The framework has four stages: Establish Trust, Foundation, Reinvention, and Advantage. Each is defined by its business outcome, not by what technology gets deployed. The sequence is non-negotiable — each stage's output is the prerequisite for the next.
The central argument is this: the organisations that will define their industries over the next decade are not the ones with the most sophisticated AI. They are the ones that changed their management systems — their targets, their performance frameworks, their incentives, their operating models — to capture the value that AI makes available. Technology is necessary. Management system change is what converts it to results.
If you read nothing else, read Stage 3. That is where most organisations are stuck, where the critical distinction between deployment and reinvention lives, and where the management decisions that determine whether AI shows up in the P&L actually get made.
How to Use This Article
This is the operating lens you apply before you build your control system — not after. A control system built without a shared mental model produces governance theatre. This article builds the mental model first.
Use this article to align the leadership team before building detailed governance, investment, risk, and execution controls.
The Wrong Question
The most common question in boardrooms today is the wrong one.
"Are we doing AI?" produces a yes or no answer. A yes answer tells you almost nothing useful. It does not tell you whether AI is delivering business value, how far you have to go, or what is specifically blocking your progress. It just tells you that somewhere in the organisation, someone is doing something with AI.
The right question is: what level are we at, and what is specifically blocking us from reaching the next one?
In the 1970s, NASA developed the Technology Readiness Level scale — a nine-level system for measuring how mature a technology was, from basic concept to fully operational system. 1 The insight was simple: rather than asking "does this work?", you asked "how ready is this, and what precisely needs to happen next?" Enterprise AI adoption follows the same logic. Most organisations are stalled at the equivalent of level three — the proof of concept that shows promise in isolation and delivers almost nothing at the business level.
This article offers four stages, defined entirely by business outcomes. The stages are Establish Trust, Foundation, Reinvention, and Advantage.
Condition Zero: This Belongs on the CEO's Desk, Not the CIO's
When AI transformation is owned by IT, it produces IT outcomes. Systems get deployed. Costs get managed. Compliance gets maintained. These are necessary. They are not transformation.
Finance runs on technology. We don’t put the CFO under IT. AI is no different — it changes how the business works, so it has to be owned where business decisions are made.
In practice, three roles work in concert. The CEO sets the ambition, maintains the pressure, and holds the organisation accountable — without this, every stage gets negotiated rather than executed. The COO drives the transformation: owns the reinvention of operations, makes the workload sequencing decisions, and ensures business units actually change. The CIO or CTO enables the platform and the data — without which Stages 2 and 3 have no foundation. All three are essential. The point is not that IT is irrelevant — it is that accountability for business outcomes cannot sit inside a technology function.
Microsoft's own research found that AI power users — those who get the most transformative value from the technology — are 61% more likely to hear from their CEO about the importance of using AI at work. 2 That correlation is not accidental. Top-down intent is what converts individual AI use into organisational transformation.
Without this condition, the four stages below are a CIO's roadmap. With it, they are a business transformation programme.
The Framework: Four Stages, One Direction of Travel
Each stage is defined by what the business can demonstrably do differently at the end of it. The sequence is non-negotiable. You cannot reinvent workloads on data you do not trust. You cannot generate sustainable revenue from insight built on governance you have not established.
Two forces run through all four stages and together determine whether the framework succeeds or fails.
The first is employee trust. An organisation whose employees trust AI — and trust the organisation's intentions with AI — will outperform one that does not at every stage, regardless of technology quality. This trust must be built deliberately from Stage 1. By the time you need it in Stage 3, it is either there or it is not.
The second is management system alignment. Trust enables people to use AI. Management systems — targets, performance reviews, compensation, accountability structures — determine whether the organisation captures the value. An organisation where the performance framework still rewards the old way of working will plateau at productivity gains and never convert them to business outcomes. This is the most common reason AI programmes look successful on adoption dashboards and invisible on P&L statements. Both forces are required. Neither is sufficient alone.
The Four-Stage Operating Lens
Establish Trust
Governed access, visible usage, employee confidence.
Foundation
Trusted data, productivity signal, redeployed capacity.
Reinvention
AI-native workloads, reset targets, measurable output.
Advantage
New revenue, customer differentiation, compounding moat.
Stage Metrics: What Good Looks Like in Numbers
These are directional thresholds, not precise benchmarks. The purpose is to give leadership teams a shared signal set — something a CEO, CFO, and COO can look at together and agree on what "we are through this stage" actually means.
| Stage | Name | Primary Signal | Key Metrics | Directional Target |
|---|---|---|---|---|
| 1 | Establish Trust | Risk reduced; baseline visible | % workforce on governed AI platform | >70% of target population within 90 days of launch |
| Shadow AI exposure reduction via DLP monitoring | >50% reduction in unsanctioned AI data events | |||
| Employee trust sentiment via AI intent pulse survey | >60% confidence score that AI is being deployed in their interests | |||
| 2 | Foundation | Productivity real; data trusted | % of saved hours demonstrably redeployed to output | >50% of captured hours converted to measurable additional output |
| AI-ready data domains unlocked | >80% of priority workload data classified, governed, and accessible | |||
| AI output error rate by domain | Declining quarter-on-quarter; below threshold agreed per use case | |||
| 3 | Reinvention | Operations transformed | Cost per transaction reduction in flagship workloads | >20% reduction in at least 3 reinvented workloads within 12 months |
| Cycle time reduction | >25% in flagship processes | |||
| Throughput increase | >30% output per FTE in reinvented functions | |||
| Performance framework alignment | 100% of reinvented workload teams operating on updated targets | |||
| 4 | Advantage | Revenue grows; moat compounds | % revenue from AI-enabled products or models | Defined by business model; tracked as named line in management accounts |
| Customer lifetime value change | Measurable improvement in AI-served segments vs control | |||
| Speed to market vs prior baseline | Quantified per product or service category |
A note for the CFO on Stage 2: the metric that matters is not hours saved — it is hours redeployed. If the organisation cannot demonstrate what happened to the productivity gain, Stage 2 is generating activity, not business value. Productivity that cannot be traced to output is not yet an asset.
Stage 1: Establish Trust
You cannot build on what your people will not use and your organisation cannot see.
Right now, AI is in your organisation. Seventy-eight per cent of knowledge workers are bringing their own AI tools to work — personal accounts, free tiers, consumer applications with company data. 3 This is not defiance. It is a rational response to a productivity tool that works and no sanctioned alternative being available. The employees using it report saving time, improving quality, and enjoying their work more. 4 They are not the problem. The absence of governance around what they are doing is.
Stage 1 replaces chaos with governed access. The organisation provisions managed, policy-compliant AI and creates the first observable layer of AI usage across the business. Shadow AI does not disappear by being banned. It disappears by being made unnecessary.
But Stage 1 has a second dimension, equally important and far less often addressed. Every employee is watching what the company does with AI and drawing conclusions about their own future. Fifty-three per cent of people who use AI at work worry that using it on important tasks makes them look replaceable. 5 In that environment, the framing of Stage 1 is a strategic decision. If the first message is about policy, monitoring, and restriction, trust is damaged before it is built. If the first message is about access, support, and honest conversation about what AI will and will not mean for people's roles, trust becomes the foundation on which everything else is built.
Vague reassurances fail. "AI will create new opportunities" is read as hedging. What builds trust is specificity: here is what this AI will do in our organisation, here is what it will not do, here is what your role looks like as we progress, here is what the organisation commits to you during the transition. That conversation belongs in Stage 1 — not Stage 3, when roles are already changing.
People: Sanctioned AI access provided to target population. Explicit, specific communication on AI intent delivered at all levels. AI champions identified from within the workforce. Acceptable use policy developed with employee input.
Process: AI governance committee formed with business representation. Usage monitoring active. Data classification reviewed. Third-party AI inventory completed — software vendors with AI features enabled by default are governed alongside internally deployed tools.
Technology: Managed, policy-compliant AI platform provisioned. Data loss prevention active.
The blocker: Framing Stage 1 as a security initiative. The message must be "we are giving you AI that is safe to use and we are being straight with you about what that means." Anything less and the shadow AI problem continues underground while the official programme builds on sand.
Stage 2: Foundation
AI is only as intelligent as the data it can reach — and only as useful as the people who trust what it tells them.
The most common reason AI implementations underdeliver is not a model problem. It is a data problem. Siloed, inconsistent, inaccessible data produces unreliable AI output regardless of how sophisticated the technology is. Stage 2 establishes a data operating model: who owns data, how it flows, what is trustworthy enough to give to an AI, and what governance exists around sensitive data in AI contexts.
There is a dimension to data quality that directly affects the employee trust built in Stage 1. If an AI tool gives an employee a wrong answer because the underlying data was bad, that employee loses confidence in the tool — and rebuilding that confidence is significantly harder than building it correctly the first time. Every bad AI output is a withdrawal from the trust account. Data quality is therefore not just a technical requirement. It is a trust-maintenance requirement.
Stage 2 is also where the first measurable business signal appears: productivity. Microsoft's Copilot research found that users read 11% fewer emails, edited 10% more documents, and the heaviest users saved the equivalent of an entire working day per month. 6 These gains are the right metric for this stage — they validate the investment and build internal confidence.
But productivity is a false destination. Productivity gains at the process level rarely flow through to the bottom line without deliberate action — and the management system change required to convert saved time into additional output is a Stage 2 decision, not a Stage 3 one. The COO and HR leadership must update output expectations and performance frameworks for AI-assisted roles as productivity becomes visible, not after it has plateaued. If you wait until Stage 3 to address this, you will have twelve months of productivity data and almost no output growth to show for it. 7
People: Data owners identified per business domain. AI Centre of Excellence established or scoped. AI literacy programme launched.
Process: Data governance framework live. AI-ready pipelines designed for priority workloads. Principle: good-enough data for priority workloads, not perfect data as a precondition.
Technology: Data catalogue and quality tooling active. Knowledge retrieval infrastructure established.
The blocker: Paralysis by perfection — attempting to solve all data problems before doing anything. And the management system failure of treating productivity as an endpoint rather than a conversion challenge.
Stage 3: Reinvention
If your management system didn't change, your AI programme didn't either.
This is the stage where most organisations fail quietly. The technology is deployed, the adoption dashboards look healthy, and the P&L is unmoved. The reason is almost always the same: the workloads were not reinvented — they were automated. And the management systems were not updated — they were left in place.
Deployment vs. Reinvention
AI Deployment
Adds AI to the existing workflow.
- Old process remains intact
- Adoption looks good
- Productivity may improve
- P&L impact is often unclear
AI Reinvention
Redesigns the workflow around what AI now makes possible.
- Process is rebuilt AI-native
- Targets and incentives reset
- Output and cycle time move
- P&L impact is measurable
Deployment vs reinvention: the distinction that determines everything
Deploying AI into a workload asks "where can we add AI to what we already do?" Reinventing a workload for AI asks "if we were designing this from scratch, knowing what AI can do, what would it look like?" The answers are almost always radically different. A five-step approval process with three human checkpoints becomes a one-step AI-verified recommendation with a human exception handler. A weekly management report produced by a team of analysts becomes a live dashboard with AI-generated narrative and anomaly flagging. A customer onboarding journey that takes ten days becomes a real-time personalised experience.
The prioritisation model: where to start
Not all workloads are equal candidates. The organisations that execute Stage 3 most effectively limit their initial scope to three to five flagship reinventions. Prioritise workloads that meet the majority of these criteria: high volume and high frequency; high cost per transaction in current form; repeatable structure with clear inputs and measurable outputs; data readiness established in Stage 2; direct and traceable margin impact within a measurable timeframe.
These flagship reinventions serve two purposes. They produce genuine transformation in contained scope. And they generate the proof points that make the next wave of reinvention politically and financially possible.
Capital allocation in Stage 3
Prioritisation is not only a sequencing decision — it is a capital decision. Each flagship reinvention should be evaluated against a simple investment thesis: what is the current cost of running this process at scale, what does a fully reinvented version cost to build and operate, and what is the payback period? For most high-volume, high-cost operational processes, reinvention payback windows of twelve to twenty-four months are achievable — but this requires the COO and CFO to agree the investment case per workload before reinvention begins, not after. The build-versus-buy decision also surfaces here: some workloads are better served by configuring an existing AI platform; others justify proprietary development because the operational data and decision logic represent a genuine competitive asset. That decision should be made consciously, not by default.
The productivity-to-output conversion problem
If a team previously completed four units of work per day and AI now enables four units in two hours, the organisation gets nothing from that recaptured time unless it decides what to do with it. Without new expectations and cultural permission to operate differently, unchanged output is the default. This is a rational response to an unclear expectation — not a failure of the technology.
Management systems are the conversion mechanism
This is the point where trust reaches its limit as an explanation, and management systems become the determining variable. Trust enables people to use AI. Performance frameworks determine what they do with it. If compensation still rewards hours worked rather than output delivered, output will not grow. If performance reviews still benchmark individuals against the old process baseline, individuals will optimise for the old baseline. If targets were set before the workload was reinvented and were never updated after, the organisation has officially reinvented the process and unofficially kept the old expectations in place. The result is productivity gains that plateau and never convert.
The fix is explicit and operational: performance reviews updated to reflect AI-augmented output expectations. Targets reset to the new baseline of what the reinvented workload can deliver. Compensation frameworks that reward output growth. Manager accountability for the productivity-to-output conversion rate in their teams. This work belongs to the COO and HR leadership, not the CTO — and it must be sequenced alongside the workload reinvention, not after it.
A concrete example: a finance team that previously produced a monthly close in ten days and now produces it in three does not automatically produce more strategic analysis in the remaining seven days. Someone in the management chain must set the expectation that those seven days produce a named deliverable with a named owner and a named deadline. Without that, the ten-day close became a three-day close — and the organisation captured one-third of the available value.
Leadership alignment as a prerequisite
Stage 3 stalls when the leadership team has different answers to "what is AI for?" The CEO thinks competitive advantage. The CFO thinks cost reduction. The COO thinks operational risk. Until those converge into one shared ambition — owned at the top and communicated consistently — reinvention gets negotiated rather than executed.
People: Roles redesigned with employee involvement. Output targets reset with transparency. Performance frameworks updated. Manager enablement programme active.
Process: Three to five high-priority workloads rebuilt AI-native using the prioritisation criteria above. Decision rights reallocated between human and machine. Investment case agreed per workload before build begins.
Technology: AI agents in production. Process automation integrated with AI reasoning. Feedback loops built so the AI improves from operational data.
The blocker: Two equal and opposite failure modes. Attempting to reinvent everything simultaneously produces chaos without transformation. Reinventing processes without updating the management systems that govern them produces faster processes with the same output — and a leadership team wondering why the P&L hasn't moved.
Stage 4: Advantage
A competitor can copy what you build. They cannot copy how you became the organisation that built it.
Stage 4 is where AI stops being a cost tool and becomes a revenue tool. It arrives when two things converge: accumulated AI-generated insight that competitors cannot replicate, and AI-native customer experiences that create genuine switching costs.
New revenue in Stage 4 takes three forms. Product and service innovation — AI-generated insight identifying customer needs previously invisible. Outcome-based business models — operational AI data enabling a move from selling a product to selling a guaranteed result, the model that allowed Rolls-Royce to pioneer "Power by the Hour" pricing and that is now available at scale to any organisation with sufficient operational data. And data monetisation — proprietary data enriched through Stages 1 to 3 becoming a licensable asset or the foundation of an AI-native platform.
The moat that compounds
External AI products can be observed and replicated. A competitor can study your AI-powered customer experience and deploy something equivalent within months. What they cannot replicate is the internal operating model that produced it: the reinvented workloads, the proprietary data pipelines, the decision logic refined through years of operational use, and the workforce that built it and keeps improving it.
This is the Toyota Production System dynamic applied to AI. When Toyota developed its production system, competitors visited the factories and read the published descriptions. Almost none successfully replicated it — because what they were copying was the visible output of an organisational capability built over years. 8 The techniques were observable. The capability was not.
Enterprise AI operates the same way, with one additional force: the advantage compounds. Every reinvented workload generates operational data that makes the AI more calibrated to your specific business context. Every employee using the technology at full capacity contributes to its refinement. The moat widens as you operate. This is why Stage 3 is where the competitive moat is forged — Stage 4 is where it becomes visible in the market.
A note on Stage 4 scope
The operating detail of Stage 4 — the revenue model design, the organisational structure required, the investor narrative, and the build-versus-platform decisions — starts with one design principle: the organisations that reach Stage 4 fastest are the ones that defined this ambition from Stage 1 and used it to make choices in every stage that followed. AI as a cost programme optimises you into mediocrity. AI as a revenue ambition, held from the beginning, transforms the business.
People: Organisation thinks AI-natively. Leadership measures AI contribution to revenue as a named metric. Employees see AI as a career amplifier — because the organisation made and kept that commitment from Stage 1.
Process: AI output feeds product development, pricing, customer strategy, and capital allocation.
Technology: Models fine-tuned on proprietary data. Continuous learning loops operational. AI platform potentially customer-facing.
The blocker: Treating Stage 4 as a destination to arrive at rather than an ambition to design for. Every sequencing decision in the earlier stages either opens or closes the path here.
The Five Threads That Run Through Everything
Management system alignment is the primary conversion mechanism at every stage. Targets, performance reviews, compensation, and accountability structures must be updated to match the new operating model — stage by stage. Trust enables people to use AI. Management systems determine what the organisation does with it. If the management system does not change, the AI programme will not show up in the numbers — regardless of adoption rates, regardless of platform quality, regardless of employee willingness.
Employee trust and cultural readiness is the most underestimated precondition. Built in Stage 1, it determines whether employees use AI at full capacity in Stages 3 and 4 or perform compliance while protecting themselves from the consequences. An organisation where employees do not trust the technology or the organisation's intentions will plateau at Stage 2 productivity and never reach Stage 3 output.
Measurement and value realisation must be defined before each stage begins. Productivity signal at Stage 2, tracked as hours redeployed not just hours saved. Output, cycle time, and cost per transaction at Stage 3. Revenue at Stage 4. Without defined metrics and thresholds, AI programmes drift from outcomes into activity.
Skills and AI literacy address the most consistent bottleneck: not technology, but human capability. Only 39% of people who use AI at work have received any AI training from their employer. 9 That gap compounds at every stage.
Accountability and ownership answers who is responsible for AI delivering results with real authority and real budget. CEO sets the ambition and holds accountability. COO drives operational transformation. CIO/CTO enables the platform and data. If these three roles are not aligned and active, each stage produces compliance rather than transformation.
So, What Level Are You At?
These are not diagnostic questions. They are positioning statements. If you cannot answer them clearly, you are not through the stage.
Stage 1: If you cannot name the percentage of your workforce on a governed AI platform and demonstrate a measurable reduction in shadow AI exposure, you are not through Stage 1.
Stage 2: If you cannot show where the productivity gains from Stage 2 went — in output terms, not hours-saved terms — you are not through Stage 2.
Stage 3: If you cannot name three workloads that have been redesigned from scratch for AI, with updated performance targets and a measurable cost or cycle time improvement, you are not in Stage 3. You are running pilots.
Stage 4: If no one in the leadership team can articulate the first AI-enabled revenue stream and name who is accountable for building it, Stage 4 is aspiration, not strategy.
The companies that will define their industries over the next decade are not the ones with the most sophisticated AI. They are the ones that changed their management systems to capture the value AI makes available — and that started building the trust required to do so from the very first conversation, not after the resistance appeared.
The question is not whether you are doing AI.
It is what, specifically, is blocking you from the next level — and whether the person accountable for closing that gap is in the room.
Footnotes
- The Technology Readiness Level framework was originally developed by NASA in the 1970s and has since been adopted by the European Union's Horizon Europe funding programme as a standard method for assessing innovation maturity. EU Funding Playbook , eufundingplaybook.fi/large/
- Microsoft and LinkedIn, 2024 Work Trend Index Annual Report: AI at Work Is Here. Now Comes the Hard Part , May 2024. Survey of 31,000 knowledge workers across 31 countries. microsoft.com/worklab
- Ibid. 78% of AI users are bringing their own AI tools to work (BYOAI), across all generations and company sizes.
- Ibid. AI users report: saving time (90%), ability to focus on most important work (85%), greater creativity (84%), enjoying work more (83%).
- Ibid. 53% of people who use AI at work worry that using it on important tasks makes them look replaceable.
- Microsoft WorkLab Copilot Study: a six-month randomised control trial of 3,000 individuals across 60 customers. Copilot users read 11% fewer individual emails; edited 10% more documents in Word, Excel, and PowerPoint; the heaviest users (top 5%) summarised 8 hours of meetings per month. Work Trend Index 2024.
- Microsoft and LinkedIn, 2024 Work Trend Index : 59% of leaders worry about quantifying the productivity gains of AI; 60% say their organisation's leadership lacks a plan to go from individual impact to driving the bottom line.
- The Toyota Production System analogy draws on the documented history of competitive attempts to replicate Toyota's operational model following the publication of The Machine That Changed the World (Womack, Jones & Roos, 1990), in which widespread lean manufacturing adoption failed to replicate Toyota's sustained competitive advantage due to the cultural and organisational depth of the system.
- Microsoft and LinkedIn, 2024 Work Trend Index : only 39% of people globally who use AI at work have received any AI training from their company; only 25% of companies are planning to offer generative AI training.
This framework synthesises published AI adoption models from Microsoft's Cloud Adoption Framework, McKinsey's AI transformation research, Gartner's AI maturity model, IBM's enterprise AI guidance, and Google Cloud's AI adoption framework, alongside primary research from Microsoft WorkLab's Work Trend Index. It is informed by the consistent gaps those frameworks leave unaddressed: management system change as the primary conversion mechanism between AI adoption and business outcomes, workload reinvention as distinct from deployment, employee trust as a strategic precondition, and business advantage as the product of internal capability rather than external imitation.


