Beyond the Glue Layer: The Rise of Governed Semantic Operational Intelligence

The Transition No One Can Afford to Get Wrong

Enterprise AI is undergoing its most consequential architectural transition since the move from on-premises data warehouses to cloud-native analytics. The trajectory is now unmistakable: organizations moved from standalone copilots to autonomous agents, and are now racing toward enterprise-wide orchestration fabrics — unified platforms designed to coordinate AI capabilities across systems, teams, and workflows at scale.

McKinsey’s QuantumBlack practice recently published an important piece — Creating a Future-Proof Enterprise Agentic Platform Architecture by Anne-Gabrielle Starkloff, Dave Kerr, Dante Gabrielli, Stephan Schneider, and Wayne Vest. The article validates what many of us in the enterprise AI space have been working toward: that the era of isolated AI pilots is ending, and a new infrastructure category — one built around composable, protocol-driven, multi-runtime orchestration — is emerging.

This is a significant signal. When McKinsey publishes an architectural reference for enterprise AI, it shapes how hundreds of CIOs and CTOs think about their technology roadmaps. The direction they describe is real. The category is real.

But orchestration is only the beginning.

Enterprises do not merely need smarter agents. They need governed operational intelligence systems capable of surviving contact with reality — systems that understand operational meaning, enforce policy boundaries, maintain evidence lineage, and produce decisions that are explainable, replayable, and accountable.

The distinction matters enormously, because getting the architecture right at this stage will determine which enterprises build durable AI capability and which spend the next three years assembling infrastructure that still cannot answer the fundamental question: on what basis was this decision made, and who authorized the action?

Legacy chaos of disconnected copilots and fragmented systems contrasted with a governed semantic intelligence fabric integrating governance, evidence, workflows, and outcomes

McKinsey’s Architecture Is an Important Signal

The QuantumBlack team identifies a paradox that will be familiar to any enterprise AI leader: “The technology is being widely adopted, but still shows little measurable bottom-line impact.” Their diagnosis is sharp — horizontal AI solutions like copilots and chatbots scale easily but lack business impact, while vertical applications embedded in workflows deliver genuine transformation. The cure, they argue, is a unified platform architecture built around composable and compostable design, protocol-first interoperability, GraphRAG, evaluation-driven development, memory management, agentic orchestration, and multi-runtime execution.

The architecture they describe is layered and thoughtful. At its foundation sit agentic runtimes — selectable execution environments like MS AI Foundry, Google Vertex, and AWS Bedrock. Above that, an interfaces and orchestration layer provides connectivity and coordination using open protocols like A2A and MCP. Agentic shared services — evaluation, marketplaces, memory, and feedback mechanisms — provide cross-cutting capabilities. And an integrated glue layer “connects in-house systems, data and workflows with multiple external agent ecosystems and runtimes.”

McKinsey's Enterprise Agentic Platform Architecture diagram showing the full stack — agentic systems, runtimes, interfaces, shared services, and cloud infrastructure — alongside their Build vs Partner vs Buy decision framework. Source: McKinsey & Company

Source: McKinsey & Company / QuantumBlack

The article articulates four design principles that any enterprise architect should internalize: be relentlessly protocol-focused for interoperability; selectively build for differentiation where “functionality is specific to a given industry, region, or locally competitive landscape”; design for production from the start; and continuously integrate emerging market capabilities.

Simplified enterprise agentic platform architecture stack showing runtimes, protocols, GraphRAG, workflows, and orchestration layers

Several themes in the McKinsey framing deserve particular attention:

  • Composable and compostable — “Composable means using modular building blocks that can be combined into complex systems; compostable means that components can be replaced without redesign.” This is the right foundational principle for an architecture that must evolve.
  • Production from day one — “Most initiatives get stuck at the PoC stage, or require major rework because their production requirements are treated as an after-thought.” This observation alone justifies the entire paper.
  • Protocol-first — the principle that interoperability should be structural, not accidental, with MCP and A2A as emergent standards
  • GraphRAG — the “convergence of autonomous agents and graph databases” for contextual precision and improved auditability over standard retrieval

This is a credible and important direction for enterprise AI. The QuantumBlack team has produced the clearest public articulation of what the enterprise agentic platform layer should look like.

But the architecture they describe is still fundamentally infrastructure-centric. It answers the question of how agents communicate, coordinate, and execute. It does not fully address the deeper question that regulated enterprises must answer: what governs the meaning, the decisions, and the accountability of what those agents do?


The Enterprise Problem Is Not Really AI

Here is the uncomfortable truth that most enterprise AI architecture discussions avoid: the fundamental problem enterprises face is not an AI problem. It is an operational coherence problem.

Consider what a typical enterprise looks like from the inside:

  • Fragmented operational truth — the same customer, the same product, the same process described differently across a dozen systems
  • Fragmented decisions — choices made in silos, without visibility into what other parts of the organization are deciding or why
  • Fragmented workflows — work that crosses system boundaries through manual handoffs, email chains, and tribal knowledge
  • Fragmented accountability — no clear lineage from insight to decision to action to outcome
  • Fragmented institutional memory — critical knowledge trapped in the heads of long-tenured employees, buried in SharePoint folders, or scattered across email threads
Enterprise fragmentation map showing CRM, ERP, MES, eQMS, SharePoint, Teams, Jira, Email, ServiceNow, and tribal knowledge connected by chaotic integration lines

ERP, CRM, MES, eQMS, SharePoint, Teams, Jira, ServiceNow, email, dashboards, SOPs — these systems were never designed to share operational meaning. They were designed to serve individual functions. The result is that enterprises have spent decades building sophisticated systems of record that collectively lack a shared understanding of what the enterprise actually knows, what it has decided, and what it is doing about it.

LLMs did not create this fragmentation. They simply exposed it. When organizations began deploying AI that needed to reason across systems, the absence of unified operational meaning became immediately visible. Retrieval-augmented generation hallucinates not because the models are flawed, but because the knowledge they retrieve is contradictory, incomplete, and semantically disconnected.

Orchestration alone does not solve operational meaning.

You can build the most elegant agent coordination framework in the world, and it will still produce unreliable results if the agents are reasoning over fragmented, semantically incoherent operational data. The problem is not the plumbing. The problem is what flows through it.


The Missing Layer: Semantic Operational Truth

This is the critical gap in most enterprise agentic architectures today — including the one McKinsey describes. The architectures address protocols, orchestration, retrieval, and workflows. They do not fully address what we call the semantic operational truth layer: the structural foundation that gives enterprise AI a shared, governed understanding of what things mean, how they relate, and what actions are permissible given the current operational context.

McKinsey’s architecture talks about GraphRAG, which is a meaningful step beyond simple vector retrieval. But enterprises ultimately need more than structured knowledge retrieval. They need:

  • Operational semantics — a formal, machine-interpretable model of enterprise entities, relationships, and processes
  • Evidence interpretation — the ability to distinguish between facts, derivations, hypotheses, and risks
  • Action governance — structural controls on what actions are permissible, under what conditions, and with what authorization
  • Decision lineage — traceable links from evidence through reasoning to decisions to actions to outcomes
  • Interpretive boundaries — explicit limits on how far AI can autonomously reason before requiring human judgment
  • Durable operational state — persistent, versioned representations of work-in-progress that survive system restarts, agent failures, and personnel changes

This is what a Semantic Control Plane provides. It is the layer that transforms an enterprise agentic platform from a coordination framework into a governed operational intelligence system.

Layered semantic control plane showing Goals, Workflows, Jobs, Tasks, Capabilities, Skills, Policies, Actions, and Outcomes as interconnected governance layers

The Semantic Control Plane encompasses a structured type system — operational goal definitions, workflow specifications, job and task types, capability and skill descriptors, tool registries, action proposals, and outcome records. These are not abstract ontology exercises. They are the runtime vocabulary through which enterprise AI understands its operational world, proposes actions, and records what happened and why.

Without this layer, enterprise AI is flying blind — coordinating efficiently but reasoning unreliably.


From RAG to Operational Intelligence

The evolution from retrieval-augmented generation to operational intelligence systems represents a fundamental shift in what enterprise AI is designed to do.

Evolution timeline from RAG to GraphRAG to Agent Orchestration to Operational Intelligence Systems, showing increasing sophistication at each stage

RAG retrieves text chunks and hopes the language model can synthesize them correctly. GraphRAG structures knowledge into graphs and retrieves connected subgraphs, improving coherence. Agent orchestration coordinates multiple AI capabilities to execute multi-step workflows. Each step is a genuine improvement.

But operational intelligence requires something qualitatively different: evidence-grounded synthesis where every claim is traceable to its source, every derivation is explicit, and every recommendation carries a structured assessment of its epistemic status.

This is where the Interpretive Boundary Layer becomes essential. In a governed operational intelligence system, every piece of information that flows through the system is classified along three dimensions:

Epistemic status — Is this a verified fact, a derived conclusion, a hypothesis, or an identified risk? The system must know the difference, because the appropriate action is fundamentally different for each.

Causality — Is the relationship causal, correlational, or unknown? An agent that treats a correlation as a cause will recommend actions that may be ineffective or harmful.

Actionability — Is this safe to act on autonomously, does it require human review, is it restricted to human-only decision-making, or is action explicitly blocked? This classification determines the governance boundary for every AI-initiated action.

Three-dimensional interpretive boundary classification model showing epistemic status, causality, and actionability axes with objects positioned in the classification space

This three-dimensional classification is not a reporting feature. It is a runtime governance mechanism. Every recommendation, every proposed action, every synthesized insight passes through this interpretive boundary before it can influence enterprise operations. The system doesn’t merely produce outputs — it produces outputs that carry their own trust receipts, provenance chains, and actionability classifications.

“Every recommendation, action, and outcome must remain explainable, replayable, and attributable.”

The difference between an agentic platform and an operational intelligence system is precisely this: the operational intelligence system knows what it knows, knows what it doesn’t know, and knows what it is and isn’t authorized to do about either.


Why Workflows Are Not Enough

Traditional agentic systems execute workflows. They call tools, pass data between steps, and orchestrate runtimes. This is valuable. But it treats the enterprise’s operational work as something that exists primarily inside the systems being orchestrated — the Jiras, the ServiceNows, the Salesforces.

A governed operational intelligence system takes a fundamentally different approach. It does not merely orchestrate work across external systems. It governs operational work-state itself through what we call the Agentic Work Substrate.

Agentic Work Substrate architecture showing native operational substrate with durable work items replacing fragmented operational systems, with external systems as projections

The Agentic Work Substrate provides:

  • Durable work items — persistent, semantically typed representations of work that survive system boundaries, agent restarts, and organizational changes
  • Work-state lifecycle management — structured transitions from creation through execution to completion, with governed escalation paths at every stage
  • Dependency graphs — explicit modeling of which work depends on which other work, enabling intelligent scheduling, parallelization, and impact analysis
  • Escalation and handoff protocols — governed transitions between AI-autonomous execution, human review, and human-only decision-making
  • Evidence-linked execution — every work item is connected to the evidence that created it, the decisions that shaped it, and the outcomes it produced

This shifts the center of gravity in enterprise operations. External enterprise systems — Jira, Salesforce, ServiceNow, workflow engines — become optional projections of the operational substrate, not the substrate itself. The semantic operational layer becomes the authoritative source of work state. External systems become execution surfaces and display interfaces.

This is a profound strategic shift. It means that adding or replacing an enterprise system does not require rewiring the operational intelligence layer. The operational truth persists independently of the systems through which it is expressed.


Decisions, Actions, Outcomes

Most enterprise AI architectures stop at recommendations. The system retrieves information, synthesizes it, and presents a suggested course of action. What happens next — the decision, the action, the outcome, and the learning — lives outside the AI architecture entirely, scattered across email threads, approval workflows, and operational systems.

A governed operational intelligence system explicitly models the full decision lifecycle:

Decision Context Graph showing the complete lifecycle from Decision through Action through Outcome through Learning through Precedent, forming a continuous improvement loop

Recommendations are evidence-grounded proposals with explicit confidence levels, supporting evidence, and identified risks. Decisions record who (or what) decided, on what basis, under what authority, and with what alternatives considered. Actions are governed executions with pre-conditions, authorization checks, and rollback capabilities. Outcomes capture what actually happened, measured against what was expected. Attribution links every outcome back through the decision chain to the evidence that informed it. Precedent makes historical decisions searchable and reusable, enabling the system to learn not just from data but from institutional judgment. Replay allows any decision to be re-examined with full context — not just what was decided, but why, and what was known at the time.

This is the Decision Context Graph — a persistent, queryable representation of enterprise decision-making that enables true organizational learning. When a similar situation arises in the future, the system does not start from scratch. It can identify relevant precedents, understand what worked and what didn’t, and propose actions informed by institutional experience.

“The real enterprise problem is fragmented operational truth.”

Without this structure, enterprise AI produces recommendations that disappear into the void. With it, every decision becomes organizational knowledge.


Governance Is Not a Feature

Perhaps the most consequential architectural decision in enterprise AI is where governance lives. In most agentic platforms, governance is a layer added on top — an observability dashboard, an audit log, a policy check before deployment.

This is insufficient for regulated enterprises. Governance cannot be bolted on afterward. It must be structural — woven into the runtime fabric of every operation the system performs.

Layered governance architecture showing delegation contracts, capability tokens, authorization fabric, risk ladder, policy engine, and traceability as interconnected structural layers

A structurally governed operational intelligence system implements governance through:

  • Delegation Contracts — formal specifications of what authority has been delegated to AI agents, by whom, under what conditions, and with what constraints. Delegation is not implicit. It is explicit, versioned, and auditable.
  • Capability Tokens — runtime-enforced permissions that govern which capabilities an agent can exercise in a given context. An agent authorized to recommend is not automatically authorized to act.
  • Action Risk Ladder — a graduated classification of actions by their organizational risk, determining the appropriate level of human oversight. Low-risk, high-confidence actions may proceed autonomously. High-risk actions require explicit human authorization, regardless of the system’s confidence.
  • Structured Proposal Invariant — every action proposed by an AI agent must conform to a structured proposal format that includes the evidence basis, confidence assessment, identified risks, alternative actions considered, and the specific delegation authority under which the action is proposed.
  • Authorization Fabric — a runtime layer that enforces authorization policies across all agent actions, ensuring that no agent can exceed its delegated authority regardless of how it was invoked.
  • Decision Trace — a complete, signed, tamper-evident record of every decision and action, enabling after-the-fact review, regulatory audit, and organizational learning.
  • Compensation Policies — predefined procedures for unwinding or correcting actions when outcomes diverge from expectations or when new information invalidates the basis for a prior decision.

“Governance is not a feature. It is the runtime constitution of enterprise AI.”

This is not overhead. This is the architecture. In the same way that a database’s transaction guarantees are not a feature but the foundation that makes the database trustworthy, governance in operational intelligence is the foundation that makes the system deployable in environments where decisions have real consequences.


The Strategic Shift

Everything described above converges on a strategic insight that will define the next decade of enterprise architecture:

The old model: Enterprise systems are primary. AI orchestrates around them — pulling data from CRMs, triggering workflows in ERPs, writing tickets in ServiceNow. The intelligence is ephemeral. The systems of record remain the source of truth.

The new model: The semantic operational substrate becomes primary. Enterprise systems become execution surfaces — projections of operational intelligence into specialized tools. The intelligence is durable. The operational meaning, decision history, and governance fabric persist independently of any individual system.

Governed coordination fabric showing humans, AI agents, and enterprise systems collaborating through a shared semantic operational layer with governance controls

This is not a minor architectural refinement. It is a fundamental inversion of where operational truth lives in the enterprise. And it has profound implications for technology strategy, vendor selection, integration architecture, and organizational design.

In the old model, changing a CRM means migrating decades of institutional knowledge. In the new model, the CRM is a projection — the institutional knowledge lives in the semantic substrate, and the CRM is one of many surfaces through which it is accessed and acted upon.

In the old model, AI is constrained by the data models and APIs of the systems it orchestrates around. In the new model, AI reasons over a unified semantic model of the enterprise and projects actions into whatever systems are appropriate for execution.

“External systems become execution surfaces. The semantic substrate becomes primary.”

This is the conceptual shift that separates enterprise AI architectures that will scale from those that will calcify. Orchestration frameworks that remain infrastructure-centric — however elegantly designed — will eventually reach a ceiling defined by the semantic incoherence of the systems they orchestrate. Operational intelligence systems that establish a semantic foundation first will compound their capabilities over time, because every new integration, every new agent, every new workflow builds on a shared understanding of what the enterprise knows and how it operates.


Why This Matters for Regulated Industries

The distinction between agentic orchestration and governed operational intelligence is most consequential — and most urgent — in regulated industries.

Healthcare, MedTech, pharmaceuticals, financial services, aerospace, energy, and manufacturing all operate under regulatory frameworks that impose specific requirements on decision-making:

  • Hallucinations are unacceptable. In regulated contexts, an AI-generated statement that cannot be traced to authoritative evidence is not merely wrong — it is a compliance violation. The system must distinguish between what it knows, what it has derived, and what it has inferred.
  • Provenance is required. Every recommendation must be traceable to its evidentiary basis. Regulators do not accept “the model said so” as an explanation.
  • Auditability is required. Every decision must be reconstructable after the fact, with full context — not just the outcome, but the evidence, the reasoning, the alternatives considered, and the authority under which the decision was made.
  • Explainability is required. Stakeholders — including regulators, clinicians, operators, and executives — must be able to understand why a particular action was recommended or taken, in terms that are meaningful to their role.
  • Policy boundaries are required. AI systems must operate within explicit, enforceable constraints that reflect regulatory, organizational, and ethical requirements.
  • Evidence lineage is required. The chain from source data through interpretation to recommendation to action to outcome must be intact and queryable.
  • Escalation is required. The system must know when it has reached the boundary of its authorized autonomy and must reliably escalate to human judgment.
Replay and explainability architecture showing how any decision can be rewound and examined with full evidence context, reasoning chain, and authorization history

These are not nice-to-have features. They are table stakes for deployment in regulated environments. And they cannot be achieved through orchestration alone — they require the structural governance, semantic precision, and operational state management that only a governed operational intelligence architecture provides.

This is where the SAOA architecture — Semantic Agentic Operational Architecture — becomes strategically differentiated. It was designed from the ground up for environments where every recommendation must be evidence-grounded, every action must be policy-bounded, every decision must be explainable, and every outcome must be attributable.


The Architecture the Enterprise Actually Needs

The enterprise AI landscape is converging. The industry is moving past copilots, past isolated agents, past simple workflow automation. The question is no longer whether enterprises need a unified AI architecture. The question is what that architecture must actually contain.

Future enterprise operating model showing a semantic operational intelligence system as the foundational layer connecting all enterprise functions, decisions, and actions

The QuantumBlack team’s contribution is valuable because it names the category and establishes the directional vector. Composable and compostable architecture, protocol-first interoperability, GraphRAG, evaluation-driven development, multi-runtime execution — these are real and necessary components.

But the fully realized architecture requires more. It requires:

  • A semantic operational truth layer that gives AI a shared, governed understanding of enterprise meaning
  • An interpretive boundary layer that classifies every claim by its epistemic status, causality, and actionability
  • An agentic work substrate that manages durable operational state independently of external systems
  • A decision context graph that links evidence, decisions, actions, outcomes, and precedent into a persistent institutional memory
  • A governance fabric that enforces delegation, authorization, risk classification, and accountability as structural properties of the runtime — not afterthoughts

These layers together constitute what we call governed semantic operational intelligence — the architecture that emerges when you take the enterprise agentic platform concept seriously and follow it to its logical conclusion.

The next generation of enterprise AI will not be defined by who builds the smartest standalone agent. It will be defined by who builds the most trusted operational intelligence substrate — connecting systems, workflows, evidence, decisions, actions, and outcomes together in production.

The organizations that get this right will not merely automate processes. They will build systems of institutional reasoning — operational intelligence that learns, governs, explains, and compounds institutional knowledge over time.

“The future of enterprise AI is not merely agentic. It is semantic, governed, operational, and accountable.”

Ready to see governed AI in action?

Learn how Empower AI helps regulated enterprises move from pilots to production-grade systems of action.