[release] 5 min · Jun 4, 2026

Microsoft MXC — Agent Sandbox Arrives, Read the Caveats

Microsoft shipped MXC at Build 2026 — an OS-level sandbox for AI agents on Windows. OpenClaw, Hermes, and OpenAI are on board. The early-preview caveat matters.

Microsoft Execution Containers (MXC) ↗ Jun 2, 2026

#microsoft#agent-security#mxc#windows#sandbox#governance#openclaw#hermes

Microsoft shipped the Execution Containers (MXC) SDK in early preview at Build 2026 on June 2 — a policy-driven, OS-level execution layer that lets developers declare exactly what an AI agent can access on Windows, with the kernel enforcing those boundaries at runtime. OpenClaw, Hermes Agent, NVIDIA, Manus, and OpenAI are already listed as launch partners. The problem MXC is solving — agents running with the full authority of the logged-in user — has been the single biggest barrier stopping enterprises from deploying autonomous coding agents on real machines.

TL;DR

What: Microsoft shipped MXC, an OS-level agent sandbox embedded in Windows and WSL — policy-driven, kernel-enforced containment
Who’s in: OpenClaw, Hermes Agent, NVIDIA, Manus, OpenAI, and GitHub Copilot CLI already adopted process isolation
Caveat: Microsoft’s own repo warns that no MXC profiles should currently be treated as security boundaries
Action: Study the policy model now, but do not market MXC-wrapped agents as “secured” until micro-VMs and Agent 365 integration ship

What Happened

I’ve been waiting for this layer to exist for 18 months. Every agent sandbox we’ve covered — IronClaw’s Rust isolator, NanoClaw’s containers, Anthropic’s gVisor-based sessions for Claude Managed Agents — were developer-built workarounds for a problem the OS should own. MXC is Microsoft planting a flag: agent containment belongs at the kernel level, not the framework level.

MXC is not a cloud service. It is an SDK and a policy model — a foundational primitive embedded in Windows and the Windows Subsystem for Linux. Developers write a JSON policy declaring which files, network domains, and processes an agent can touch. MXC enforces those constraints at runtime via the Windows kernel. Three isolation levels ship or are roadmapped: process isolation (fast, lightweight — already adopted by GitHub Copilot CLI), session isolation (separates the agent from the user’s desktop, clipboard, UI, and input devices), and micro-VMs (hardware-backed hypervisor isolation for untrusted code — still on the roadmap alongside Linux containers and Windows 365 for Agents integration).

The partner list tells you who Microsoft considers the agents most likely to run persistently on enterprise Windows machines. OpenClaw now runs its node and gateway on Windows via MXC. NVIDIA ships OpenShell on Windows built on MXC. Hermes Agent (Nous Research) is integrating MXC in its new Windows application. OpenAI is exploring “new patterns for AI agents to safely and efficiently generate and execute code” using MXC. These are not speculative integrations — these are launch commitments from companies already shipping agent infrastructure.

Microsoft’s official MXC GitHub repo states explicitly: no MXC profiles should currently be treated as security boundaries. Developers who ship MXC-wrapped agents as “secure” before micro-VMs and full enforcement land are misrepresenting the guarantee.

Why This Matters

Until MXC, agent sandboxing on Windows was either nonexistent (agents ran with the user’s full permissions) or bolted on through container runtimes that added latency and complexity. Every framework had to solve isolation independently — OpenClaw built its own, IronClaw wrote a Rust isolator, Anthropic runs gVisor per session. None of those solutions could enforce boundaries at the kernel level because they did not own the kernel.

MXC changes the abstraction layer. Instead of each framework implementing its own sandbox, the OS provides a declarative policy surface. Write a JSON file describing what the agent can touch; Windows enforces it. This is the same architectural shift that happened when operating systems took over memory protection from individual applications — the right layer owns the problem.

But “early preview” here is not marketing humility. The current shipping state includes process isolation and session isolation, both useful for reducing blast radius. Micro-VMs — the level you actually need for running untrusted code from the internet — remain on the roadmap. The gap between “process isolation” and “hypervisor-backed containment” is not incremental; it is categorical. Process isolation prevents accidental file access. Micro-VMs prevent a determined adversary from escaping the sandbox entirely. One is convenience. The other is security.

The enterprise governance story has the same gap. Full integration with Agent 365 — meaning Defender, Entra, Intune, and Purview policy controls applied to local agents — arrives in preview in July 2026. Until that ships, MXC is developer-configured. Individual developers write the JSON policies. Enterprise security teams have no centralized way to enforce, audit, or revoke those policies across a fleet of machines. For organizations with compliance requirements, this is the difference between “we have a sandbox” and “we can prove the sandbox was enforced.”

The MXC JSON policy model is worth learning now even if you are not shipping on Windows. The pattern — declarative capability constraints enforced by the runtime — will become standard across platforms. Understanding the policy surface gives you a head start regardless of which OS-level sandbox wins on Linux or macOS.

There is also a lock-in geometry worth naming. MXC is Windows/WSL-only. Cross-platform agent toolchains — Claude Code on macOS, Linux-native CI runners, anything in a Docker-first workflow — need separate isolation stories. Microsoft is betting that enough enterprise agent workloads run on Windows to make MXC the default expectation. If they are right, the path is clear: MXC becomes the expected sandbox, Agent 365 becomes the governance layer, and Entra plus Intune become the control plane. Your IT department’s existing Microsoft tools manage agents the same way they manage laptops. That is a powerful lock-in loop, and it is intentional.

Microsoft also open-sourced the Agent Control Specification (ACS) at Build — an open, vendor-neutral standard that defines how runtime governance is applied across the agent lifecycle, independent of framework, runtime, or policy engine. Combined with ASSERT (Adaptive Spec-driven Scoring for Evaluation and Regression Testing), which converts natural language policies into adversarial test suites, MXC + ACS is Microsoft’s attempt at a full agent security lifecycle: policy-driven containment at the OS level, portable governance across frameworks, and automated testing to verify policies hold under adversarial conditions. The fact that ACS ships with SDK plugins for LangChain, OpenAI Agents SDK, Anthropic Agents SDK, AutoGen, CrewAI, Semantic Kernel, Microsoft.Extensions.AI, and MCP tools suggests Microsoft understands the standard has to be framework-neutral to succeed — even if the enforcement layer favors their OS.

The Take

MXC is the most important infrastructure announcement at Build 2026, and it is not ready yet. Both things are true simultaneously. The architecture is correct — agent containment should be an OS primitive, not a framework afterthought. The partner list is serious. The policy model is clean. And the disclaimer on the GitHub repo is honest: this is not a security boundary today.

What matters is the trajectory. Microsoft is building the stack that lets enterprise IT manage agents through the same tools they already use for devices and identities. If you are building agents that will run on enterprise Windows machines — and statistically, many of you are — the MXC policy model is what your customers’ security teams will expect within 12 months. Learn the JSON policy surface now. Build against process and session isolation for development and testing. But do not promise your users kernel-grade containment until micro-VMs ship and Agent 365 integration is generally available.

The uncomfortable truth is that the best agent sandbox in 2026 still comes with an asterisk. The right response is not to ignore it — it is to adopt the policy model early while keeping your own containment layers in place until Microsoft delivers the enforcement they have promised.

By dennis · Jun 4, 2026 ← all signals