[release] 6 min · Jun 11, 2026

New Relic AI Coding Observability — Your IDE Is Now a Cost Center

New Relic moves APM monitoring into the editor with open-source AI Coding Observability for Claude Code, Cursor, Copilot, Windsurf, and Amazon Q. Free — for now.

#observability#ai-coding#devtools#cost-management#opentelemetry

Eighty-one percent of enterprise leaders report increased production issues tied to AI-generated code, according to CloudBees’ 2026 State of Code Abundance Report. Yet not a single one of the tools generating that code — Claude Code, Cursor, Copilot, Windsurf, Amazon Q — ships with built-in cost attribution, quality correlation, or even basic spend tracking. On June 8, New Relic announced AI Coding Observability, an open-source feature shipping June 23 that extends production-grade APM monitoring directly into the IDE, covering all five assistants from a single vendor-neutral dashboard built on OpenTelemetry and Model Context Protocol. The observability perimeter just moved left, from staging into the editor, and the timing is not accidental.

TL;DR

  • What: New Relic ships open-source AI Coding Observability on June 23 — APM-grade monitoring for five major coding assistants from one dashboard
  • Why it matters: AI coding tools are unmonitored cost centers with zero correlation to production outcomes — 81% of enterprise leaders report increased production issues from AI-generated code
  • The catch: “No additional cost” means telemetry is subject to New Relic’s consumption-based ingest pricing — model your data volume before deploying org-wide
  • Action: Evaluate for cost visibility, but read the data retention terms before instrumenting your org

What Happened

Every team I talk to has the same confession: they know their Cursor and Claude Code seats are expensive, but they have no idea which engineers are burning tokens on what, or whether those tokens produce code that survives review. AI coding assistants operate entirely outside traditional APM stacks. No traces. No spans. No cost attribution. Teams running Claude Code, Cursor, and Copilot in parallel have three unmonitored cost centers with zero correlation to production outcomes.

New Relic’s answer is straightforward: instrument the IDE the same way you instrument production. The feature normalizes telemetry across all five supported assistants into a unified dashboard. You get token spend tracking, budget threshold alerts, and — critically — the ability to correlate AI-assisted code with what actually ships. The architecture grounds itself in OpenTelemetry and MCP rather than proprietary integrations per tool, which means the instrumentation layer is at least theoretically portable.

Two details matter for procurement conversations. First, the feature ships at no additional cost beyond standard New Relic ingest rates — but “standard ingest rates” is doing heavy lifting. New Relic uses consumption-based pricing, and continuous IDE telemetry across an engineering org generates high-cardinality data. Model the expected data volume against your current ingest bill before treating this as free. Second, a local-only / zero-outbound mode runs queries entirely within the user’s private network — prompts never leave. That second point is the difference between “interesting demo” and “viable for regulated industries” in banking, healthcare, and federal contexts.

Why This Matters

The observability gap here is structural, not accidental. Traditional APM evolved around a clear boundary: code gets written, then deployed, then monitored. AI coding assistants shattered that boundary. They generate code inside the editor, consume tokens against API keys that finance teams barely track, and produce output whose quality nobody measures until it breaks in production. CloudBees’ 2026 State of Code Abundance Report puts a number on the damage: 81% of enterprise leaders report an increase in production issues tied to AI-generated code. The same report documents “token anxiety” as finance teams struggle to forecast AI spend.

What New Relic is doing is not technically novel — OpenTelemetry instrumentation is well-understood infrastructure. The insight is applying it to a domain that has been completely unmonitored. Think of it this way: if your production database had zero observability, no cost tracking, no performance metrics, and no way to attribute resource consumption to specific teams, you would call that negligent. That is the current state of AI coding tool adoption at most organizations.

The vendor-neutral approach matters more than it appears. The AI coding tool mix is fragmenting fast — teams do not standardize on one assistant. They run Cursor for frontend work, Claude Code for backend reasoning, Copilot because it came with the GitHub Enterprise license. Each tool has its own billing model, its own token accounting, its own opacity. The practical consequence is that finance teams receive three separate invoices with no common unit of measurement and no way to compare cost-per-outcome across tools. New Relic’s bet is that whoever provides the normalized view across all of them becomes essential infrastructure. That is a classic platform play: own the observation layer, and you become harder to remove than any individual tool you observe.

The open-source framing is part of this strategy. By releasing the instrumentation layer as open source, New Relic lowers the adoption barrier to zero and accelerates data ingest. More telemetry flowing through New Relic means a stickier relationship and more opportunities to upsell the rest of the APM stack. This is not cynicism — it is the standard infrastructure playbook, and it works because the free tier genuinely solves a real problem.

“No additional cost beyond standard ingest rates” does not mean free. New Relic’s pricing is consumption-based, and AI coding telemetry is high-cardinality data. Continuous IDE instrumentation across fifty engineers could materially increase your monthly ingest bill. Model the data volume before deploying org-wide — request a cost estimate from New Relic based on your team size and expected telemetry throughput.

The local-only mode deserves specific attention. Regulated industries are the highest-value segment for this feature because they have the strictest compliance requirements and the least visibility into AI tool usage. A developer running Claude Code inside a bank’s network today is essentially operating an unaudited, unmonitored system that processes proprietary code. The zero-outbound mode makes AI Coding Observability the first tool that can provide governance without requiring prompts to traverse external infrastructure. That is a procurement unlock, not just a feature.

If your organization runs multiple AI coding assistants, start by deploying AI Coding Observability on a single team for two weeks. Compare actual token spend against what finance is being invoiced. The delta will tell you whether org-wide rollout is worth the instrumentation effort.

The Take

I think New Relic made exactly the right call at exactly the right moment. The observability perimeter has always started where code ended — in staging or production. Moving it into the editor acknowledges a reality that the rest of the monitoring industry is still ignoring: the IDE is now a cost center, and an unmonitored one at that.

The strategic angle is data acquisition, and you should understand this clearly before deploying. Free observability for your IDE means New Relic gets telemetry on your entire development workflow — not just your production stack. That is a meaningful expansion of their data moat. They will know which tools your engineers use, how many tokens they burn, what patterns of code generation correlate with production incidents. That intelligence is extraordinarily valuable, and the “no additional cost” framing — subject to consumption-based ingest pricing that could grow substantially — is how they get you to hand it over voluntarily.

This does not mean you should avoid the tool. The problem it solves is real, and nobody else is solving it at this scope. But go in with clear eyes. Read the data retention policies. Understand what telemetry leaves your network in the default mode versus local-only mode. Negotiate the ingest rate implications before instrumenting fifty engineers.

The risk is the one that accompanies every free infrastructure play: “no additional cost” has a shelf life. Once your organization depends on AI Coding Observability for budget governance and compliance reporting, removing it becomes expensive. New Relic knows this. You should too.

Evaluate it. Deploy it on one team. Measure the cost visibility it provides against the telemetry it consumes. But do not instrument your entire IDE fleet on day one without reading the fine print. That is how you trade one blind spot for another.