Back to blog
Agency Economics

Claude Code Costs $150–250/Developer/Month. Here's Why Your Agency Is Eating It.

Anthropic publishes the number openly: the average Claude Code user costs $13 per active developer per day, or $150–250 per developer per month across enterprise deployments.

For a dev agency with eight developers, that's up to $2,000 a month in AI coding costs — before you've looked at Cursor, Cline, or anything else.

Most agency owners know this number roughly. What they don't know is where it goes.

That's not a monitoring problem. It's a structural one. And it costs agencies real margin every month.


The Invoice Tells You Nothing Useful

Your Anthropic Console shows total spend. Your Cursor billing page shows total spend. Neither tells you:

One number, no attribution. That's the entire picture most agencies are working from.

This isn't Anthropic's fault — they bill by workspace, not by project or client. It's a tool built for a single organization tracking its own costs, not a billing layer for agencies serving multiple clients simultaneously. The gap between what Anthropic shows you and what you need to run a profitable agency is significant.


The Fixed-Price Contract Problem

For agencies billing time-and-materials, unattributed AI costs are annoying. You can roughly estimate them per project, adjust hourly rates, and absorb the rest.

For agencies on fixed-price contracts — which is most dev agencies doing product work — unattributed AI costs are a direct margin hit.

Here's the math. You quote a project at $28,000. Your internal estimate assumed $400 in AI tooling costs. The project runs long, the scope creeps, one developer spends three weeks in deep debugging spirals with Claude Code running constantly. The actual AI cost for that project: $1,200.

That $800 difference doesn't appear anywhere. It doesn't show up on the invoice as an overrun. It doesn't trigger a change order conversation. It comes straight out of your margin — silently, after the fact, when you reconcile the month.

Multiply that across six projects running simultaneously with eight developers, and the unattributed AI spend isn't a rounding error. It's a budget line that doesn't exist on your P&L but exists on your Anthropic invoice.

This is what TokenWatch is built to recover. By attributing every Claude Code, Cursor, and Cline session to the developer, project, and client that drove it — automatically, via git context — agencies can identify which costs are recoverable before the invoice goes out. Not as an estimate. As a verified line item.


Why Developers Can't Tell You Either

The instinct most agency owners have when they see an unexpected AI bill: ask the team.

The honest answer you'll get back: "I'm not sure. I was working on BuildFast most of the week but also spent some time helping Jordan with the NovaSpark API migration."

This isn't developers being evasive. It's the nature of how AI coding tools work. Sessions bleed across tasks. A developer opens Claude Code to debug a BuildFast issue, the context grows, they context-switch to NovaSpark without clearing, and the session cost is now split across two projects with no record of the split.

Git doesn't track AI usage. Calendar blocks don't track AI usage. Anthropic's console doesn't track AI usage per project. The data to answer "who spent what on which client" simply doesn't exist anywhere — unless you instrument for it.


The $13/Day Number Is an Average. Your Worst Projects Are Higher.

Anthropic's $13/day figure is an enterprise average. Ninety percent of users stay below $30 per active day. That ceiling matters.

The developers who hit $30/day aren't evenly distributed across your client roster. They're concentrated on the hardest projects — legacy codebases, poorly scoped features, complex integrations, anything that generates long debugging sessions with Claude Code running in agentic mode.

Those are also, frequently, the fixed-price projects. The projects where scope uncertainty was highest at quoting time and where you absorbed risk in exchange for a cleaner client relationship.

The developer who spent $28/day for two weeks on the legacy migration isn't an anomaly. They're the reason your margin on that contract was 2% instead of 14%.

Without per-project cost data, you can't see this until the project is done and the damage is permanent. With it, you can flag the spike mid-project, have a conversation with the client, and either add a change order or make a deliberate decision to absorb it — instead of discovering it in retrospect.


What "Recoverable Revenue" Actually Means

The framing most tools use for AI cost tracking is efficiency: find waste, reduce spend, optimize prompts.

That's the wrong frame for an agency.

You don't need to spend less on AI. Your developers are more productive because of it. The goal isn't to reduce AI usage — it's to stop subsidizing your clients' AI usage out of your own margin.

When a developer spends $840 in Claude Code sessions on a client project, that cost is as real and attributable as the hours they worked. The client benefited from it. On a time-and-materials contract, that's a line item. On a fixed-price contract, it's at minimum a data point you can use to requote the next phase more accurately.

The agencies that figure this out first will requote fixed-price work more accurately, recover costs they're currently absorbing, and have real data for client conversations about AI usage — instead of apologetic guesses.

The agencies that don't will keep eating the cost, wondering why margins on their most complex projects keep compressing.


The Practical Gap: What You Need vs What Exists

To recover AI costs from client work, you need three things:

Attribution — which developer session maps to which project and client. This has to be automatic. If it requires developers to tag anything manually, it won't be consistent enough to use for billing.

Aggregation — per-client cost totals across Claude Code, Cursor, and Cline, rolled up by billing period. The output needs to be something you can attach to an invoice or use in a client conversation.

Timing — you need to see this before you send the invoice, not after. Cost spikes need to be flagged in the current month, not discovered in reconciliation.

Anthropic's console gives you none of these. Cursor's billing page gives you none of these. A spreadsheet where developers manually log sessions gives you an inconsistent approximation of the first two and nothing on the third.

TokenWatch is built specifically for this gap. Attribution is automatic via git remote URL and git config identity — no developer workflow changes. The dashboard shows per-client cost totals across all three major AI coding tools. Cost spikes are flagged in real time, with the developer and project responsible identified.

The output is a per-client summary you can review before every invoice cycle — and attach directly to the invoice when the costs are recoverable.


What This Looks Like in Practice

A typical TokenWatch billing report for May shows eight projects, eleven developers, $17,340 in total AI spend. Of that, $14,820 is attributable to client projects and recoverable. One spike is flagged: a single developer on one project running at 3.1× their 30-day average — worth reviewing before the invoice goes out.

Without attribution tooling, that report is a single number: $17,340. No breakdown, no spike flag, no recovery.

The $14,820 in recoverable costs is either billed to clients or absorbed into your margin. The difference between those two outcomes is whether you had the data at invoice time.


The Number to Keep in Mind

$150–250 per developer per month. Eight developers. $2,000/month in AI coding costs, growing as the tools improve and your team uses them more.

How much of that is sitting on fixed-price client contracts right now?

If you don't know, you're eating it.


TokenWatch
Stop absorbing AI costs your clients should be paying.
Attribution via git context. Per-client reports. No spreadsheets.
Get Early Access →