FinOps for AI coding tools

The AI coding meter for every agent you run.

$2,340/mo · across every account, Mac & cloud

Ottto reconciles subscriptions, API usage, credits, quota windows, and cloud bills into one number you can defend. No proxy in the request path, no added latency, no mystery totals. Add another AI tool without adding another thing to watch.

Start free →Get the Mac app

Free to start · no credit card · first numbers in minutes

every agent & terminal you run - captured

Claude Code

Codex

Cursor

Gemini

Ghostty

iTerm

cmux

openclaw

Hermes + more

Why we built this

We don't want to spend less on AI. We want to scale up without flying blind.

We found ourselves using 3x the AI of last month, across Codex, Claude Code, Pi, Bedrock, Vertex, subscriptions and APIs, on multiple Macs. The bill stopped being background noise. So we keep checking the same things, several times a day:

Where are we losing efficiency to huge context, bad caching, noisy MCP tools, retries, or sessions we reopen after days idle?

Advisor finds it

How much more are we paying now that agents run loops and spawn other agents, instead of us driving every step?

agent vs human

Which agents are expensive but not productive?

cost by tool

How much are we paying in APIs right now, and would a fourth $200 subscription be smarter?

subs vs PAYG

What is our total AI coding cost?

one number

Why now

The bill stopped being background noise.

AI is the fastest-growing line in most engineering budgets - and a single agent task now spends like a small invoice. This isn't a rounding error anymore.

3-10×AI spend is growing faster than any other line in the cloud bill.Flexera · PointFive, 2026

400K-2Mtokens a single agent task can burn - a real slice of a weekly plan, not loose change.agentic coding telemetry, 2026

June 1, 2026GitHub Copilot went usage-based - the flat-rate era is ending across the category.GitHub

One place, reconciled

Not per device. Not per account. One reconciled truth layer.

Every dollar, reconciled. Ottto pulls usage from every agent, every account, on every machine - plus the cloud and your credits - into one trusted view. By tool, model, account, session, repo, and billing path. Subscriptions, API usage and credits, side by side.

✓All your accounts. A personal and a business Codex, two Claude plans + API, all deduped into one number.
✓All your agents and machines. Claude Code, Codex, Cursor, Pi - and the cloud - together, not one at a time.
✓Subscriptions, API and credits. Monthly plans, metered usage, and promo credits in the same picture.
✓Every number labeled. Exact where we know it, estimated where we don't.
✓One glance, the bottom lines. Every subscription and API at a glance - on your Mac or your phone. No tab-switching, no mental math, and a heads-up when you're pacing over.
✓Your data, your agents. The unified data layer is exposed over CLI, MCP, plugins and skills - so your own agent can pull last week's real usage, line it up against your merged PRs, and check whether the spend is turning into shipped code.

Local sessionsyour Macs

Live telemetryaccounts

Connectorsplans · API

Cloud billsAWS/GCP

→

oto

Reconcilededupe · price · label

Claude · 2 plans + API$1,340exact

Codex · Pro + Business$540exact

awsBedrock · work$280est.

Find the leak

Most of your spend is productive. Some of it buys nothing.

You're not overpaying for speed, that's the point. But some spend buys nothing back: re-paid context from broken caching, a plan on the wrong tier, a 1M window where 200K would do, or output tokens - billed several times higher than input - spent on verbose intermediate chatter no one reads. Ottto separates the spend that makes you faster from the spend that doesn't, and shows you exactly how much - so you can act, like keeping models terse in intermediate steps and verbose only in the final answer.

See what you'd save →

$620/mo found across your stack

Broken caching in one repo$300/mo

Plan on the wrong tier$140/mo

1M context where 200K would do$110/mo

Idle Codex Business plan$70/mo

Advisor · every day

A short queue of fixes, not another dashboard.

Advisor watches quota windows, model/cache behavior, long-context sessions, agent-spawned work, and plan fit every day. It shows the dollars, the evidence, and the caveats before anything changes.

Set a budget per tool or per person and Ottto pings you on Slack or desktop while you can still act - before the invoice, not after.

Advisor recommendation Quota window reset changed the best place to run work

Codex API spend is rising while a Claude Max window has room.

Impact +$140/moConfidence exact

Route next run to the subscription window Reset detected · room available · nothing changes without review

approve

✓

Approved action recorded$140/mo tracked against real usage

oto

Your daily digest5 things changed · 2 anomalies · today

+$620/moif you approve all

⟳

Quota window reset before the scheduled time.

detector Route overflow away from paid API usage before the next reset.

+$140/mo

ReviewSkip

⚡

Model cost and cache behavior regressed.

detector Reprice the same traffic with the cache hit-rate drop called out.

+$160/mo

ReviewSkip

◱

Long-context session cost jumped.

detector Spot sessions where 1M context is being used where a smaller window would do.

+$110/mo

Track

⌗

Agent-spawned sessions multiplied spend.

detector Separate human-driven work from agents spawning other agents.

+$70/mo

Track

↧

Plan fit recommendation is ready.

autopilot Demote, promote, or switch only when the usage evidence supports it.

+$140/mo

Track

Analyzeevery day

→

Flaganomalies

→

Recommendwith the $

→

You approveone click

→

Trackreal savings

Not a scary autopilot that moves money on its own. It finds, explains, prices, asks, executes only approved actions, and tracks rollback state.

Subscriptions & limits

Know before you hit the wall.

Every plan and account you pay for, how much you actually use, and when the next limit resets - so you're never surprised. Which plan fits, what you'd save on pay-as-you-go, and when to cancel before a renewal you won't use.

You won't refresh four dashboards at midnight, mid-task. Ottto watches every window - across machines, accounts and subscriptions - and warns you before you hit the wall.

✓Quota resets. "Claude Max - 78% of your 5-hour window. Resets 3:40pm."
✓Alerts before the wall. Set a threshold and get a heads-up before a window resets or a limit hits - not after you're blocked.
✓Renewal warnings. "Codex Business renews Tuesday - idle 3 weeks. Cancel?"
✓Savings vs pay-as-you-go. The cross-account math vendor dashboards never show.

ottto.net/subscriptions

Claude Maxoverpaying

$200/mo41% used

Cursor Progood fit

renews in 4d88% used

Codex Businessrenews Tue · idle

$25/mo6% used

Codex Proresets 3:40pm

5-hr window78% used

↓Switch Claude Max → Pro + API, cancel the idle Codex Business plan. Same usage, ~$160/mo less.

Provider Impact

The moment a provider changes, you know what it costs you.

Anthropic ships a new default. OpenAI changes pricing. A model gets deprecated, a quota window shifts. Ottto watches provider changes and reconciles each one against your actual usage so a price hike or default swap shows up as "here's what it does to your bill."

✓Real-time alerts. Pricing, defaults, deprecations, quota and outage changes.
✓Tied to your spend. Not "something changed" - "this changed your bill by $X."

⚡

Anthropic changed Claude Code's default modeldetected 2 hours ago

LIVE

Your sessions are 18% pricier since the switch.

The new default uses more expensive output tokens for the same work. We caught it the moment your usage shifted.

Affects 2 repos$120 extra this weeksince Tue 9:02am

Pin the previous modelSee impact

Under the hood

Out of your way. Open by default.

No proxy · zero added latency Open source · local-first daemon ChatGPT + MCP · ask where you already work

Never in the middle. Ottto doesn't proxy your requests, limit which tools you use, or touch provider terms - it just collects in parallel.

See how it works →

Why ottto

The one place that sees all of it.

A vendor's dashboard charts one account - it won't reconcile your other plans, and it won't tell you what to change. Free meters watch one machine. Gateways sit in your request path and see only proxied API traffic. Ottto ties every device, agent and account together - and tells you what to do about it.

	Free CLIs & menu bars	Gateways	Vendor dashboards	ottto
All your devices	one machine	proxied only	n/a	yes
All accounts & agents	one or a few	API only	one tool	yes
Subs + API + credits	quota only	usage only	its own	all
Reconciles every source	no	no	no	yes
Real-time provider-change alerts	no	no	no	yes
Recommends & helps act	no	limits	no	yes
In your request path	never	yes	n/a	never

Questions

The things you're wondering.

QCan't I just check my usage myself?

You could - across every tool, account and machine. But you won't, not mid-task when you're deep in a problem, which is exactly when a limit hits or a bill creeps up. Ottto watches the windows for you and flags it before the wall.

QDoes Ottto slow down my coding tools?

No. Ottto is out of your request path - it reads telemetry your tools already emit. Zero added latency, and nothing breaks if Ottto is down.

QCan you see my code or prompts?

Ottto is built around usage, cost, and limit signals - not your source code. You choose what each source shares, and local data stays labeled and under your control.

QMultiple accounts and plans?

Yes - that's the point. A Pro and a Business Codex, two Claude plans, an API key and some credits all reconcile into one deduped picture across every Mac.

QDo I need an API key?

No. Ottto works with subscriptions too - Pro, Max, and the rest. Subscriptions, API and credits, all first-class.

QIs my subscription safe?

Yes. Ottto observes; it never proxies your requests, never holds your login, never limits which tools you use, and never touches provider terms - it just collects in parallel. So it isn't affected by the gateway restrictions vendors introduced in 2026.

QWhat is free and open source?

You can start free, no credit card. The runtime and provider feed are open source: ottto and ai-provider-watch.

Why I'm building Ottto

"Not because I want to slow down. Because I want to scale up without flying blind."

The bill stopped being background noise, so I'm building the tool I kept wishing I had: open-source building blocks for the basics, and one product for the actionable bottom line. Usage and cost intelligence for AI coding agents, so you can push harder, not slower.

oto

Ronbuilding Ottto

Push harder, not slower.

See your real numbers in minutes.

No sales call. No proxy. No slowdown. Install the app, connect your accounts, and watch every agent, machine and credit reconcile into one picture.

Start free →Get the Mac app

Free to start · no credit card · Apple Silicon · macOS Sonoma+