FinOps for AI coding tools

The AI coding meter for every agent you run.

$2,340/mo · across every account, Mac & cloud

Ottto reconciles subscriptions, API usage, credits, quota windows, and cloud bills into one number you can defend. No proxy in the request path, no added latency, no mystery totals. Add another AI tool without adding another thing to watch.

Free to start · no credit card · first numbers in minutes
ACCOUNTS & USAGE CLOUD & CREDITS Claude Max[email protected] · sub$580 Claude Team[email protected] · 3 seats$420 Claude APIpay-as-you-go · sk-…a3f$340 Codex Pro[email protected] · sub$360 Codex Business[email protected] · sub$180 Cursor Pro[email protected] · sub$60 awsBedrockwork · us-east-1 · cloud$280 Vertexwork · gcp-proj · cloud$120 $CreditsAnthropic · promo−$40 $CreditsAWS · activate−$200 $0 this month · all of it reconciled ✓
every agent & terminal you run - captured
Claude Code Codex Cursor Pi Gemini Ghostty iTerm cmux openclaw Hermes + more
Why we built this

We don't want to spend less on AI. We want to scale up without flying blind.

We found ourselves using 3x the AI of last month, across Codex, Claude Code, Pi, Bedrock, Vertex, subscriptions and APIs, on multiple Macs. The bill stopped being background noise. So we keep checking the same things, several times a day:

Where are we losing efficiency to huge context, bad caching, noisy MCP tools, retries, or sessions we reopen after days idle?

Advisor finds it

How much more are we paying now that agents run loops and spawn other agents, instead of us driving every step?

agent vs human

Which agents are expensive but not productive?

cost by tool

How much are we paying in APIs right now, and would a fourth $200 subscription be smarter?

subs vs PAYG

What is our total AI coding cost?

one number
Why now

The bill stopped being background noise.

AI is the fastest-growing line in most engineering budgets - and a single agent task now spends like a small invoice. This isn't a rounding error anymore.

3-10×AI spend is growing faster than any other line in the cloud bill.Flexera · PointFive, 2026
400K-2Mtokens a single agent task can burn - a real slice of a weekly plan, not loose change.agentic coding telemetry, 2026
June 1, 2026GitHub Copilot went usage-based - the flat-rate era is ending across the category.GitHub
One place, reconciled

Not per device. Not per account. One reconciled truth layer.

Every dollar, reconciled. Ottto pulls usage from every agent, every account, on every machine - plus the cloud and your credits - into one trusted view. By tool, model, account, session, repo, and billing path. Subscriptions, API usage and credits, side by side.

  • All your accounts. A personal and a business Codex, two Claude plans + API, all deduped into one number.
  • All your agents and machines. Claude Code, Codex, Cursor, Pi - and the cloud - together, not one at a time.
  • Subscriptions, API and credits. Monthly plans, metered usage, and promo credits in the same picture.
  • Every number labeled. Exact where we know it, estimated where we don't.
  • One glance, the bottom lines. Every subscription and API at a glance - on your Mac or your phone. No tab-switching, no mental math, and a heads-up when you're pacing over.
  • Your data, your agents. The unified data layer is exposed over CLI, MCP, plugins and skills - so your own agent can pull last week's real usage, line it up against your merged PRs, and check whether the spend is turning into shipped code.
Local sessionsyour Macs
Live telemetryaccounts
Connectorsplans · API
Cloud billsAWS/GCP
oto
Reconcilededupe · price · label
Claude · 2 plans + API$1,340exact
Codex · Pro + Business$540exact
awsBedrock · work$280est.
Find the leak

Most of your spend is productive. Some of it buys nothing.

You're not overpaying for speed, that's the point. But some spend buys nothing back: re-paid context from broken caching, a plan on the wrong tier, a 1M window where 200K would do, or output tokens - billed several times higher than input - spent on verbose intermediate chatter no one reads. Ottto separates the spend that makes you faster from the spend that doesn't, and shows you exactly how much - so you can act, like keeping models terse in intermediate steps and verbose only in the final answer.

$620/mo found across your stack
Broken caching in one repo$300/mo
Plan on the wrong tier$140/mo
1M context where 200K would do$110/mo
Idle Codex Business plan$70/mo
Advisor · every day

A short queue of fixes, not another dashboard.

Advisor watches quota windows, model/cache behavior, long-context sessions, agent-spawned work, and plan fit every day. It shows the dollars, the evidence, and the caveats before anything changes.

Set a budget per tool or per person and Ottto pings you on Slack or desktop while you can still act - before the invoice, not after.

Advisor recommendation Quota window reset changed the best place to run work

Codex API spend is rising while a Claude Max window has room.

Impact +$140/moConfidence exact
Route next run to the subscription window Reset detected · room available · nothing changes without review
approve
Approved action recorded$140/mo tracked against real usage
oto
Your daily digest5 things changed · 2 anomalies · today
+$620/moif you approve all
Quota window reset before the scheduled time.

detector Route overflow away from paid API usage before the next reset.

+$140/mo
ReviewSkip
Model cost and cache behavior regressed.

detector Reprice the same traffic with the cache hit-rate drop called out.

+$160/mo
ReviewSkip
Long-context session cost jumped.

detector Spot sessions where 1M context is being used where a smaller window would do.

+$110/mo
Track
Agent-spawned sessions multiplied spend.

detector Separate human-driven work from agents spawning other agents.

+$70/mo
Track
Plan fit recommendation is ready.

autopilot Demote, promote, or switch only when the usage evidence supports it.

+$140/mo
Track
Analyzeevery day
Flaganomalies
Recommendwith the $
You approveone click
Trackreal savings

Not a scary autopilot that moves money on its own. It finds, explains, prices, asks, executes only approved actions, and tracks rollback state.

Subscriptions & limits

Know before you hit the wall.

Every plan and account you pay for, how much you actually use, and when the next limit resets - so you're never surprised. Which plan fits, what you'd save on pay-as-you-go, and when to cancel before a renewal you won't use.

You won't refresh four dashboards at midnight, mid-task. Ottto watches every window - across machines, accounts and subscriptions - and warns you before you hit the wall.

  • Quota resets. "Claude Max - 78% of your 5-hour window. Resets 3:40pm."
  • Alerts before the wall. Set a threshold and get a heads-up before a window resets or a limit hits - not after you're blocked.
  • Renewal warnings. "Codex Business renews Tuesday - idle 3 weeks. Cancel?"
  • Savings vs pay-as-you-go. The cross-account math vendor dashboards never show.
ottto.net/subscriptions
Claude Maxoverpaying
$200/mo41% used
Cursor Progood fit
renews in 4d88% used
Codex Businessrenews Tue · idle
$25/mo6% used
Codex Proresets 3:40pm
5-hr window78% used
Switch Claude Max → Pro + API, cancel the idle Codex Business plan. Same usage, ~$160/mo less.
Provider Impact

The moment a provider changes, you know what it costs you.

Anthropic ships a new default. OpenAI changes pricing. A model gets deprecated, a quota window shifts. Ottto watches provider changes and reconciles each one against your actual usage so a price hike or default swap shows up as "here's what it does to your bill."

  • Real-time alerts. Pricing, defaults, deprecations, quota and outage changes.
  • Tied to your spend. Not "something changed" - "this changed your bill by $X."
Anthropic changed Claude Code's default modeldetected 2 hours ago
LIVE
Your sessions are 18% pricier since the switch.
The new default uses more expensive output tokens for the same work. We caught it the moment your usage shifted.
Affects 2 repos$120 extra this weeksince Tue 9:02am
Pin the previous modelSee impact
Powered by our open feed - AI Provider Watch ↗
Under the hood

Out of your way. Open by default.

No proxy · zero added latency Open source · local-first daemon ChatGPT + MCP · ask where you already work

Never in the middle. Ottto doesn't proxy your requests, limit which tools you use, or touch provider terms - it just collects in parallel.

See how it works →
Why ottto

The one place that sees all of it.

A vendor's dashboard charts one account - it won't reconcile your other plans, and it won't tell you what to change. Free meters watch one machine. Gateways sit in your request path and see only proxied API traffic. Ottto ties every device, agent and account together - and tells you what to do about it.

Free CLIs & menu barsGatewaysVendor dashboardsottto
All your devicesone machineproxied onlyn/ayes
All accounts & agentsone or a fewAPI onlyone toolyes
Subs + API + creditsquota onlyusage onlyits ownall
Reconciles every sourcenononoyes
Real-time provider-change alertsnononoyes
Recommends & helps actnolimitsnoyes
In your request pathneveryesn/anever
Questions

The things you're wondering.

QCan't I just check my usage myself?

You could - across every tool, account and machine. But you won't, not mid-task when you're deep in a problem, which is exactly when a limit hits or a bill creeps up. Ottto watches the windows for you and flags it before the wall.

QDoes Ottto slow down my coding tools?

No. Ottto is out of your request path - it reads telemetry your tools already emit. Zero added latency, and nothing breaks if Ottto is down.

QCan you see my code or prompts?

Ottto is built around usage, cost, and limit signals - not your source code. You choose what each source shares, and local data stays labeled and under your control.

QMultiple accounts and plans?

Yes - that's the point. A Pro and a Business Codex, two Claude plans, an API key and some credits all reconcile into one deduped picture across every Mac.

QDo I need an API key?

No. Ottto works with subscriptions too - Pro, Max, and the rest. Subscriptions, API and credits, all first-class.

QIs my subscription safe?

Yes. Ottto observes; it never proxies your requests, never holds your login, never limits which tools you use, and never touches provider terms - it just collects in parallel. So it isn't affected by the gateway restrictions vendors introduced in 2026.

QWhat is free and open source?

You can start free, no credit card. The runtime and provider feed are open source: ottto and ai-provider-watch.

Why I'm building Ottto

"Not because I want to slow down. Because I want to scale up without flying blind."

The bill stopped being background noise, so I'm building the tool I kept wishing I had: open-source building blocks for the basics, and one product for the actionable bottom line. Usage and cost intelligence for AI coding agents, so you can push harder, not slower.

oto
Ronbuilding Ottto
Push harder, not slower.

See your real numbers in minutes.

No sales call. No proxy. No slowdown. Install the app, connect your accounts, and watch every agent, machine and credit reconcile into one picture.

Free to start · no credit card · Apple Silicon · macOS Sonoma+