Why did your AI coding cost that much?

Nomira is open-source cost forensics for AI coding assistants. It reads the logs you already have and explains where the money went — cache-aware, multi-provider, and without your prompts ever leaving your machine.

Run it in 2 minutes · See the proof

$ python nomira.py # your newest Claude Code / Codex session

The cost isn't where you think

$10,231

true cost across 44 local sessions

~$970

what a naive input+output tool reports: roughly 10× too low

91%

of the cost is cache and reasoning tokens that other tools ignore

Most tools price only visible input/output. Real spend lives in cache reads, cache writes (5-min vs 1-hour), and reasoning tokens — priced differently by every provider. Getting this right is the whole point. We call it the auditor.
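To make the gap concrete, here is a minimal sketch of pricing one turn two ways: input+output only, and with every token kind included. The rates, the field names, and the `Usage` class are illustrative assumptions, not Nomira's rate table or any provider's published prices; some providers also report reasoning tokens inside output rather than separately.

```python
from dataclasses import dataclass

# Illustrative USD rates per million tokens. Placeholder numbers only:
# not a real provider's prices and not Nomira's rate table.
RATES_PER_MTOK = {
    "input": 3.00,
    "output": 15.00,
    "cache_read": 0.30,
    "cache_write_5m": 3.75,
    "cache_write_1h": 6.00,
    "reasoning": 15.00,
}


@dataclass
class Usage:
    """Token counts for one turn (hypothetical field names)."""
    input: int = 0
    output: int = 0
    cache_read: int = 0
    cache_write_5m: int = 0
    cache_write_1h: int = 0
    reasoning: int = 0


def cost(usage, kinds):
    """Sum the USD cost of the given token kinds."""
    return sum(getattr(usage, k) * RATES_PER_MTOK[k] / 1_000_000 for k in kinds)


def naive_cost(usage):
    # What an input+output-only tool would report.
    return cost(usage, ("input", "output"))


def true_cost(usage):
    # Every token kind, each at its own rate.
    return cost(usage, RATES_PER_MTOK)


turn = Usage(input=2_000, output=1_000, cache_read=400_000,
             cache_write_5m=50_000, reasoning=4_000)
print(f"naive ${naive_cost(turn):.4f}  true ${true_cost(turn):.4f}")
```

With these made-up numbers the visible input and output are a small slice of the turn; almost all of the cost sits in the cache and reasoning lines.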

Three pillars

Forensic

Not "how much this month" — why this turn, conversation, or developer cost what it did. Plus waste signals: retry loops, repeated reads, cache rebuilt instead of reused.
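One way a "cache rebuilt instead of reused" signal could be detected from token counts alone is sketched below. The heuristic, the threshold, and the field names (`cache_write`, `cache_read`) are assumptions made for illustration; Nomira's actual waste detection may work differently.

```python
def cache_rebuilds(turns, min_write=10_000):
    """Return indexes of turns that look like a cache rebuild.

    Hypothetical heuristic: after the first turn, a turn that writes a
    large cache while reading back less than it writes probably rebuilt
    the cache instead of reusing it.
    """
    flagged = []
    for i, turn in enumerate(turns):
        if i == 0:
            continue  # the first turn has to build the cache
        write, read = turn["cache_write"], turn["cache_read"]
        if write >= min_write and read < write:
            flagged.append(i)
    return flagged


session = [
    {"cache_write": 40_000, "cache_read": 0},       # initial build
    {"cache_write": 500, "cache_read": 40_000},     # healthy reuse
    {"cache_write": 41_000, "cache_read": 0},       # rebuilt from scratch
    {"cache_write": 800, "cache_read": 41_000},     # healthy reuse
]
print(cache_rebuilds(session))
```

Only token counts are needed here, so a check like this stays compatible with usage-only data.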

Accurate

Cache-aware, multi-provider (Anthropic, OpenAI/Codex, Gemini), versioned rates, reconcilable against your invoice. Unknown models are flagged, never guessed.
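The idea of versioned rates with unknown models flagged rather than guessed can be sketched as a lookup keyed by model and effective date. The model name, dates, prices, and the `UnknownModel` exception are all hypothetical; this is not Nomira's rate table or API.

```python
from datetime import date

# model -> list of (effective_from, USD per million input tokens).
# Hypothetical model name and prices, for illustration only.
RATE_VERSIONS = {
    "example-model": [
        (date(2024, 1, 1), 4.00),
        (date(2025, 1, 1), 3.00),
    ],
}


class UnknownModel(LookupError):
    """Raised instead of guessing a price for an unrecognised model."""


def input_rate(model, on):
    """Return the input rate in effect for `model` on the date `on`."""
    versions = RATE_VERSIONS.get(model)
    if versions is None:
        raise UnknownModel(model)
    applicable = [(start, rate) for start, rate in versions if start <= on]
    if not applicable:
        raise UnknownModel(f"{model} has no rate effective on {on}")
    # The latest version that had already started on that date wins.
    return max(applicable)[1]


print(input_rate("example-model", date(2024, 6, 1)))
print(input_rate("example-model", date(2025, 3, 1)))
```

Raising on an unknown model, rather than falling back to a default price, keeps totals reconcilable: anything unpriced is visible as a gap instead of hidden inside a guess.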

Private

Token counts and business tags only — never prompt or response content. Schema-enforced. Self-hosted. No proxy in your request path.
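Schema enforcement of this kind usually comes down to an allowlist: an event may carry only known usage fields, so content has nowhere to go. The sketch below shows the idea; the field names and the `validate_event` function are assumptions for illustration, not Nomira's actual schema.

```python
# Hypothetical allowlist: token counts and business tags only.
ALLOWED_FIELDS = {
    "session_id", "model", "project",
    "input_tokens", "output_tokens", "cache_read_tokens",
}


def validate_event(event):
    """Return the event unchanged, or raise if it could carry content."""
    extra = set(event) - ALLOWED_FIELDS
    if extra:
        raise ValueError(f"fields not allowed in a usage event: {sorted(extra)}")
    for name, value in event.items():
        if name.endswith("_tokens"):
            # Counts must be plain non-negative integers (bool is excluded).
            is_count = isinstance(value, int) and not isinstance(value, bool)
            if not is_count or value < 0:
                raise ValueError(f"{name} must be a non-negative integer")
    return event


ok = validate_event({"session_id": "s1", "model": "example-model",
                     "input_tokens": 1200, "output_tokens": 300})
print(ok)
```

An event that includes a `prompt` or `response` key is rejected outright rather than silently stripped, which makes accidental leaks loud.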

How it works

1 · Analyze

python nomira.py reads a local Claude Code or Codex session and shows top-cost turns, the cache gap, and waste.

2 · Compare

--compare --by-project ranks efficiency: $/call, $/1k output tokens, cache-reuse %. Why does project A burn 5× what project B does?

3 · Team

--ship then --serve for a self-hosted dashboard. Usage-only events; content stays home.

Python 3.9+, standard library only. No account, no install, no data egress.
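The three efficiency metrics named in the Compare step could be computed roughly as follows. The record fields are hypothetical, and "cache-reuse %" is defined here as cache-read tokens over all input-side tokens, which is an assumption rather than Nomira's documented formula.

```python
def efficiency(calls):
    """Summarise a project's calls into the three comparison metrics.

    Each call is a dict with hypothetical keys: cost_usd, output_tokens,
    input_tokens (uncached) and cache_read_tokens.
    """
    total_cost = sum(c["cost_usd"] for c in calls)
    output = sum(c["output_tokens"] for c in calls)
    fresh = sum(c["input_tokens"] for c in calls)
    cached = sum(c["cache_read_tokens"] for c in calls)
    return {
        "usd_per_call": total_cost / len(calls) if calls else None,
        "usd_per_1k_output": total_cost / (output / 1000) if output else None,
        # Assumed definition: share of input-side tokens served from cache.
        "cache_reuse_pct": 100 * cached / (cached + fresh) if cached + fresh else None,
    }


project_a = [
    {"cost_usd": 0.30, "output_tokens": 1000,
     "input_tokens": 10_000, "cache_read_tokens": 90_000},
    {"cost_usd": 0.10, "output_tokens": 1000,
     "input_tokens": 30_000, "cache_read_tokens": 70_000},
]
print(efficiency(project_a))
```

Computing the same summary per project and sorting by any one metric gives the kind of ranking the Compare step describes.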

Where Nomira fits

	Langfuse / Helicone	CloudZero / Finout	Nomira
Built for	production apps	enterprise FinOps	coding assistants
Answers	what happened in the call	what the org spent	why THIS cost so much
Open / self-host	yes	no	yes
Sees your content	via proxy/trace	via invoices	never
Cache-accurate	partial	invoice-level	core

We don't claim the market is empty — it isn't. Our wedge is forensics + coding assistants + privacy + accuracy.