Question 1

How does Anthropic API billing work?

Accepted Answer

Anthropic charges separately for input tokens and output tokens. Input tokens include your system prompt, conversation history, and any tool definitions you pass. Output tokens are the model's response. Prompt caching adds two more billing categories: cache write tokens (billed on the call that creates a cached block) and cache read tokens (billed on later calls that hit the cache). Cache reads are typically 90% cheaper than standard input tokens, so heavy caching significantly reduces bills for workloads with repeated context.
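As a rough illustration of the four billing categories, the sketch below adds up the cost of a single request. The `Rates` and `Usage` classes and the `request_cost` function are hypothetical names made up for this example. The Haiku input, output, and cache read rates are the ones quoted on this page; the cache write rate is an assumption (1.25× the input rate) and should be checked against current pricing.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Rates:
    """Prices in USD per million tokens for one model."""
    input_per_m: float
    output_per_m: float
    cache_write_per_m: float
    cache_read_per_m: float


@dataclass(frozen=True)
class Usage:
    """Token counts for one request, one field per billing category."""
    input_tokens: int = 0
    output_tokens: int = 0
    cache_write_tokens: int = 0
    cache_read_tokens: int = 0


def request_cost(usage: Usage, rates: Rates) -> float:
    """Return the cost in USD: each category is billed at its own rate."""
    total_per_million = (
        usage.input_tokens * rates.input_per_m
        + usage.output_tokens * rates.output_per_m
        + usage.cache_write_tokens * rates.cache_write_per_m
        + usage.cache_read_tokens * rates.cache_read_per_m
    )
    return total_per_million / 1_000_000


# Haiku rates as quoted on this page.
# cache_write_per_m is an ASSUMPTION (1.25x input), not taken from this page.
HAIKU = Rates(
    input_per_m=0.80,
    output_per_m=4.00,
    cache_write_per_m=1.00,
    cache_read_per_m=0.08,
)

# A request with 2,000 fresh input tokens, 500 output tokens,
# and a 10,000-token prompt block served from the cache.
usage = Usage(input_tokens=2_000, output_tokens=500, cache_read_tokens=10_000)
print(f"${request_cost(usage, HAIKU):.4f}")
```

Here the cached block contributes less to the bill than the 2,000 uncached input tokens do, which is the effect the 90% read discount is meant to produce.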

Question 2

Which Claude model is the cheapest for production use?

Accepted Answer

Of the models compared here, Claude Haiku 3.5 is the cheapest at $0.80/M input tokens and $4/M output tokens. It's ideal for classification, routing, and lightweight reasoning tasks. Claude Sonnet 4.5 costs more but handles complex code generation and multi-step reasoning better. A common cost-optimization pattern is to route simple sub-tasks to Haiku and hard tasks to Sonnet, which can reduce average cost by 40–70% without sacrificing quality on the tasks that matter.
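The routing pattern can be sketched in a few lines. Everything here is illustrative: the task labels, the `pick_model` and `blended_input_rate` helpers, and the short model names are hypothetical, and the calculation looks only at input-token prices ($0.80/M for Haiku, $3.00/M for Sonnet as quoted on this page), so real savings will also depend on output tokens and caching.

```python
HAIKU_INPUT_PER_M = 0.80   # USD per million input tokens, from this page
SONNET_INPUT_PER_M = 3.00  # USD per million input tokens, from this page

# Hypothetical labels for tasks considered simple enough for the cheaper model.
SIMPLE_TASKS = {"classification", "routing"}


def pick_model(task_kind: str) -> str:
    """Send simple tasks to the cheap model and everything else to the larger one."""
    return "haiku" if task_kind in SIMPLE_TASKS else "sonnet"


def blended_input_rate(haiku_share: float) -> float:
    """Average input price when `haiku_share` of input tokens go to Haiku."""
    if not 0.0 <= haiku_share <= 1.0:
        raise ValueError("haiku_share must be between 0 and 1")
    return haiku_share * HAIKU_INPUT_PER_M + (1.0 - haiku_share) * SONNET_INPUT_PER_M


def savings_vs_all_sonnet(haiku_share: float) -> float:
    """Fractional reduction in input cost compared with sending everything to Sonnet."""
    return 1.0 - blended_input_rate(haiku_share) / SONNET_INPUT_PER_M


print(pick_model("classification"), pick_model("code_generation"))
print(f"{savings_vs_all_sonnet(0.7):.0%} cheaper with 70% of tokens on Haiku")
```

Under these assumptions, routing 70% of input tokens to Haiku cuts the average input price from $3.00/M to $1.46/M, roughly a 51% reduction.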

Question 3

How much does prompt caching save on Anthropic API costs?

Accepted Answer

Prompt cache reads cost $0.08/M tokens for Haiku and $0.30/M for Sonnet, compared to $0.80/M and $3.00/M for standard input tokens. That is a 10× reduction. For a 10,000-token system prompt sent on 1,000 calls/day (10M prompt tokens/day), caching saves roughly $7.20/day on Haiku or $27/day on Sonnet, before accounting for cache writes. The cache write (the first request) costs slightly more than standard input, so caching pays off from the second request onward for any prompt block: one write plus one discounted read is cheaper than two uncached reads as long as the write premium is smaller than the read discount.
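The arithmetic behind these figures can be reproduced with a short sketch. The function names are made up for this example. The daily figure ignores the cache write premium and cache expiry, and the 1.25× write multiplier used in the break-even check is an assumption rather than a number from this page.

```python
def daily_cache_savings(
    prompt_tokens: int,
    calls_per_day: int,
    input_per_m: float,
    cache_read_per_m: float,
) -> float:
    """USD saved per day if every call reads the prompt from cache.

    Simplification: ignores the one-off cache write premium and cache expiry.
    """
    millions_of_tokens = prompt_tokens * calls_per_day / 1_000_000
    return millions_of_tokens * (input_per_m - cache_read_per_m)


def first_profitable_request(write_multiplier: float, read_multiplier: float) -> int:
    """Smallest request count at which caching is cheaper than not caching.

    Costs are expressed as multiples of the standard input price:
    cached   = one write + (n - 1) reads
    uncached = n standard reads
    """
    if read_multiplier >= 1.0:
        raise ValueError("cache reads must be cheaper than standard input")
    n = 1
    while write_multiplier + (n - 1) * read_multiplier >= n:
        n += 1
    return n


haiku = daily_cache_savings(10_000, 1_000, input_per_m=0.80, cache_read_per_m=0.08)
sonnet = daily_cache_savings(10_000, 1_000, input_per_m=3.00, cache_read_per_m=0.30)
print(f"Haiku: ${haiku:.2f}/day, Sonnet: ${sonnet:.2f}/day")

# ASSUMPTION: a cache write costs 1.25x standard input; a read costs 0.10x.
print(first_profitable_request(write_multiplier=1.25, read_multiplier=0.10))
```

With those multipliers, two requests cost 1.35× the standard input price when cached versus 2× uncached, so the cached block is already cheaper on its first reuse.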

Question 4

How do I see exactly what my Claude Code session cost?

Accepted Answer

Claude Code records every API call in session logs at ~/.claude/projects/<project-name>/*.jsonl. Each JSON line includes the model name, input token count, output token count, cache write tokens, and cache read tokens. Paste or drop this file into the Claude Code Cost Calculator. It applies current Anthropic pricing for each model and renders a breakdown by model, by tool, and by hour. No API key is needed; it runs completely client-side.
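For readers who want to total the token counts themselves, here is a minimal sketch that sums usage per model from JSONL lines. The record layout is an assumption: it expects each relevant line to carry a `message` object with a `model` string and a `usage` object whose fields follow the Anthropic API usage names (`input_tokens`, `output_tokens`, `cache_creation_input_tokens`, `cache_read_input_tokens`). Inspect a line of your own log before relying on it. The `tokens_by_model` helper and the sample records are hypothetical.

```python
import json
from collections import defaultdict
from typing import Iterable

# Usage field names as in the Anthropic API usage object.
TOKEN_FIELDS = (
    "input_tokens",
    "output_tokens",
    "cache_creation_input_tokens",  # cache write tokens
    "cache_read_input_tokens",      # cache read tokens
)


def tokens_by_model(lines: Iterable[str]) -> dict[str, dict[str, int]]:
    """Sum token counts per model, skipping lines without usage data."""
    totals: dict[str, dict[str, int]] = defaultdict(
        lambda: dict.fromkeys(TOKEN_FIELDS, 0)
    )
    for line in lines:
        line = line.strip()
        if not line:
            continue
        try:
            record = json.loads(line)
        except json.JSONDecodeError:
            continue  # tolerate truncated or non-JSON lines
        if not isinstance(record, dict):
            continue
        # ASSUMED layout: {"message": {"model": ..., "usage": {...}}}
        message = record.get("message")
        if not isinstance(message, dict):
            continue
        usage = message.get("usage")
        if not isinstance(usage, dict):
            continue
        model = message.get("model", "unknown")
        for field in TOKEN_FIELDS:
            totals[model][field] += int(usage.get(field) or 0)
    return dict(totals)


# Made-up sample records, not real log output.
sample = [
    '{"message": {"model": "model-a", "usage": {"input_tokens": 12, "output_tokens": 30, "cache_read_input_tokens": 1000}}}',
    '{"type": "summary"}',
    "not json",
    '{"message": {"model": "model-a", "usage": {"input_tokens": 8, "output_tokens": 5}}}',
]
print(tokens_by_model(sample))
```

To run it on a real session, pass an open file handle, for example `with open(path) as f: totals = tokens_by_model(f)`, and multiply each total by the per-token rate for that model.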

Anthropic API
Cost Calculator 2025

Claude API Pricing by Model

Claude Haiku 3.5 Cheapest

Claude Sonnet 4.5 Balanced

Claude Opus 4 Most Capable

How Anthropic API Billing Works

Frequently Asked Questions

How does Anthropic API billing work?

Which Claude model is cheapest for production?

How much does prompt caching save on Anthropic API costs?

How do I see exactly what my Claude Code session cost?
