Claude Sonnet 4.6 is the workhorse of the Anthropic API. For most production workloads it is the right default — capable enough for nearly all tasks, priced 5× under Opus.
Per-million-token pricing
Type               Price (USD per 1M tokens)
Input              $3
Output             $15
Cache write (5m)   $3.75
Cache read         $0.30
Batch input        $1.50
Batch output       $7.50
The 80/20
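The most common production pattern in 2026: Sonnet 4.6 with prompt caching on a long system prompt plus RAG context. Cached input tokens are billed at $0.30/M instead of $3/M, so the effective input cost approaches $0.30/M as the cache hit rate rises, while output stays at $15/M. For a 2k-input / 500-output workload that works out to about $0.008 per request with the input fully cached (about $0.0135 uncached), and most of that is output.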
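The per-request arithmetic can be sketched in a few lines of Python. This is a minimal illustration, not an official calculator: the function name `request_cost` and the `cache_hit_rate` parameter are hypothetical, the prices are the ones listed above, and the cache-write surcharge ($3.75/M on the first write) is ignored for simplicity.

```python
# Prices in USD per million tokens, copied from the pricing table above.
INPUT_PER_M = 3.00
OUTPUT_PER_M = 15.00
CACHE_READ_PER_M = 0.30


def request_cost(input_tokens, output_tokens, cache_hit_rate=0.0):
    """Estimate the USD cost of one request.

    cache_hit_rate is the fraction of input tokens read from cache (0.0 to 1.0).
    Assumption: cache-write costs are ignored, so this slightly understates
    the bill for requests that populate the cache.
    """
    if not 0.0 <= cache_hit_rate <= 1.0:
        raise ValueError("cache_hit_rate must be between 0 and 1")
    cached = input_tokens * cache_hit_rate
    uncached = input_tokens - cached
    input_cost = (uncached * INPUT_PER_M + cached * CACHE_READ_PER_M) / 1_000_000
    output_cost = output_tokens * OUTPUT_PER_M / 1_000_000
    return input_cost + output_cost


# The 2k-input / 500-output workload from the text, with and without caching.
uncached = request_cost(2_000, 500)
fully_cached = request_cost(2_000, 500, cache_hit_rate=1.0)
print(f"uncached: ${uncached:.4f}, fully cached: ${fully_cached:.4f}")
```

Even with every input token cached, the output side ($0.0075 of the total here) dominates, so trimming response length tends to matter more than squeezing the last few points of cache hit rate.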
Estimate your bill
Use the Claude Cost Calculator — set model to Sonnet 4.6, dial in your cache hit rate, and see the monthly number.
Frequently asked questions
What is the Claude Sonnet 4.6 price per million tokens?
Claude Sonnet 4.6 costs $3 per million input tokens and $15 per million output tokens at standard rates. With prompt caching, cached reads drop to $0.30/M. Batch mode drops both to half: $1.50 input, $7.50 output.
Is Sonnet 4.6 the same as Claude 4 Sonnet?
Not quite. Claude Sonnet 4.6 belongs to the 4th generation of Sonnet models from Anthropic and was released in 2026, but it is a newer model than the original Claude Sonnet 4. It is the mid-tier model in the current lineup, sitting between Haiku 4.5 and Opus 4.7.
How do I calculate my Sonnet 4.6 monthly bill?
Multiply your average input tokens per request by $0.000003 (the per-token price) and your average output tokens per request by $0.000015, add the two to get the cost per request, then multiply by your monthly request count. Or enter those numbers into the Claude Cost Calculator, which handles the arithmetic and applies cache and batch discounts.
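As a rough sketch of that calculation in Python: the function name `monthly_bill`, the `batch` flag, and the 100,000 requests per month figure are illustrative assumptions, while the per-token prices are the standard rates quoted above. Prompt caching is left out here.

```python
INPUT_PER_TOKEN = 0.000003   # $3 per million input tokens
OUTPUT_PER_TOKEN = 0.000015  # $15 per million output tokens


def monthly_bill(avg_input_tokens, avg_output_tokens, requests_per_month, batch=False):
    """Estimate the monthly USD bill at standard (or batch) rates, without caching."""
    per_request = (avg_input_tokens * INPUT_PER_TOKEN
                   + avg_output_tokens * OUTPUT_PER_TOKEN)
    if batch:
        # Batch rates are half of standard: $1.50 input, $7.50 output per million.
        per_request *= 0.5
    return per_request * requests_per_month


# Hypothetical workload: 2,000 input and 500 output tokens, 100,000 requests a month.
print(f"standard: ${monthly_bill(2_000, 500, 100_000):,.2f}")
print(f"batch:    ${monthly_bill(2_000, 500, 100_000, batch=True):,.2f}")
```

For this hypothetical workload the standard-rate estimate comes to $1,350 a month, and $675 if everything can run through batch mode.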