Claude Extended Thinking Pricing

How Claude's extended-thinking mode is priced: thinking tokens count as output, plus when the latency and cost trade is worth it.

🔥 Launch tonight — Power Prompts PDF 50p (just 50p tonight)30 battle-tested Claude Code prompts · 8 pages · paste into CLAUDE.md · price reverts to £5

Extended thinking lets Claude reason through a problem before answering. The reasoning trace is real output, and Anthropic bills it as output tokens.

How it's billed

The cost math

A typical hard reasoning task uses 3–10k thinking tokens. On Sonnet 4.6, that's $0.045 – $0.15 per request just for the reasoning. On Opus 4.7, $0.225 – $0.75. Plan accordingly.

When it's worth it

When it isn't

Estimate

The Cost Calculator has a "thinking tokens per request" field — add an estimate to see total impact.

Frequently asked questions

How much do Claude thinking tokens cost?
Thinking tokens are billed as output tokens. On Sonnet 4.6 they cost $15 per million; on Opus 4.7 they cost $75 per million. A typical hard reasoning task uses 3,000–10,000 thinking tokens, costing $0.045–$0.15 on Sonnet and $0.225–$0.75 on Opus per request.
Can I cap the number of thinking tokens Claude uses?
Yes — the budget_tokens parameter sets a hard upper limit on thinking tokens per request. Setting it to 2,000 for simpler tasks and 8,000 for harder ones is a common pattern that keeps extended thinking cost predictable.
Is extended thinking available via the Batch API?
Yes. Extended thinking works with both the standard Messages API and the Batch API. Thinking tokens are still billed as output tokens; the 50% batch discount applies to them as well.

Free tools

Cost Calculator → Prompt-Pricing Recommender → Diff Summarizer → Skills Browser →

Related

Claude Opus 4.7 vs Sonnet 4.6 Pricing (2026 Comparison)How Much Does Claude Cost? (2026 API Pricing Guide)Claude Prompt Caching: 90% Cost Savings Explained (2026)Claude API Cost Calculator: Estimate Your Anthropic BillClaude vs GPT-4 Pricing: 2026 API Cost Comparison