
Claude Context Cost Calculator

Calculate Anthropic Claude API costs for Claude Sonnet 4 and Claude Haiku 3.5. Both models offer a 200K context window and competitive quality on complex tasks.

LLM Context Cost Calculator

Enter your usage parameters below

Number of tokens in your prompt per request

Number of tokens in the model response

Average number of API calls per day

$2.50/M input · $10.00/M output · 128K context

Cost Per Request

$0.022

GPT-4o - OpenAI

Daily Cost

$2.25

Monthly Cost

$67.50

Annual Cost

$810.00

Cost Per 1K Tokens

$0.0037

Blended input + output rate

Input vs Output Split

56% / 44%

Input cost vs output cost

Cheapest Alternative

GPT-4.1-nano

$2.70/mo - save 96% ($64.80/mo)

OpenAI · budget
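The figures above follow from simple arithmetic on the per-token prices. A minimal sketch of that arithmetic - the usage values (5,000 input tokens, 1,000 output tokens, 100 calls/day) are assumptions chosen to reproduce the GPT-4o example shown; substitute your own:

```python
# GPT-4o prices from the calculator ($ per token).
INPUT_PRICE = 2.50 / 1_000_000
OUTPUT_PRICE = 10.00 / 1_000_000

# Assumed usage, consistent with the displayed results.
input_tokens = 5_000
output_tokens = 1_000
calls_per_day = 100

cost_per_request = input_tokens * INPUT_PRICE + output_tokens * OUTPUT_PRICE
daily = cost_per_request * calls_per_day
monthly = daily * 30
annual = monthly * 12
per_1k_tokens = cost_per_request / (input_tokens + output_tokens) * 1_000
input_share = input_tokens * INPUT_PRICE / cost_per_request

print(f"${cost_per_request:.4f} per request")                    # $0.0225 per request
print(f"${daily:.2f}/day · ${monthly:.2f}/mo · ${annual:.2f}/yr") # $2.25/day · $67.50/mo · $810.00/yr
print(f"${per_1k_tokens:.4f} per 1K tokens, {input_share:.0%} input share")
```

The blended per-1K-token rate divides the request cost by total tokens, which is why it sits between the input and output prices.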

Optimize your LLM spend

Platforms like OpenRouter, Together AI, and Groq offer competitive pricing and unified APIs across multiple models. Fireworks AI and Anyscale specialize in high-throughput inference at lower cost.

OpenRouter · Together AI · Groq · Fireworks AI · Anyscale

Need help optimizing LLM costs?

Digital Signet builds AI-powered systems and provides fractional CTO leadership. 20+ years shipping software.

This costs you ~$810/year

We'll identify the top 3 drivers and give you a 90-day mitigation plan.

Get a Free Exposure Teardown →

Or email Oliver directly → [email protected]

Claude Model Pricing

Model            | Input $/1M | Output $/1M | Context | Category
Claude Sonnet 4  | $3.00      | $15.00      | 200K    | flagship
Claude Haiku 3.5 | $0.80      | $4.00       | 200K    | fast

Claude Prompt Caching

Claude's prompt caching charges cache writes at $3.75/1M and cache reads at $0.30/1M - a 90% discount on standard input tokens. Caching breaks even once a cached prefix is reused about 1.3 times, i.e., it pays for itself from the second use onward. Ideal for long system prompts and repeated document context.
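The break-even point follows directly from the prices: a cache write costs $0.75/1M more than a standard input pass, and every subsequent read saves $2.70/1M. A quick sketch:

```python
# Prompt-caching break-even, using the Claude Sonnet 4 prices quoted above.
STANDARD = 3.00  # $/1M standard input tokens
WRITE = 3.75     # $/1M cache-write tokens
READ = 0.30      # $/1M cache-read tokens

# After n uses of the same prefix:
#   cached cost   = WRITE + (n - 1) * READ
#   uncached cost = n * STANDARD
# Setting them equal and solving for n:
n_break_even = (WRITE - READ) / (STANDARD - READ)
print(f"break-even at ~{n_break_even:.2f} uses")  # break-even at ~1.28 uses
```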

Frequently Asked Questions

How much does the Claude API cost?

Claude Sonnet 4 is priced at $3.00 per 1M input tokens and $15.00 per 1M output tokens. Claude Haiku 3.5 is considerably cheaper at $0.80/1M input and $4.00/1M output. Both models have a 200,000 token context window. For a typical exchange (1,000 input + 500 output tokens), Claude Sonnet 4 costs about $0.0105 per call.
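The per-call figure can be checked directly. A minimal sketch using the prices quoted above (the model keys are illustrative labels, not official API identifiers):

```python
# (input $/1M, output $/1M) per the pricing table above.
PRICES = {
    "claude-sonnet-4": (3.00, 15.00),
    "claude-haiku-3.5": (0.80, 4.00),
}

def call_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Dollar cost of a single API call at the listed per-1M-token prices."""
    inp, out = PRICES[model]
    return input_tokens * inp / 1e6 + output_tokens * out / 1e6

print(f"${call_cost('claude-sonnet-4', 1_000, 500):.4f}")   # $0.0105
print(f"${call_cost('claude-haiku-3.5', 1_000, 500):.4f}")  # $0.0028
```

For the same 1,000-input/500-output exchange, Haiku 3.5 comes in at roughly a quarter of the Sonnet 4 cost.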

What is Claude's context window size?

All current Claude models support a 200,000 token context window - equivalent to roughly 150,000 words or a full novel. This is larger than GPT-4o's 128K but smaller than GPT-4.1's 1M context. Claude's 200K window is sufficient for most enterprise use cases including long document analysis, large codebase review, and extended multi-turn conversations.

How does Claude compare to GPT-4 on cost?

Claude Sonnet 4 at $3.00/1M input is 50% more expensive than GPT-4.1 ($2.00/1M), and its output tokens cost nearly twice as much ($15 vs $8/1M). Claude Haiku 3.5 at $0.80/$4.00 is more expensive than GPT-4.1-mini ($0.40/$1.60) but often preferred for its stronger instruction following and reduced hallucinations in structured output tasks.

Does Claude support prompt caching?

Yes. Anthropic offers prompt caching for Claude models. Cached input tokens cost $0.30/1M (90% discount vs standard input price). Cache writes cost $3.75/1M. This is highly effective for applications with long static context like system prompts, retrieved documents, or tool definitions that are reused across many API calls.
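To make the savings concrete, here is a worked example at the prices above. The prefix size (10,000 tokens) and call count (100) are illustrative assumptions, not figures from this page:

```python
# Caching a 10,000-token static prefix (system prompt + tool definitions)
# reused across 100 API calls, at the cache prices quoted above.
WRITE, READ, STANDARD = 3.75, 0.30, 3.00  # $/1M tokens
prefix_tokens, calls = 10_000, 100

uncached = calls * prefix_tokens * STANDARD / 1e6                          # $3.00
cached = (prefix_tokens * WRITE + (calls - 1) * prefix_tokens * READ) / 1e6  # one write + 99 reads
savings = 1 - cached / uncached

print(f"uncached ${uncached:.2f} · cached ${cached:.4f} · save {savings:.0%}")
```

At high reuse counts the one-time write premium washes out and the effective discount approaches the full 90%.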