1,000,000 Tokens

Amp can now use 1 million tokens of context with Claude Sonnet 4, up from 432,000 tokens two weeks ago.

You should not use the full context window for most tasks in Amp. Instead, use small threads that are scoped to a single task. Amp is better, faster, and cheaper when used this way. A notice will appear when you hit 20% of the context window to remind you of this.

Longer threads are more expensive, both because each iteration of the agentic loop sends more and more tokens, and because requests with more than 200k tokens are roughly twice as expensive per token in Anthropic's API pricing.

Amp thread with 1,000,000 tokens of context

Note: the screenshot shows 968k tokens because the context window is composed of 968k input tokens and 32k output tokens.