Near-flagship quality at lower cost
Sonnet 4.6 is often the better price-performance option for daily engineering and operations workflows.
Anthropic's most capable Sonnet model. Near Opus-level intelligence with dramatic improvements in coding, computer use, long-context reasoning, and agent planning — at unchanged Sonnet pricing.
Enter a prompt below to experience Claude Sonnet 4.6's dramatically improved capabilities firsthand.
Dramatically improved coding, computer use that approaches human-level performance, and powerful long-context reasoning — all at Sonnet pricing.
Reads context before modifying code, consolidates duplicated logic, reduces over-engineering and lazy behavior. Multi-step task execution is far more stable with fewer false-positive success reports.
Evolved from experimental to practical. OSWorld benchmark jumped from 14.9% to 72.5%. Approaches human-level performance on complex spreadsheets and multi-step web forms, coordinating across multiple browser tabs.
1M token context window (beta) holds entire codebases, long contracts, or dozens of research papers. Sonnet 4.6 reasons effectively over such long contexts — not just stuffs text in.
All-around improvement approaching Opus-level performance, with standout results in computer use and coding.
Up from 14.9% at launch — approaching human-level performance in real software environments
70% of Claude Code users prefer Sonnet 4.6 over previous Sonnet 4.5
59% of users prefer Sonnet 4.6 over flagship Opus 4.5 for coding tasks. Learn more about <a href='https://www.anthropic.com/models/opus'>Opus</a>.
Highest score on the agentic coding evaluation
1 million token context window (beta) for entire codebases and long documents
Best-in-class value for a model that delivers near Opus-level performance
A comprehensive upgrade across every dimension — from programming to planning, from safety to scale.
Comprehensive benchmark gains across all evaluations. Tasks that previously required Opus can now be handled by Sonnet — at Sonnet pricing of $3/$15 per million tokens.
Enough to hold an entire codebase, a long contract, or dozens of research papers. Sonnet 4.6 reasons effectively over long contexts, enabling complex long-horizon planning.
Significant improvement in resisting prompt injection attacks compared to Sonnet 4.5. Malicious webpages can no longer easily hijack the model during computer use tasks.
Dramatically improved multi-step task execution. Better at orchestrating agent teams, planning ahead, and recovering from errors without human intervention.
Web search and scraping tools now automatically filter and process results, keeping only relevant content to save tokens. Code execution, memory, and tool use have reached GA.
Despite all improvements, pricing remains $3 per million input tokens and $15 per million output tokens — the same as previous Sonnet models.
A practical balance of strong coding quality, computer-use automation, and predictable cost efficiency.
Sonnet 4.6 is often the better price-performance option for daily engineering and operations workflows.
It handles multi-step browser and office-like tasks with better reliability and fewer intervention loops.
With 1M context support, teams can process bigger artifacts and preserve task continuity across long sessions.
Use an execution-first prompt structure to improve consistency across coding, automation, and long-context tasks.
Specify inputs, expected outputs, and failure handling to reduce ambiguity in multi-step automation tasks.
Group documents into sections and mark source priorities so Sonnet can maintain stronger reasoning focus.
Track token usage, latency, and task completion quality to decide where Sonnet should replace higher-cost models.
Compare other frontier models on Fluxchat.
OpenAI's professional model for coding, reasoning, tools, and long-context work
Anthropic's most intelligent model with 1M context
Zhipu AI's flagship model with open SOTA coding & agent performance
Google's most advanced reasoning model with 77.1% ARC-AGI-2 & 1M context
Have more questions? Contact our support team.