Claude Sonnet 5 - DeveloPassion

# Claude Sonnet 5 Claude Sonnet 5 is [[Anthropic]]'s mid-tier [[Claude]] model, released early July 2026. Anthropic calls it "our most agentic Sonnet yet": it plans, drives browsers and terminals, and runs autonomously at a level that required Opus-class models a few months earlier. The pitch is simple: near [[Claude Opus 4.8]] results at Sonnet pricing. ## Positioning - Bridges the gap between Sonnet 4.6 and the Opus tier; approaches Opus 4.8 on several agentic evaluations - Early testers report it "finishes complex tasks where previous Sonnet models would stop short" and checks its own output without being asked - On OSWorld-Verified (computer use), it can match Opus 4.8 at higher effort levels. On BrowseComp (agentic search), it offers a wider cost-performance range than Sonnet 4.6 - Lower hallucination and sycophancy rates than its predecessor, continuing the honesty push that started with [[Claude Opus 4.8]] ## Availability - Model ID: `claude-sonnet-5` - Context window: 1M tokens - The new default in [[Claude Code]] for Pro users - Available on Free, Pro, Max, Team, and Enterprise plans, plus the [[Claude API]] and Managed Agents ## Pricing - Launch pricing (through 2026-08-31): $2 / $10 per million input/output tokens - Standard pricing after that: $3 / $15 - The launch discount frames the real question: at $3/$15, is Sonnet 5 at high effort still cheaper than Opus at low effort? Hacker News says: not always ## Reception and caveats The Hacker News thread is more skeptical than the announcement: - **Value-prop confusion**: on several benchmarks, cost per task rises above Opus at anything higher than medium effort. If you need high effort anyway, why not run Opus at a lower setting? - **Wealth-extraction complaints**: recurring accusations that recent models are tuned to consume tokens rather than solve problems. The new tokenizer producing up to 33% more tokens, models reading tens of thousands of lines they don't need, identical "$200 credits" handed to enterprise customers paying premium rates - **Benchmark chart games**: commenters caught Anthropic updating performance charts after publication; the cost x-axis shrank from $50 to $10 - **Security posture**: Sonnet 5 scores 0 on CyberGym with default mitigations enabled. Anthropic ships it with real-time cyber safeguards on by default and deliberately limited offensive capabilities compared to Opus - **The agentic split**: some developers push back on full autonomy and prefer agent-assisted development; others report letting agents run 100k LOC mobile projects with good results - **Cheaper alternatives**: Kimi K2.7 Code and GLM-5.2 come up repeatedly, with the familiar "95% of the performance at 10% of the price" argument. The broader thread questions whether frontier-lab valuations survive inference commoditization The plateau-fatigue pattern from the [[Claude Opus 4.8]] launch repeats here. The announcements emphasize agentic reliability; the community debates pricing mechanics and token consumption. Capability is no longer the story. Cost per outcome is. ## Working with it - If you run [[Claude Code]] on a Pro plan, you're already on it - Benchmark your own workloads before assuming the "Opus-level at Sonnet pricing" framing holds; effort level changes the math completely - The 1M context window makes it a candidate for long-horizon [[AI Agents]] work that used to require Opus, including [[Claude Dynamic Workflows]] jobs where per-agent cost adds up fast ## References - Announcement: https://www.anthropic.com/news/claude-sonnet-5 - Hacker News discussion: https://news.ycombinator.com/item?id=48736605 - ClaudeDevs launch tweet: https://x.com/ClaudeDevs/status/2072018504392601762 - Claude launch tweet: https://x.com/claudeai/status/2072017450611142835 ## Related - [[Claude]] - [[Anthropic]] - [[Claude Code]] - [[Claude Opus 4.8]] - [[Claude Fable 5]] - [[Claude Dynamic Workflows]] - [[Large Language Models (LLMs)]] - [[AI Agents]]