Disclosure: Some links below are affiliate links. If you buy through them, this site earns a small commission at no extra cost. Editorial recommendations are never influenced by affiliate rates.
In this article
- What changed in 4.7
- Coding got a lot better
- Vision is now actually useful
- New control features
- What it costs (and the tokeniser catch)
- How it compares to the competition
- What Anthropic changed around cybersecurity
- Breaking changes to watch if migrating from 4.6
- Where to access Opus 4.7
- Who should actually upgrade
Anthropic dropped Claude Opus 4.7 on 16 April 2026 with gains across coding and vision tasks. On SWE-bench Verified (the standard benchmark for agentic coding), it scores 87.6% compared to 80.8% on Opus 4.6. Vision accuracy jumped from 54.5% to 98.5%. The pricing stayed flat: $5 per million input tokens, $25 per million output tokens. (Source: anthropic.com/news/claude-opus-4-7)
Whether it's worth upgrading depends on what you're doing with it. Here's the practical version.
What changed in 4.7
Coding got a lot better
The coding gains are the headline. Opus 4.7 scores 70% on CursorBench (a real-world IDE-integrated benchmark), up from 58% on Opus 4.6. That 12-point gain measures multi-file refactors and context-aware completions, which is what developers actually do. (Source: nxcode.io/claude-opus-4-7-guide)
On SWE-bench Verified (the agentic coding standard), 4.7 scores 87.6% versus 80.8% for Opus 4.6. GPT-5.4 sits at ~70% on the same benchmark. Clear difference.
In practice, tasks that previously required close human supervision (debugging a codebase, writing a fix, running tests) now run more independently. One early tester (a fintech firm) noted: "It catches its own logical faults during the planning phase and accelerates execution." (Source: anthropic.com/news/claude-opus-4-7)
Vision is now actually useful
The vision jump is the most dramatic change. Image resolution went from 1.25MP to 3.75MP. Accuracy on visual-understanding tasks went from 54.5% to 98.5%. The coordinate system simplified to 1:1 mapping.
Practically: you can now feed Claude screenshots for QA, UI debugging, chart reading, and document review, and it will get them right. The old model struggled with dense tables or small fonts. This one doesn't. If you've been skipping vision tasks, this is worth trying again.
New control features
Three changes for API users:
xhigh effort level. New step between "high" and "max" for extended thinking. Use when you need more reasoning than "high" but want to avoid "max" cost ceiling.
task_budget (beta). Instead of a hard per-request token limit, you now set an approximate budget for the full agentic loop: tool calls, thinking, output. The model winds down gracefully at the limit instead of cutting off mid-sentence. Makes long document reviews simpler to cost-estimate.
/ultrareview command in Claude Code. Deeper code review in the Claude Code editor. Relevant mainly for the desktop app, not the API.
What it costs (and the tokeniser catch)
The per-token price matches Opus 4.6: $5 per million input tokens, $25 per million output tokens. (Source: cloudzero.com/claude-api-pricing)
The wrinkle: 4.7 includes a new tokeniser. Depending on your content, the same text uses 1.0x to 1.35x more tokens. If you run tight cost projections on batch workflows, you need to retest before switching. You might pay 0 to 35% more per request even at unchanged per-token rates. (Source: finout.io/claude-opus-4-7-pricing)
For NZ developers: 5/25 per million tokens translates to NZD $8.25 input / $41.25 output at 1:1.65 exchange rate. Prompt caching cuts input costs up to 90%. Batch processing cuts both by 50%. Most small-to-medium workloads cost less than a coffee per month.
Or skip the API entirely and use Claude Pro for $20 USD/month (~NZD $33) via claude.ai. You get full-model access with conversation limits, no API setup. Better if you're an individual or small team that doesn't need programmatic access.
Alternatively, Perplexity Pro (20USD/month)bundlesClaudeOpus4.7accessalongsidewebsearchandimageanalysis, whichsavesswitchingbetweentools.Or[ReplitFoundingTier](https://replit.com/pricing)(20 USD/month) includes Claude Opus for pair-programming and rapid prototyping.
How it compares to the competition
| Model | SWE-bench Verified | SWE-bench Pro | Computer Use |
|---|---|---|---|
| Claude Opus 4.7 | 87.6% | 64.3% | 78.0% |
| GPT-5.4 | ~70% | 57.7% | 75.0% |
| Gemini 3.1 Pro | lower | lower | lower |
Opus 4.7 leads on every publicly available agentic coding and computer-use benchmark. (Source: llm-stats.com/claude-opus-4-7-launch, vellum.ai/claude-opus-4-7-benchmarks)
Note: Anthropic has Claude Mythos Preview, which scores higher on most benchmarks. It's not publicly available yet. Anthropic is rolling it out slowly to test cybersecurity safeguards. Opus 4.7 is what you can use today.
What Anthropic changed around cybersecurity
Opus 4.7 is the first Claude with real-time automated safeguards for cybersecurity requests. This is Anthropic's testing ground for eventual broader model releases. (Source: support.claude.com/en/articles/14604842)
Security pros doing legitimate work (penetration testing, vulnerability research) can apply to Anthropic's Cyber Verification Program for less-restricted access.
For most business users, this doesn't matter. You won't hit the safety layer building coding agents, document workflows, or analytics. But if you're in a compliance-heavy industry, the safeguards exist and are documented.
Breaking changes to watch if migrating from 4.6
Three API changes that might break existing code:
- Sampling parameters removed.
temperatureandtop_pdon't work the same way. If you tuned these for production consistency, test before switching. - Extended thinking budgets removed. The old
thinking_budgetis gone. Usetask_budget(beta) instead, which works differently. - Tokeniser change. Retest your cost and latency assumptions.
If your prompts relied on specific sampling behaviour, test before rolling out.
Where to access Opus 4.7
Access Opus 4.7 through:
- claude.ai: Claude Pro ($20 USD/month) or Team/Enterprise
- Anthropic API directly (model ID:
claude-opus-4-7): platform.claude.com - Amazon Bedrock (if you already use it, no extra setup)
- Google Cloud Vertex AI
- Microsoft Foundry
No NZ restrictions. USD billing is standard everywhere.
Who should actually upgrade
Upgrade now if:
- You're building coding agents or agentic multi-step workflows
- You're doing screenshot QA, UI analysis, or vision-heavy document work
- You're building long-running pipelines that need self-verification
- You use Claude Code on desktop
Test carefully if:
- You're on tight API cost budgets with heavy text input (tokeniser change might cost +0–35%)
- You're relying on specific
temperatureortop_ptuning for consistency - You're running creative writing where Opus 4.6 worked well
Stick with Sonnet 4.6 if:
- You're drafting, summarising, or answering questions (Opus power isn't needed)
- You're cost-sensitive. Sonnet 4.6 is 3/15 per million tokens, one-fifth of Opus cost. (Source: cloudzero.com)
For most NZ small businesses using Claude for writing, email, or support summaries, Sonnet 4.6 is still the right pick. Opus 4.7 pays off when tasks are hard and multi-step. If your prompts run 30 seconds and produce short text, you're paying 5x for capability you won't use.
For developers building agent workflows, though, Opus 4.7 is the best publicly available option right now.
Sources: Anthropic launch post (April 16, 2026), nxcode.io complete guide, evolink.ai review, llm-stats.com benchmark analysis, vellum.ai benchmark explainer, finout.io pricing analysis