Claude 4.5 Sonnet
A frontier-level hybrid-reasoning model excelling at coding, long-horizon tasks, computer use, and domain reasoning with top-tier alignment and reliability.
Model Details
Provider
Anthropic
Model Type
text
Context Window
1,000,000 tokens
Pricing
Input (1M)$3.00
Output (1M)$15.00
Capabilities
1. Best-in-class coding performance
- #1 on SWE-bench Verified (77.2% standard, 82.0% high-compute).
- Excels at debugging, architecture, and multi-file code generation.
- Maintains coherence for extremely long tasks (30+ hours).
2. State-of-the-art computer use & agents
- Leads OSWorld at 61.4%.
- Strongest model for agentic workflows, multi-step tool use, and real computer control.
- Powering Claude Code, the new Claude Agent SDK, and Chrome agent actions.
3. Advanced reasoning & math
- Large improvements across reasoning-heavy benchmarks (AIME, MMMLU, τ2-bench, Terminal-Bench).
- Deep multi-step reasoning with extended or interleaved thinking.
4. High alignment & safety
- Most aligned Claude model to date with reduced deception, hallucinations, sycophancy, and harmful compliance.
- Strong protections against prompt injection for agentic tasks (ASL-3 safeguards).
5. Domain-expert performance
- Notable gains in finance, law, medicine, and STEM tasks.
- Trusted by early customers for long-context legal analysis, multi-file engineering, security research, and red-teaming.