Parameters
-
Context Length
200K
Modality
Text
Architecture
Dense
License
Proprietary
Release Date
1 Nov 2025
Knowledge Cutoff
-
Attention Structure
Multi-Head Attention
Hidden Dimension Size
-
Number of Layers
-
Attention Heads
-
Key-Value Heads
-
Activation Function
-
Normalization
-
Position Embedding
Absolute Position Embedding
Anthropic's most powerful Claude model with maximum thinking effort configuration. Exceptional performance across all domains with top-tier web development (Elo 1510), reasoning (80.09), and mathematics (90.39). Features extended reasoning chains and deep analytical capabilities. Ideal for complex problems requiring thorough analysis, multi-step reasoning, and comprehensive solutions. Extended context window up to 200K tokens.
Enhanced Claude models with further improvements in reasoning, coding, and agentic capabilities. Features advanced thinking modes with adjustable effort levels (high, medium, standard) for optimal performance-latency tradeoffs. Excels at complex analysis, software development, web development, and long-context understanding. Includes thinking variants that expose reasoning process for improved transparency.
Rank
#2
| Benchmark | Score | Rank |
|---|---|---|
Coding Aider Coding | 0.84 | 🥇 1 |
Web Development WebDev Arena | 1510 | 🥇 1 |
Agentic Coding LiveBench Agentic | 0.70 | 🥈 2 |
Coding LiveBench Coding | 0.80 | ⭐ 5 |
Reasoning LiveBench Reasoning | 0.80 | ⭐ 7 |
StackUnseen ProLLM Stack Unseen | 0.82 | 8 |
Mathematics LiveBench Mathematics | 0.77 | 14 |
Overall Rank
#2 🥈
Coding Rank
#1 🥇