Parameters
-
Context Length
200K
Modality
Text
Architecture
Dense
License
Proprietary
Release Date
1 Oct 2025
Knowledge Cutoff
Feb 2025
Attention Structure
Multi-Head Attention
Hidden Dimension Size
-
Number of Layers
-
Attention Heads
-
Key-Value Heads
-
Activation Function
-
Normalization
RMS Normalization
Position Embedding
Absolute Position Embedding
Claude Haiku 4.5 Thinking is a high-efficiency large language model developed by Anthropic, engineered to provide near-frontier intelligence with the low-latency profile characteristic of the Haiku model family. As a hybrid reasoning model, it incorporates an optional extended thinking mode that allows the system to engage in multi-step internal reasoning before emitting a final response. This architectural design balances the computational demands of complex problem-solving with the speed required for real-time production environments.
Technically, the model features significant advancements in context management and output capacity, supporting a 200,000-token context window and generating up to 64,000 tokens in a single response. It is designed with explicit context awareness, enabling the system to track its own token usage and adjust its reasoning persistence based on the remaining context budget. This capability is specifically tuned to mitigate agentic laziness and improve performance in long-horizon tasks such as codebase refactoring and autonomous system orchestration.
The model's utility is centered on high-throughput, cost-sensitive applications including customer service automation, real-time pair programming, and multi-agent systems where parallel execution is required. By integrating native vision capabilities alongside text processing, it supports multimodal workflows like document analysis and UI testing. Its training emphasis on coding and tool use allows it to act as a responsive engine for developer tools, offering a refined balance of speed and analytical depth at a significantly lower operational cost than flagship variants.
Enhanced Claude models with further improvements in reasoning, coding, and agentic capabilities. Features advanced thinking modes with adjustable effort levels (high, medium, standard) for optimal performance-latency tradeoffs. Excels at complex analysis, software development, web development, and long-context understanding. Includes thinking variants that expose reasoning process for improved transparency.
Rank
#34
| Benchmark | Score | Rank |
|---|---|---|
Agentic Coding LiveBench Agentic | 0.42 | 16 |
Mathematics LiveBench Mathematics | 0.78 | 18 |
Coding LiveBench Coding | 0.73 | 20 |
Reasoning LiveBench Reasoning | 0.62 | 21 |
Data Analysis LiveBench Data Analysis | 0.69 | 23 |
Overall Rank
#34
Coding Rank
#40