趋近智
参数
-
上下文长度
200K
模态
Text
架构
Dense
许可证
Proprietary
发布日期
1 Oct 2025
训练数据截止日期
Feb 2025
注意力结构
Multi-Head Attention
隐藏维度大小
-
层数
-
注意力头
-
键值头
-
激活函数
-
归一化
RMS Normalization
位置嵌入
Absolute Position Embedding
Claude Haiku 4.5 Thinking is a high-efficiency large language model developed by Anthropic, engineered to provide near-frontier intelligence with the low-latency profile characteristic of the Haiku model family. As a hybrid reasoning model, it incorporates an optional extended thinking mode that allows the system to engage in multi-step internal reasoning before emitting a final response. This architectural design balances the computational demands of complex problem-solving with the speed required for real-time production environments.
Technically, the model features significant advancements in context management and output capacity, supporting a 200,000-token context window and generating up to 64,000 tokens in a single response. It is designed with explicit context awareness, enabling the system to track its own token usage and adjust its reasoning persistence based on the remaining context budget. This capability is specifically tuned to mitigate agentic laziness and improve performance in long-horizon tasks such as codebase refactoring and autonomous system orchestration.
The model's utility is centered on high-throughput, cost-sensitive applications including customer service automation, real-time pair programming, and multi-agent systems where parallel execution is required. By integrating native vision capabilities alongside text processing, it supports multimodal workflows like document analysis and UI testing. Its training emphasis on coding and tool use allows it to act as a responsive engine for developer tools, offering a refined balance of speed and analytical depth at a significantly lower operational cost than flagship variants.
Enhanced Claude models with further improvements in reasoning, coding, and agentic capabilities. Features advanced thinking modes with adjustable effort levels (high, medium, standard) for optimal performance-latency tradeoffs. Excels at complex analysis, software development, web development, and long-context understanding. Includes thinking variants that expose reasoning process for improved transparency.
排名
#34
| 基准 | 分数 | 排名 |
|---|---|---|
Agentic Coding LiveBench Agentic | 0.42 | 16 |
Mathematics LiveBench Mathematics | 0.78 | 18 |
Coding LiveBench Coding | 0.73 | 20 |
Reasoning LiveBench Reasoning | 0.62 | 21 |
Data Analysis LiveBench Data Analysis | 0.69 | 23 |