趋近智
参数
-
上下文长度
200K
模态
Text
架构
Dense
许可证
Proprietary
发布日期
1 Oct 2025
训练数据截止日期
Feb 2025
注意力结构
Multi-Head Attention
隐藏维度大小
-
层数
-
注意力头
-
键值头
-
激活函数
-
归一化
-
位置嵌入
Absolute Position Embedding
Claude Haiku 4.5 is a high-throughput, multimodal large language model designed for low-latency applications requiring near-frontier intelligence at scale. Within the Claude 4.5 model family, Haiku serves as the optimized execution engine, balancing computational efficiency with sophisticated capabilities such as agentic reasoning and autonomous computer use. It is engineered to handle complex, multi-step instructions and high-volume data streams, making it a primary choice for developers building responsive AI agents and real-time customer-facing services.
Technically, the model utilizes a dense transformer architecture and is trained with a specialized focus on context awareness. This architectural refinement allows the model to monitor its own token consumption within its 200,000-token context window, effectively mitigating agentic laziness and ensuring persistent reasoning during long-running tasks. Unlike many contemporary models that employ rotary embeddings, Claude 4.5 Haiku continues to utilize absolute position embeddings combined with multi-head attention (MHA) to maintain structural consistency and precision across its expanded context. The model supports multimodal inputs, enabling it to process and analyze visual data alongside text with significant speed.
Performance characteristics are centered on rapid inference and cost-effectiveness for production-grade workloads. A standout feature of this variant is the inclusion of extended thinking, which allows the model to allocate additional internal compute for deliberate reasoning before generating an output. This makes Haiku 4.5 particularly effective for sub-agent orchestration, where it acts as a fast executor for plans developed by larger models like Sonnet 4.5. Common use cases include automated financial monitoring, real-time code refactoring, and large-scale document processing where maintaining high quality at a reduced cost is a technical requirement.
Enhanced Claude models with further improvements in reasoning, coding, and agentic capabilities. Features advanced thinking modes with adjustable effort levels (high, medium, standard) for optimal performance-latency tradeoffs. Excels at complex analysis, software development, web development, and long-context understanding. Includes thinking variants that expose reasoning process for improved transparency.
排名
#69
| 基准 | 分数 | 排名 |
|---|---|---|
Web Development WebDev Arena | 1405 | 17 |
Coding LiveBench Coding | 0.72 | 22 |
Agentic Coding LiveBench Agentic | 0.33 | 25 |
Data Analysis LiveBench Data Analysis | 0.66 | 35 |
Reasoning LiveBench Reasoning | 0.34 | 40 |
Mathematics LiveBench Mathematics | 0.58 | 42 |