Claude Haiku 4.5

Closed Source

Closed Weights

Parameters

Context Length

200K

Modality

Text

Architecture

Dense

License

Proprietary

Release Date

1 Oct 2025

Knowledge Cutoff

Feb 2025

Evaluation Benchmarks

Rank

#124

Benchmark	Score	Rank
StackUnseen ProLLM Stack Unseen	0.476	25
Coding LiveBench Coding	0.72	32
Agentic Coding LiveBench Agentic	0.33	39
General Text Text Arena	1410	49
Data Analysis LiveBench Data Analysis	0.45	50
Mathematics LiveBench Mathematics	0.58	54
Reasoning LiveBench Reasoning	0.34	59
Web Development WebDev Arena	1324	71

Rankings

Overall Rank

#124

Coding Rank

#95

About Claude Haiku 4.5

Claude Haiku 4.5 is a high-throughput, multimodal large language model designed for low-latency applications requiring near-frontier intelligence at scale. Within the Claude 4.5 model family, Haiku serves as the optimized execution engine, balancing computational efficiency with sophisticated capabilities such as agentic reasoning and autonomous computer use. It is engineered to handle complex, multi-step instructions and high-volume data streams, making it a primary choice for developers building responsive AI agents and real-time customer-facing services.

Technically, the model utilizes a dense transformer architecture and is trained with a specialized focus on context awareness. This architectural refinement allows the model to monitor its own token consumption within its 200,000-token context window, effectively mitigating agentic laziness and ensuring persistent reasoning during long-running tasks. Unlike many contemporary models that employ rotary embeddings, Claude 4.5 Haiku continues to utilize absolute position embeddings combined with multi-head attention (MHA) to maintain structural consistency and precision across its expanded context. The model supports multimodal inputs, enabling it to process and analyze visual data alongside text with significant speed.

Performance characteristics are centered on rapid inference and cost-effectiveness for production-grade workloads. A standout feature of this variant is the inclusion of extended thinking, which allows the model to allocate additional internal compute for deliberate reasoning before generating an output. This makes Haiku 4.5 particularly effective for sub-agent orchestration, where it acts as a fast executor for plans developed by larger models like Sonnet 4.5. Common use cases include automated financial monitoring, real-time code refactoring, and large-scale document processing where maintaining high quality at a reduced cost is a technical requirement.

Technical Specifications

Attention

Attention Structure

Multi-Head Attention

Attention Heads

Key-Value Heads

Attention Head Dimension

Position Embedding

Absolute Position Embedding

RoPE Theta

Sliding Window Attention

Sliding Window Size

Sliding Window Ratio

Linear Attention

Linear Attention Ratio

Normalization

Activation Function

Dimensions

Hidden Dimension Size

Number of Layers

FFN Intermediate Size (Dense)

Multi-Token Prediction Heads

Tokenizer

Vocabulary Size

Model Integrity

Total Score

34 / 100

Upstream

10.0 / 30

Model

14.0 / 40

Downstream

10.0 / 30

Claude Haiku 4.5 Model Integrity Report

Total Score

/ 100

Audit Note

Claude 4.5 Haiku exhibits a high degree of operational transparency regarding its identity and API functionality, but remains almost entirely opaque concerning its upstream development. Critical gaps exist in the disclosure of training data, architectural specifications, and compute resources, which are treated as proprietary secrets. While the model provides clear versioning and performance benchmarks, the inability to verify these claims through independent reproduction or technical documentation limits its overall transparency profile.

Upstream

10.0 / 30

Architectural Provenance

3.0 / 10

Anthropic identifies Claude 4.5 Haiku as a 'hybrid reasoning model' and a 'dense transformer,' but provides no specific architectural details such as layer counts, attention mechanisms (beyond general multi-head attention mentions), or specific modifications from previous generations. While the system card mentions 'context-aware' training to mitigate agentic laziness, the underlying pre-training methodology and specific architectural provenance remain proprietary and largely undisclosed.

Dataset Composition

2.0 / 10

Information regarding the training data is extremely limited. The system card vaguely refers to 'internet data' and 'AI feedback' (RLHF/RLAIF) without providing any breakdown of data sources, proportions (e.g., code vs. text), or specific filtering and cleaning methodologies. There is no public disclosure of the specific datasets used or sample data for verification, falling into the 'proprietary dataset' red flag category.

Tokenizer Integrity

5.0 / 10

While a public tokenizer tool exists for the Claude 4.5 family (claudetokenizer.com) and the API provides token counting functionality, the technical documentation for the tokenizer itself—such as vocabulary size, training data alignment, and specific normalization techniques—is not comprehensively documented in a public technical paper. It is functional for users but opaque in its construction.

Model

14.0 / 40

Parameter Density

1.0 / 10

Anthropic does not disclose the parameter count for Claude 4.5 Haiku. While it is marketed as a 'small' or 'lightweight' model within the 4.5 family, no specific numbers (total or active) are provided. This lack of transparency makes it impossible to verify efficiency claims or compare density against competitors.

Training Compute

0.0 / 10

There is zero public information regarding the compute resources used to train Claude 4.5 Haiku. No GPU/TPU hours, hardware specifications, training duration, or carbon footprint data have been disclosed. Anthropic cites competitive reasons for withholding these metrics, which is a direct red flag under the scoring guidelines.

Benchmark Reproducibility

4.0 / 10

Anthropic provides evaluation results for standard benchmarks like SWE-bench Verified (73.3%) and internal 'computer use' tests. However, while they mention some methodology (e.g., averaging over 50 trials), the exact evaluation code, full prompt sets, and detailed reproduction instructions are not publicly available. Third-party verification is limited to high-level leaderboard positions rather than granular reproduction.

Identity Consistency

9.0 / 10

The model consistently identifies itself as Claude 4.5 Haiku and maintains a clear versioning identity (e.g., claude-haiku-4-5-20251001). It is transparent about its role as a 'sub-agent' or 'executor' within the broader Claude 4.5 ecosystem and accurately reflects its 200k context window and reasoning capabilities in its system prompts.

Downstream

10.0 / 30

License Clarity

3.0 / 10

The model is governed by a strictly proprietary license. While the terms for API usage and commercial use are clearly stated in Anthropic's Commercial Terms of Service, it is not open source or open weights. The score is low because the 'license' is a service agreement rather than a model license that allows for inspection or derivative works.

Hardware Footprint

2.0 / 10

As a closed-source API-only model, there is no documentation regarding the actual hardware requirements (VRAM, compute) to run the model. While Anthropic provides pricing and latency expectations, the underlying hardware footprint and quantization tradeoffs are entirely hidden from the public.

Versioning Drift

5.0 / 10

Anthropic uses dated versioning (e.g., 20251001) and maintains a basic changelog for its API and tools (like Claude Code). However, there is no public documentation of model drift over time or specific 'alignment tax' impacts. While version pinning is possible, the transparency regarding what changes between silent 'point' updates is minimal.

Resources

Official Documentation Release Notes

About Claude 4.5

Enhanced Claude models with further improvements in reasoning, coding, and agentic capabilities. Features advanced thinking modes with adjustable effort levels (high, medium, standard) for optimal performance-latency tradeoffs. Excels at complex analysis, software development, web development, and long-context understanding. Includes thinking variants that expose reasoning process for improved transparency.

Claude Haiku 4.5

Evaluation Benchmarks

Rankings

About Claude Haiku 4.5

Technical Specifications

Model Integrity

Claude Haiku 4.5 Model Integrity Report

Audit Note

Upstream

Model

Downstream

Resources

About Claude 4.5

Other Claude 4.5 Models