ApX logoApX logo

GPT-5.4 Pro

Parameters

-

Context Length

272K

Modality

Multimodal

Architecture

Dense

License

Proprietary

Release Date

5 Mar 2026

Knowledge Cutoff

-

Technical Specifications

Attention

Attention Structure

Multi-Head Attention

Attention Heads

-

Key-Value Heads

-

Attention Head Dimension

-

Position Embedding

Absolute Position Embedding

RoPE Theta

-

Sliding Window Attention

-

Sliding Window Size

-

Normalization

-

Activation Function

-

Dimensions

Hidden Dimension Size

-

Number of Layers

-

FFN Intermediate Size (Dense)

-

Multi-Token Prediction Heads

-

Tokenizer

Vocabulary Size

-

GPT-5.4 Pro

GPT-5.4 Pro is OpenAI's maximum-performance variant of GPT-5.4, designed for the most complex tasks requiring the highest reasoning capacity. Achieves state-of-the-art results on ARC-AGI-2 (83.3%), BrowseComp (89.3%), GPQA Diamond (94.4%), and FrontierMath Tier 4 (38.0%). Pricing: $30/M input, $180/M output. Available via API as gpt-5.4-pro. Released March 5, 2026.

About GPT-5.4

GPT-5.4 is OpenAI's most capable and efficient frontier model for professional work, combining the industry-leading coding capabilities of GPT-5.3-Codex with major advances in reasoning, computer use, and agentic workflows. It introduces native computer-use capabilities, tool search for large tool ecosystems, substantially improved knowledge work (spreadsheets, presentations, documents), and is OpenAI's most factual and token-efficient reasoning model. Supports up to 1M context tokens in Codex. Released March 5, 2026.


Other GPT-5.4 Models

Evaluation Benchmarks

No evaluation benchmarks for GPT-5.4 Pro available.

Rankings

Overall Rank

-

Coding Rank

-

Model Integrity

Total Score

F

31 / 100

GPT-5.4 Pro Model Integrity Report

Total Score

31

/ 100

F

Audit Note

GPT-5.4 Pro exhibits a high degree of opacity regarding its internal architecture, training data, and compute resources. While it provides clear versioning and consistent self-identification for API users, the lack of reproducible evaluation methodologies and technical documentation significantly hinders independent verification. The model's transparency profile is characterized by strong performance claims backed by limited verifiable evidence.

Upstream

9.0 / 30

Architectural Provenance

3.0 / 10

OpenAI identifies GPT-5.4 Pro as a 'unified' model that consolidates the reasoning of the GPT-5 series with the coding capabilities of GPT-5.3-Codex. However, the underlying architecture is described only as 'dense' in marketing materials, with no public technical report or documentation detailing the specific training methodology, architectural modifications, or pretraining procedures. The lack of a peer-reviewed paper or detailed technical specification makes the architectural provenance largely unverifiable.

Dataset Composition

1.0 / 10

There is no public disclosure of the training data sources, composition breakdown, or data collection methodology. Documentation only mentions a knowledge cutoff of August 31, 2025, and vague claims of being 'factually grounded' with a 33% reduction in errors. No information regarding filtering, cleaning, or the use of synthetic data is available, which is a significant transparency gap for a frontier model.

Tokenizer Integrity

5.0 / 10

The tokenizer is accessible via the OpenAI API, and the model supports a standard 272K context window (expandable to 1.05M tokens). However, the specific vocabulary size, BPE configuration, and training alignment for the 5.4 version are not explicitly documented in any public technical report. While behavior can be observed through API usage, the underlying technical specifications remain opaque.

Model

14.0 / 40

Parameter Density

2.0 / 10

The parameter count for GPT-5.4 Pro is officially 'Unknown.' While it is marketed as a 'dense' architecture, there is no verifiable information regarding the total number of parameters or the architectural breakdown (e.g., attention vs. FFN). This lack of disclosure is a major transparency failure for a model positioned as a high-performance enterprise variant.

Training Compute

1.0 / 10

OpenAI has not disclosed any specific details regarding the GPU/TPU hours, hardware specifications, or energy consumption used to train GPT-5.4 Pro. There are no carbon footprint calculations or cost estimates provided in official documentation, relying instead on vague marketing claims of being the 'most efficient' frontier model.

Benchmark Reproducibility

3.0 / 10

While OpenAI provides specific scores for several benchmarks (e.g., 83.3% ARC-AGI-2, 89.3% BrowseComp), the evaluation code, exact prompts, and few-shot examples are not fully public. Some benchmarks, such as GDPval, are described as 'in-house,' making independent third-party verification and reproduction impossible. A -2 penalty was applied due to partial disclosure of contamination issues in predecessor models without clear mitigation details for this version.

Identity Consistency

8.0 / 10

The model consistently identifies itself as GPT-5.4 Pro and maintains clear versioning (5.2, 5.3, 5.4) within the API and ChatGPT interface. It is transparent about its capabilities regarding native computer use and extended context. However, it loses points for occasional confusion in complex agentic workflows where it may second-guess its own identity or output.

Downstream

8.0 / 30

License Clarity

2.0 / 10

The model is released under a proprietary license with no open-source access to weights or source code. While commercial use is permitted via the API, the terms are governed by OpenAI's Terms of Service, which can change without notice. The lack of a clear, stable license for derivative works or weight inspection results in a low score.

Hardware Footprint

2.0 / 10

As a closed-source API-based model, there is no public documentation regarding VRAM requirements or hardware footprints for local deployment. OpenAI provides pricing based on token usage but offers no guidance on quantization tradeoffs or memory scaling for the experimental 1M token window, leaving developers with no information on the actual compute resources required to run the model.

Versioning Drift

4.0 / 10

OpenAI uses semantic-like versioning and provides 'snapshot' features in the API to mitigate drift. However, there have been documented reports of 'stealth nerfs' and performance degradation (e.g., reduced thinking time) following plan changes, which are not reflected in official changelogs. A -3 penalty was applied for behavior changes occurring without corresponding version updates.

GPT-5.4 Pro: Model Specifications and Details