ApX logoApX logo

GPT-5.4 nano

Parameters

-

Context Length

-

Modality

Text

Architecture

Dense

License

Proprietary

Release Date

17 Mar 2026

Knowledge Cutoff

-

Technical Specifications

Attention

Attention Structure

Multi-Head Attention

Attention Heads

-

Key-Value Heads

-

Attention Head Dimension

-

Position Embedding

Absolute Position Embedding

RoPE Theta

-

Sliding Window Attention

-

Sliding Window Size

-

Normalization

-

Activation Function

-

Dimensions

Hidden Dimension Size

-

Number of Layers

-

FFN Intermediate Size (Dense)

-

Multi-Token Prediction Heads

-

Tokenizer

Vocabulary Size

-

GPT-5.4 nano

GPT-5.4 nano is OpenAI's smallest and cheapest GPT-5.4 variant, optimized for tasks where speed and cost matter most. A significant upgrade over GPT-5 nano, recommended for classification, data extraction, ranking, and coding subagents handling simpler supporting tasks. Pricing: $0.20/M input, $1.25/M output. Available via API as gpt-5.4-nano. Released March 17, 2026.

About GPT-5.4

GPT-5.4 is OpenAI's most capable and efficient frontier model for professional work, combining the industry-leading coding capabilities of GPT-5.3-Codex with major advances in reasoning, computer use, and agentic workflows. It introduces native computer-use capabilities, tool search for large tool ecosystems, substantially improved knowledge work (spreadsheets, presentations, documents), and is OpenAI's most factual and token-efficient reasoning model. Supports up to 1M context tokens in Codex. Released March 5, 2026.


Other GPT-5.4 Models

Evaluation Benchmarks

Rank

#8

BenchmarkScoreRank

0.91

4

0.81

13

Rankings

Overall Rank

#8

Coding Rank

-

Model Integrity

Total Score

F

31 / 100

GPT-5.4 nano Model Integrity Report

Total Score

31

/ 100

F

Audit Note

GPT-5.4 nano is a highly opaque model that prioritizes commercial utility over technical transparency. While it provides consistent self-identification and clear API versioning, it fails to disclose critical information regarding its architecture, training data, and compute resources. The model's performance claims are currently unverifiable due to the lack of public evaluation code and data provenance.

Upstream

9.0 / 30

Architectural Provenance

3.0 / 10

OpenAI identifies GPT-5.4 nano as a 'unified frontier model' variant that integrates capabilities from previous iterations like GPT-5.3-Codex. However, it is described only as a 'dense' architecture with no public documentation regarding its specific training methodology, architectural modifications, or pretraining procedures. The relationship between the 'nano' variant and the larger GPT-5.4 models is not technically detailed beyond marketing claims of efficiency.

Dataset Composition

2.0 / 10

Data provenance is highly opaque. While a knowledge cutoff of August 31, 2025, is stated, there is no public disclosure of dataset sources, composition percentages (e.g., web vs. code), or filtering methodologies. Claims of being 'factually grounded' and '33% less likely to contain false claims' are presented without verifiable data provenance or access to training samples.

Tokenizer Integrity

4.0 / 10

The model supports a 400k context window and users can observe tokenization behavior via the API, but the specific tokenizer vocabulary size and underlying BPE configuration for the 5.4 series are not explicitly documented in a public technical report. There is no public repository for the tokenizer weights or training alignment data.

Model

12.0 / 40

Parameter Density

1.0 / 10

The parameter count for GPT-5.4 nano is officially 'Unknown'. While it is described as a 'dense' model, OpenAI has not disclosed the total or active parameter counts, nor provided any architectural breakdown of attention vs. feed-forward networks.

Training Compute

0.0 / 10

OpenAI has not disclosed any information regarding GPU/TPU hours, hardware specifications, training duration, or carbon footprint for the GPT-5.4 nano variant. The company explicitly cites competitive reasons for withholding compute metrics.

Benchmark Reproducibility

2.0 / 10

While OpenAI provides specific scores for benchmarks like SWE-Bench Pro (52.4%) and OSWorld-Verified (39.0%), the evaluation code, exact prompts, and few-shot examples used to achieve these results are not public. Third-party analysis indicates that these scores are difficult to replicate without access to the internal evaluation harness.

Identity Consistency

9.0 / 10

The model consistently identifies itself as GPT-5.4 nano via API responses and system prompts. It maintains a clear versioning identity within the OpenAI ecosystem and accurately reflects its role as a low-latency, cost-optimized variant of the GPT-5.4 family.

Downstream

10.0 / 30

License Clarity

2.0 / 10

The model is released under a restrictive 'Proprietary' license. While the Terms of Service for the API are public, they include vaguely defined clauses regarding 'abusive usage' and 'programmatic extraction.' There is no open-source license for weights or code, and derivative works are strictly prohibited.

Hardware Footprint

3.0 / 10

As an API-only model, there is no documentation for local VRAM requirements or quantization tradeoffs. While OpenAI provides pricing and context window limits (400k), it offers no guidance for developers on the hardware resources required to run equivalent-scale models or the impact of quantization on this specific architecture.

Versioning Drift

5.0 / 10

OpenAI uses date-based snapshots for its models, allowing developers to lock in specific versions. However, the changelog lacks technical depth regarding what specifically changes between snapshots, and there is a history of 'silent' updates to safety filters and alignment layers that affect model behavior without a version increment.

GPT-5.4 nano: Model Specifications and Details