GPT-5.1 No Thinking

Parameters

-

Context Length

400K

Modality

Text

Architecture

Dense

License

Proprietary

Release Date

13 Nov 2025

Knowledge Cutoff

-

Technical Specifications

Attention Structure

Multi-Head Attention

Hidden Dimension Size

-

Number of Layers

-

Attention Heads

-

Key-Value Heads

-

Activation Function

-

Normalization

-

Position Embedding

Absolute Position Embedding

GPT-5.1 No Thinking

GPT-5.1 No Thinking is GPT-5.1 with thinking turned off to minimize response latency. It maintains good coding performance (77.48 on LiveBench Coding) and suits interactive applications, real-time assistance, and straightforward tasks that do not need extended reasoning, where it provides quick, direct responses.

About GPT-5

GPT-5 is OpenAI's latest generation of language models, featuring advanced reasoning capabilities, context windows of up to 400K tokens, and specialized variants for coding, general intelligence, and efficiency. The series introduces improved thinking modes, stronger benchmark performance, and variants optimized for different use cases, from high-capacity Pro models to efficient Nano models. It also offers native multimodal understanding, enhanced mathematical reasoning, and state-of-the-art coding abilities through Codex variants.
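
To make the 400K-token context figure concrete, here is a minimal sketch of a pre-flight check that estimates whether a prompt fits in the window. It rests on stated assumptions: "400K" is read as 400,000 tokens, token counts are approximated at about 4 characters per token (a rough heuristic for English text, not the model's actual tokenizer), and the names `estimate_tokens`, `fits_in_context`, and `reserved_for_output` are hypothetical helpers, not part of any OpenAI API.

```python
# Assumption: "400K" on the spec card is read as 400,000 tokens.
CONTEXT_WINDOW_TOKENS = 400_000
# Assumption: ~4 characters per token, a rough heuristic, not a real tokenizer.
CHARS_PER_TOKEN = 4


def estimate_tokens(text: str) -> int:
    """Crude token estimate: character count divided by CHARS_PER_TOKEN, rounded up."""
    return -(-len(text) // CHARS_PER_TOKEN)


def fits_in_context(prompt: str, reserved_for_output: int = 8_000) -> bool:
    """Return True if the estimated prompt size still leaves room for the reply."""
    return estimate_tokens(prompt) + reserved_for_output <= CONTEXT_WINDOW_TOKENS
```

For real budgeting, a proper tokenizer would replace the character heuristic; the estimate here is only meant to flag prompts that are clearly too large before a request is sent.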


Evaluation Benchmarks

Rank

#67

Benchmark / Score / Rank

0.77

7

Agentic Coding

LiveBench Agentic

0.28

30

Graduate-Level QA

GPQA

0.88

30

0.64

42

0.45

56

0.27

62

Rankings

Overall Rank

#67

Coding Rank

#22
