ApX 标志ApX 标志

趋近智

GPT-5.1 High

参数

-

上下文长度

400K

模态

Text

架构

Dense

许可证

Proprietary

发布日期

13 Nov 2025

训练数据截止日期

Sep 2024

技术规格

注意力结构

Multi-Head Attention

隐藏维度大小

-

层数

-

注意力头

-

键值头

-

激活函数

-

归一化

-

位置嵌入

Absolute Position Embedding

GPT-5.1 High

GPT-5.1 High is a specialized reasoning variant within OpenAI's GPT-5 model family, engineered to provide high-effort cognitive processing for complex analytical tasks. The model is built upon a modular architecture that integrates a dense language backbone with sparse Mixture-of-Experts (MoE) layers and a dedicated reasoning core. This design enables the system to implement adaptive reasoning, where it dynamically allocates computational budget by extending its internal thinking time for multi-step problems such as advanced mathematical proofs and architectural code refactors. Unlike standard models that produce immediate output, GPT-5.1 High generates hidden reasoning tokens to evaluate multiple solution paths before committing to a final response.

Technically, the model employs a modified transformer architecture with Multi-Head Attention (MHA) and utilizes absolute position embeddings to maintain structural coherence across its expanded context. A significant innovation in the GPT-5.1 series is the integration of a 'compaction' mechanism for context management, which prunes and summarizes historical tokens when nearing limits to maintain long-term session coherence without full context reset. The architecture also incorporates explicit planning hooks and safety guardrails that operate both pre- and post-generation, ensuring that complex reasoning chains remain aligned with intended constraints while minimizing latency for the user.

The model is primarily intended for technical and agentic workflows where deep analysis is prioritized over raw speed. Its use cases include autonomous debugging, long-running coding projects involving multiple files, and sophisticated data synthesis. By exposing 'reasoning effort' controls to developers, GPT-5.1 High allows for granular tuning of the model's persistence on difficult queries. This makes it particularly effective for professionals building reliable agentic systems that require consistent, high-fidelity outputs across varied domains including engineering, legal analysis, and scientific research.

关于 GPT-5

OpenAI's latest generation of language models featuring advanced reasoning capabilities, extended context windows up to 400K tokens, and specialized variants for coding, general intelligence, and efficiency. GPT-5 series introduces improved thinking modes, superior performance across benchmarks, and variants optimized for different use cases from high-capacity Pro models to efficient Nano models. Features native multimodal understanding, enhanced mathematical reasoning, and state-of-the-art coding abilities through Codex variants.


其他 GPT-5 模型

评估基准

排名

#10

基准分数排名

0.99

🥇

1

0.87

4

Agentic Coding

LiveBench Agentic

0.53

5

0.84

5

Web Development

WebDev Arena

1457

5

Graduate-Level QA

GPQA

0.88

5

0.79

6

0.72

12

0.72

21

排名

排名

#10

编程排名

#11