ApX 标志ApX 标志

趋近智

Gemini 2.5 Pro Max Thinking

参数

-

上下文长度

2,097.152K

模态

Multimodal

架构

Dense

许可证

Proprietary

发布日期

25 Sept 2025

训练数据截止日期

Jan 2025

技术规格

注意力结构

Multi-Head Attention

隐藏维度大小

-

层数

-

注意力头

-

键值头

-

激活函数

SwigLU

归一化

RMS Normalization

位置嵌入

Absolute Position Embedding

Gemini 2.5 Pro Max Thinking

Gemini 2.5 Pro Max Thinking is a sophisticated multimodal model engineered for deep analytical reasoning and complex problem-solving. It represents an evolution in Google's model lineup by integrating a transparent thinking process that generates extended internal chains of thought before delivering a final response. This architectural design is specifically optimized for high-stakes tasks in software engineering, advanced mathematics, and scientific research where multi-step logical consistency is required. By exposing its reasoning path, the model provides developers with a mechanism for more effective debugging and steering of autonomous agents and automated workflows.

The model utilizes a Mixture-of-Experts (MoE) architecture, which selectively activates specialized sub-networks during inference to maintain computational efficiency while scaling intelligence. It supports a natively multimodal input space, allowing it to ingest and reason over diverse data types including text, high-resolution imagery, audio streams, and video files within a single unified context. This native multimodality ensures that the model can maintain semantic coherence across different information formats, making it highly effective for comprehensive dataset analysis and cross-modal reasoning.

A defining feature of the model is its massive context window, which supports up to 2,097,152 tokens, enabling the processing of entire codebases, lengthy technical manuals, or hours of video content. To manage the trade-off between reasoning depth and execution speed, the model supports a configurable thinking budget, allowing developers to allocate specific token limits to the reasoning phase. This control mechanism is exposed through the Gemini API and Vertex AI, providing a flexible framework for tailoring model behavior to specific operational requirements and latency constraints.

关于 Gemini 2.5

Google's advanced multimodal models with native understanding of text, images, audio, and video. Features massive context windows up to 2.1M tokens, max thinking modes for complex reasoning, and optimized variants for different performance/cost tradeoffs. Includes Pro, Flash, and Flash Lite variants with configurable thinking capabilities for transparent reasoning.


其他 Gemini 2.5 模型

评估基准

排名

#37

基准分数排名

0.83

6

0.76

12

0.71

14

0.71

17

Agentic Coding

LiveBench Agentic

0.33

25

0.68

29

排名

排名

#37

编程排名

#37

模型透明度

总分

D+

41 / 100