ApX 标志ApX 标志

趋近智

Gemini 2.5 Flash Lite Max Thinking (2025-06-17)

参数

-

上下文长度

1,048.576K

模态

Multimodal

架构

Dense

许可证

Proprietary

发布日期

17 Jun 2025

训练数据截止日期

Dec 2024

技术规格

注意力结构

Multi-Head Attention

隐藏维度大小

-

层数

-

注意力头

-

键值头

-

激活函数

-

归一化

-

位置嵌入

Absolute Position Embedding

Gemini 2.5 Flash Lite Max Thinking (2025-06-17)

Gemini 2.5 Flash Lite Max Thinking represents a specialized configuration of the lightweight Flash Lite variant within the Gemini 2.5 family. This model is engineered to balance extreme cost efficiency with the advanced reasoning capabilities inherent in the 2.5 architecture. By utilizing a configurable 'thinking' budget, the model can engage in multi-pass reasoning to resolve complex logical constraints before generating a final response. This architectural flexibility allows developers to adjust the computational intensity based on the specific requirements of the task, making it suitable for high-volume pipelines where transparency in logic is necessary but operational costs must remain low.

Technically, the model is built upon a dense transformer architecture optimized for low-latency inference and high throughput. It supports a massive context window of one million tokens, enabling the ingestion and processing of extensive datasets, such as entire codebases, lengthy technical manuals, or hours of audio and video content. The multimodal nature of the model allows for native processing of diverse data types including text, images, and audio, without the need for separate encoder-decoder systems. This unified approach simplifies the development of applications that require cross-modal reasoning, such as automated video summarization or document analysis across varying formats.

In production environments, Gemini 2.5 Flash Lite Max Thinking is frequently deployed for tasks that demand structured output and reliability at scale. Its integration with Google's native toolset, including Grounding with Google Search and code execution, provides a framework for building agentic workflows. These workflows benefit from the model's ability to verify its internal reasoning against external data sources. The model is particularly effective for high-throughput classification, large-scale translation, and intelligent routing where traditional lightweight models might fail to capture the required logical depth.

关于 Gemini 2.5

Google's advanced multimodal models with native understanding of text, images, audio, and video. Features massive context windows up to 2.1M tokens, max thinking modes for complex reasoning, and optimized variants for different performance/cost tradeoffs. Includes Pro, Flash, and Flash Lite variants with configurable thinking capabilities for transparent reasoning.


其他 Gemini 2.5 模型

评估基准

排名

#86

基准分数排名

0.43

29

0.67

30

0.66

35

0.61

38

Agentic Coding

LiveBench Agentic

0.05

40

排名

排名

#86

编程排名

#74

模型透明度

总分

C-

45 / 100