趋近智
参数
27B
上下文长度
128K
模态
Multimodal
架构
Dense
许可证
Gemma Terms of Use
发布日期
12 Mar 2025
训练数据截止日期
Aug 2024
注意力结构
Grouped-Query Attention
隐藏维度大小
4096
层数
46
注意力头
64
键值头
16
激活函数
-
归一化
RMS Normalization
位置嵌入
ROPE
不同量化方法和上下文大小的显存要求
Gemma 3 is a family of lightweight, state-of-the-art models developed by Google DeepMind, designed with research and technology derived from the Gemini models. The Gemma 3 27B variant is a multimodal model engineered to process both textual and image inputs, generating text-based outputs. This model variant is intended for broad application across various generation tasks, including question answering, summarization, and complex reasoning, and supports over 140 languages. Its design focuses on enabling deployment on a range of hardware, from consumer-grade devices like laptops and workstations to specialized cloud infrastructure.
Gemma 3 is a family of open, lightweight models from Google. It introduces multimodal image and text processing, supports over 140 languages, and features extended context windows up to 128K tokens. Models are available in multiple parameter sizes for diverse applications.
排名适用于本地LLM。
排名
#39
| 基准 | 分数 | 排名 |
|---|---|---|
StackEval ProLLM Stack Eval | 0.91 | 5 |
Summarization ProLLM Summarization | 0.8 | 6 |
StackUnseen ProLLM Stack Unseen | 0.37 | 9 |
QA Assistant ProLLM QA Assistant | 0.91 | 10 |
Agentic Coding LiveBench Agentic | 0.03 | 16 |
Coding LiveBench Coding | 0.49 | 19 |
Mathematics LiveBench Mathematics | 0.52 | 19 |
Graduate-Level QA GPQA | 0.42 | 20 |
Reasoning LiveBench Reasoning | 0.34 | 22 |
Data Analysis LiveBench Data Analysis | 0.51 | 22 |
Professional Knowledge MMLU Pro | 0.68 | 26 |
General Knowledge MMLU | 0.42 | 31 |