Gemma 3 27B

Parameters: 27B
Context length: 128K
Modality: Multimodal
Architecture: Dense
License: Gemma Terms of Use
Release date: 12 Mar 2025
Knowledge cutoff: Aug 2024

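Technical Specifications

Attention structure: Grouped-Query Attention
Hidden dimension size: 4096
Number of layers: 46
Attention heads: 64
Key-value heads: 16
Activation function: -
Normalization: RMS Normalization
Position embedding: RoPE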

System Requirements

VRAM requirements depend on the quantization method and the context size.
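As a rough illustration of how quantization and context size drive memory use, the sketch below estimates weight memory from the parameter count and bit width, and key/value cache memory from the layer count, key-value head count, and context length. It is a back-of-the-envelope approximation, not the page's calculator: it ignores activations, framework overhead, and any architecture-specific savings. The per-head dimension `HEAD_DIM` is not listed in the specifications above and is a hypothetical placeholder, as are the function names.

```python
def weight_memory_gb(num_params: float, bits_per_weight: float) -> float:
    """Approximate memory for the weights alone, in GB (1 GB = 1e9 bytes)."""
    return num_params * bits_per_weight / 8 / 1e9


def kv_cache_gb(num_layers: int, num_kv_heads: int, head_dim: int,
                context_tokens: int, bytes_per_value: int = 2) -> float:
    """Approximate key/value cache size for one sequence, in GB.

    The factor 2 covers one key tensor plus one value tensor per layer.
    """
    values = 2 * num_layers * num_kv_heads * head_dim * context_tokens
    return values * bytes_per_value / 1e9


NUM_PARAMS = 27e9    # "27B" from the spec sheet
NUM_LAYERS = 46      # from the spec sheet
NUM_KV_HEADS = 16    # from the spec sheet
HEAD_DIM = 128       # ASSUMPTION: not listed in the spec sheet

# Weight memory at common bit widths: ~54.0, ~27.0, ~13.5 GB.
for label, bits in [("16-bit", 16), ("8-bit", 8), ("4-bit", 4)]:
    print(f"{label} weights: ~{weight_memory_gb(NUM_PARAMS, bits):.1f} GB")

# KV cache at a short context and at the full 128K context (16-bit cache).
for tokens in (1024, 128 * 1024):
    size = kv_cache_gb(NUM_LAYERS, NUM_KV_HEADS, HEAD_DIM, tokens)
    print(f"KV cache at {tokens} tokens: ~{size:.2f} GB")
```

The KV cache term scales with the number of key-value heads rather than the number of attention heads, which is why Grouped-Query Attention reduces memory at long context lengths.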

Gemma 3 27B

Gemma 3 is a family of lightweight, state-of-the-art models developed by Google DeepMind, built on research and technology from the Gemini models. The Gemma 3 27B variant is multimodal: it accepts text and image inputs and generates text outputs. It is intended for a broad range of generation tasks, including question answering, summarization, and complex reasoning, and supports over 140 languages. It is designed for deployment on a range of hardware, from consumer-grade laptops and workstations to specialized cloud infrastructure.

About Gemma 3

Gemma 3 is a family of open, lightweight models from Google. It introduces multimodal image and text processing, supports over 140 languages, and features extended context windows up to 128K tokens. Models are available in multiple parameter sizes for diverse applications.


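Evaluation Benchmarks

Rankings apply to local LLMs.

Overall rank: #32
Coding rank: #23

Benchmark scores and ranks (several benchmark names are missing in the source and are left unnamed):

(unnamed benchmark): score 0.8, rank 4
(unnamed benchmark): score 0.91, rank 5
(unnamed benchmark): score 0.37, rank 7
(unnamed benchmark): score 0.91, rank 9
Professional Knowledge (MMLU Pro): score 0.68, rank 14
Agentic Coding (LiveBench Agentic): score 0.03, rank 17
(unnamed benchmark): score 0.52, rank 17
(unnamed benchmark): score 0.49, rank 18
(unnamed benchmark): score 0.34, rank 20
(unnamed benchmark): score 0.51, rank 20
Graduate-Level QA (GPQA): score 0.42, rank 21
General Knowledge (MMLU): score 0.42, rank 28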
