ApX logo

Gemma 3 27B

Parameters

27B

Context Length

128K

Modality

Multimodal

Architecture

Dense

License

Gemma Terms of Use

Release Date

12 Mar 2025

Knowledge Cutoff

Aug 2024

Technical Specifications

Attention Structure

Grouped-Query Attention

Hidden Dimension Size

4096

Number of Layers

46

Attention Heads

64

Key-Value Heads

16

Activation Function

-

Normalization

RMS Normalization

Position Embedding

ROPE

System Requirements

VRAM requirements for different quantization methods and context sizes

Gemma 3 27B

Gemma 3 is a family of lightweight, state-of-the-art models developed by Google DeepMind, designed with research and technology derived from the Gemini models. The Gemma 3 27B variant is a multimodal model engineered to process both textual and image inputs, generating text-based outputs. This model variant is intended for broad application across various generation tasks, including question answering, summarization, and complex reasoning, and supports over 140 languages. Its design focuses on enabling deployment on a range of hardware, from consumer-grade devices like laptops and workstations to specialized cloud infrastructure.

About Gemma 3

Gemma 3 is a family of open, lightweight models from Google. It introduces multimodal image and text processing, supports over 140 languages, and features extended context windows up to 128K tokens. Models are available in multiple parameter sizes for diverse applications.


Other Gemma 3 Models

Evaluation Benchmarks

Ranking is for Local LLMs.

Rank

#32

BenchmarkScoreRank

0.8

4

0.91

5

0.37

7

0.91

9

Professional Knowledge

MMLU Pro

0.68

14

Agentic Coding

LiveBench Agentic

0.03

17

0.52

17

0.49

18

0.34

20

0.51

20

Graduate-Level QA

GPQA

0.42

21

General Knowledge

MMLU

0.42

28

Rankings

Overall Rank

#32

Coding Rank

#23

GPU Requirements

Full Calculator

Choose the quantization method for model weights

Context Size: 1,024 tokens

1k
63k
125k

VRAM Required:

Recommended GPUs

Gemma 3 27B: Specifications and GPU VRAM Requirements