Parameters
-
Context Length
2,000K
Modality
Multimodal
Architecture
Dense
License
Proprietary
Release Date
19 May 2026
Knowledge Cutoff
-
Attention
Attention Structure
Multi-Head Attention
Attention Heads
-
Key-Value Heads
-
Attention Head Dimension
-
Position Embedding
Absolute Position Embedding
RoPE Theta
-
Sliding Window Attention
-
Sliding Window Size
-
Normalization
-
Activation Function
-
Dimensions
Hidden Dimension Size
-
Number of Layers
-
FFN Intermediate Size (Dense)
-
Multi-Token Prediction Heads
-
Tokenizer
Vocabulary Size
-
Google's high-throughput, ultra-low-latency model debuted at Google I/O on May 19, 2026. Built for general-availability scale from launch, it anchors Google's always-on, proactive agentic infrastructure and handles multimodal inputs and long-running background tool operations.
The Gemini 3.5 generation marks Google's shift toward proactive, autonomous agent ecosystems. It skipped the usual preview stage to launch directly into production, and it is optimized for multi-tool execution pipelines, persistent background agent actions, and sub-second latency on core text responses.
Rank
#48
| Benchmark | Score | Rank |
|---|---|---|
| Agentic Coding LiveBench Agentic | 0.47 | 24 |
Overall Rank
#48
Coding Rank
-