ApX logoApX logo

Gemini 3.5 Flash

Parameters

-

Context Length

2,000K

Modality

Multimodal

Architecture

Dense

License

Proprietary

Release Date

19 May 2026

Knowledge Cutoff

-

Technical Specifications

Attention

Attention Structure

Multi-Head Attention

Attention Heads

-

Key-Value Heads

-

Attention Head Dimension

-

Position Embedding

Absolute Position Embedding

RoPE Theta

-

Sliding Window Attention

-

Sliding Window Size

-

Normalization

-

Activation Function

-

Dimensions

Hidden Dimension Size

-

Number of Layers

-

FFN Intermediate Size (Dense)

-

Multi-Token Prediction Heads

-

Tokenizer

Vocabulary Size

-

Gemini 3.5 Flash

Google's high-throughput, ultra-low-latency model debuted at Google I/O on May 19, 2026. Built directly for general availability scaling, it anchors Google's 24/7 proactive agentic infrastructure, dealing seamlessly with multimodal inputs and sprawling background tool operations.

About Gemini 3.5

The Gemini 3.5 generation represents Google's transition into highly proactive, autonomous agent ecosystems. Skipping standard preview staging to target instant production scalability, it features highly optimized structural modalities tailored for multi-tool execution pipelines, persistent background agent actions, and sub-second core text response latency.


Other Gemini 3.5 Models
  • No related models available

Evaluation Benchmarks

Rank

#48

BenchmarkScoreRank

Agentic Coding

LiveBench Agentic

0.47

24

Rankings

Overall Rank

#48

Coding Rank

-