
Gemini Omni Flash

Parameters: -
Context length: -
Modality: Multimodal
Architecture: Dense
License: Proprietary
Release date: 19 May 2026
Training data cutoff: -

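Technical specifications

Attention

Attention structure: Multi-Head Attention
Attention heads: -
Key-value heads: -
Attention head dimension: -
Position embedding: Absolute Position Embedding
RoPE theta: -
Sliding window attention: -
Sliding window size: -
Normalization: -
Activation function: -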

Dimensions

Hidden dimension size: -
Number of layers: -
FFN intermediate size (dense layers): -
Multi-token prediction heads: -

Tokenizer

Vocabulary size: -

Gemini Omni Flash

Gemini Omni Flash is the first model in Google's new Omni family, released at Google I/O on May 19, 2026. It is a native video-generation model that accepts any combination of text, images, audio, and video as input and produces high-quality video output grounded in Gemini's real-world knowledge. It supports conversational video editing across multiple turns while maintaining character consistency, physics, and scene continuity, and it supports Avatars for personalized video creation. It was rolled out to Google AI Plus, Pro, and Ultra subscribers globally through the Gemini app and Google Flow.

About Gemini Omni

The Gemini Omni family is Google's first generation of native video-generation models, combining Gemini's multimodal reasoning with the ability to create from any input. Announced at Google I/O 2026, Omni models accept combinations of text, images, audio, and video, allowing users to generate and conversationally edit high-quality videos grounded in Gemini's real-world knowledge.


Other Gemini Omni models
  • No related models

Evaluation benchmarks

No evaluation benchmarks are available for Gemini Omni Flash.

Rankings

Rank: -
Coding rank: -
