趋近智
Parameters
-
Context Length
-
Modality
Multimodal
Architecture
Dense
License
Proprietary
Release Date
19 May 2026
Training Data Cutoff
-
Attention
Attention Structure
Multi-Head Attention
Attention Heads
-
Key-Value Heads
-
Attention Head Dimension
-
Position Embedding
Absolute Position Embedding
RoPE Theta
-
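Sliding Window Attention
-
Sliding Window Size
-
Normalization
-
Activation Function
-
Dimensions
Hidden Dimension Size
-
Number of Layers
-
FFN Intermediate Size (Dense Layers)
-
Multi-Token Prediction Heads
-
Tokenizer
Vocabulary Size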
-
Gemini Omni Flash is the first model in Google's new Omni family, released at Google I/O on May 19, 2026. It is a native video-generation model that accepts any combination of text, images, audio, and video as input and produces high-quality video output grounded in Gemini's real-world knowledge. It enables conversational video editing across multiple turns while maintaining character consistency, physics, and scene continuity, and it supports Avatars for personalized video creation. It rolled out to Google AI Plus, Pro, and Ultra subscribers globally through the Gemini app and Google Flow.
The Gemini Omni family is Google's first generation of native video-generation models, combining Gemini's multimodal reasoning with the ability to create from any input. Announced at Google I/O 2026, Omni models accept combinations of text, images, audio, and video, allowing users to generate and conversationally edit high-quality videos grounded in Gemini's real-world knowledge.
No evaluation benchmarks are available for Gemini Omni Flash.
Rank
-
Coding Rank
-