
AIL Player Card #012 — Gemini Omni Flash: The World Model
92 OVR. WM. Conversational video editing. Any input → any output. Physics-grounded world knowledge baked in. Google National just fielded a position nobody in this league has played before. #AILeague
92 OVR · WM · Google National Conversational video editing. Any input → any output. Physics-grounded world knowledge baked in. Google National just fielded a player with a position nobody in this league has played before. #AILeague

The scouting report
The 92 OVR: how the card scores
| Dimension | Score | Basis |
|---|---|---|
| RZN — Reasoning | 86 | World-knowledge physics simulation; biology + history + science grounding in outputs |
| CRE — Creativity | 97 | Conversational multi-turn video editing; any-to-any multimodal generation; highest creative ceiling in the current league roster |
| SPD — Speed | 88 | Flash-tier architecture; faster than Omni Pro variant |
| MLT — Multimodal | 99 | Text + image + video + audio as simultaneous inputs → video output; no other league player has this input breadth |
| SAF — Safety | 82 | SynthID watermarking on all outputs4; red-team evaluations completed; avatar feature restricted to users' own likeness; deepfake policy enforcement ongoing |
| VAL — Value | 74 | Subscription-gated (Google AI Plus/Pro/Ultra); API pricing TBD; high quota consumption in early testing; no open pricing for developers yet |
What the WM position actually means
| Capability | Gemini Omni Flash | Seedance 2.0 |
|---|---|---|
| Motion realism | ★★★★☆ | ★★★★★ |
| Prompt adherence | ★★★★★ | ★★★★☆ |
| Cross-shot character consistency | ★★★☆☆ | ★★★★☆ |
| Cinematic quality | ★★★☆☆ | ★★★★★ |
| Conversational video editing | ★★★★★ | ★☆☆☆☆ |
| World-knowledge grounding | ★★★★★ | ★★★☆☆ |
Season highlights
Head-to-head: WM class comparison
| Model | OVR | RZN | CRE | SPD | MLT | SAF | VAL | Position |
|---|---|---|---|---|---|---|---|---|
| Gemini Omni Flash | 92 | 86 | 97 | 88 | 99 | 82 | 74 | WM |
| Gemini 2.5 Pro (#003) | 93 | 92 | 88 | 76 | 96 | 85 | 81 | MW |
| Gemini 3.5 Flash (#009) | 91 | 88 | 82 | 97 | 90 | 88 | 90 | AS |
The broadcast take
Related content
Picked from other channels by content similarity—find new creators to follow.
Article·Gemini 3.5 Flash:Google 首个在 Agent 任务上超越旗舰 Pro 的 Flash 模型
Google 在 Google I/O 2026 发布 Gemini 3.5 Flash,这是 Gemini 系列中首个在智能体和编码基准上整体超越自家旗舰 Gemini 3.1 Pro 的 Flash 模型,同时保持 4 倍于其他前沿模型的输出速度。Finance Agent v2 领先 3.1 Pro 达 14.9 个百分点,Terminal-Bench 2.1 领先 5.9 个百分点。定价 $1.50/$9.00 / 百万 token,支持 1M 上下文窗口。
三大公司大模型论文
Article·Gemini 3.5 Flash:Flash 系列首次在编码和智能体任务上越过旗舰 Pro
Google 在 I/O 2026 上发布 Gemini 3.5 Flash 并同步公开 Model Card。这是 Flash 系列第一次在编码(Terminal-Bench 2.1: 76.2%)和智能体任务(Finance Agent v2: +14.9pp)上越过前代旗舰 Gemini 3.1 Pro,同时推理速度比同级前沿模型快 4 倍,定价比 Pro 便宜 40%。
三大公司大模型论文
Article·AI video's Figma moment
Google unveiled Gemini Omni at I/O 2026 — a unified model that generates text, images, and video from a single interface with conversational editing. The competitive question in AI video has shifted from "which model makes the best first clip" to "which workflow lets a team iterate fastest." Two videos already consume 86% of the daily AI Pro quota, no API is confirmed yet, and a new Tsinghua benchmark shows logical reasoning remains the hardest unsolved problem for every current video model.
Tech Trend Translator: The PM Brief
Article·Google I/O 炸场,Gemini 3.5 今日全量,搜索 25 年最大改版——5 月 19 日 AI 动态
Google I/O 2026 主旨演讲发布 Gemini 3.5 Flash(今日全量)、个人 Agent Spark、Android XR 眼镜(秋季)及搜索 25 年来最大改版;Anthropic 宣布与 KPMG 全公司 26 万人部署 Claude、Managed Agents 新增 MCP Tunnels 和自托管沙箱、DOD 黑名单案开庭;xAI 发布 Grok Skills;OpenAI 加入 SynthID 水印体系,Musk 诉讼最终落幕。
AI 产品日报
Article·AI Agent 生态速报 | 2026-05-20:Google I/O 落地,Gemini 3.5 Flash 重新定义 Agent 开发基线
Google I/O 2026 主题演讲落地:Gemini 3.5 Flash(4× 速度、<0.5× 成本)+ Antigravity 2.0(Agent 原生 IDE)+ Gemini Spark(24/7 个人 Agent)三件套,构成一条与 LangGraph/AutoGen 等开源框架完全不同的托管式 Agent 开发路径。Search 信息代理、Universal Commerce Protocol 购物 Agent、WebMCP Chrome 接入层同步上线,Google 正将搜索引擎重构为 AI Agent 编排层。
Agent 生态周报
Image post·5条科技热门 Day 025 | Gemini 3.5 Flash · NVIDIA三模式LLM · ByteDance Lance开源
Day 025 精选 5 条跨源最高热度内容:Google I/O 2026 发布 Gemini 3.5 Flash(速度 4×、成本减半、Gemini Spark 个人智能体上线);NVIDIA 发布 Nemotron-Labs-Diffusion 三模式语言模型(AR+扩散+自投机,GB200 单用户 850 tok/sec,5.9× 提速);ByteDance 开源 Lance 3B 统一多模态模型(图像+视频全任务);Hugging Face 工程师复活 PapersWithCode;Meta Q1 赚 $56B 仍裁员 8000 人付 AI 账单。
5条科技热门内容

Add more perspectives or context around this Post.