
AIL Player Card #009 — Gemini 3.5 Flash: The Agentic Sprinter
91 OVR. AS. Terminal-Bench 76.2%. SWE-Bench Pro 55.1%. 4× faster than frontier rivals. $1.50/M input. Google National finally fields a player built for the agentic era — faster, cheaper, and better at multi-agent loops than its own flagship. #AILeague
Research Brief
The scouting report
Stat card
| Dimension | Score | What it measures |
|---|---|---|
| OVR | 91 | Weighted composite |
| RZN (Reasoning) | 84 | GPQA Diamond 82.8%, ARC-AGI-2 72.1%, HLE 40.2% |
| CRE (Creativity) | 82 | CharXiv Reasoning 84.2%, MMMU-Pro 83.6% |
| SPD (Speed) | 97 | 289 tokens/sec, 4× faster than frontier rivals |
| MLT (Multimodal) | 88 | Native text / image / video / audio / PDF input |
| SAF (Safety) | 79 | Strengthened CBRN safeguards; better-calibrated refusals |
| VAL (Value) | 88 | $1.50 input / $9.00 output per million tokens; 40% cheaper than 3.1 Pro |
Season highlights

Where it falls short
Head-to-head: Agentic Sprinter class
| Model | Team | OVR | Terminal-Bench | SWE-Bench | Speed tier | $/M input |
|---|---|---|---|---|---|---|
| Gemini 3.5 Flash | Google National | 91 | 76.2% | 55.1% | 289 tok/s | $1.50 |
| GPT-5.5 | OpenAI United | 93 | 82.7% | — | ~70 tok/s | $5.00 |
| Gemini 3.1 Pro | Google National | — | 70.3% | 54.2% | ~70 tok/s | $2.50 |
| Gemini 3.5 Flash (12× Opt.) | Google National | — | est. same | est. same | ~3,500 tok/s | — |
Coach's verdict
Related content
Picked from other channels by content similarity—find new creators to follow.
Article·Gemini 3.5 Flash:Google 首个在 Agent 任务上超越旗舰 Pro 的 Flash 模型
Google 在 Google I/O 2026 发布 Gemini 3.5 Flash,这是 Gemini 系列中首个在智能体和编码基准上整体超越自家旗舰 Gemini 3.1 Pro 的 Flash 模型,同时保持 4 倍于其他前沿模型的输出速度。Finance Agent v2 领先 3.1 Pro 达 14.9 个百分点,Terminal-Bench 2.1 领先 5.9 个百分点。定价 $1.50/$9.00 / 百万 token,支持 1M 上下文窗口。
三大公司大模型论文
Article·Gemini 3.5 Flash:Flash 系列首次在编码和智能体任务上越过旗舰 Pro
Google 在 I/O 2026 上发布 Gemini 3.5 Flash 并同步公开 Model Card。这是 Flash 系列第一次在编码(Terminal-Bench 2.1: 76.2%)和智能体任务(Finance Agent v2: +14.9pp)上越过前代旗舰 Gemini 3.1 Pro,同时推理速度比同级前沿模型快 4 倍,定价比 Pro 便宜 40%。
三大公司大模型论文
Article·AI Agent 生态速报 | 2026-05-20:Google I/O 落地,Gemini 3.5 Flash 重新定义 Agent 开发基线
Google I/O 2026 主题演讲落地:Gemini 3.5 Flash(4× 速度、<0.5× 成本)+ Antigravity 2.0(Agent 原生 IDE)+ Gemini Spark(24/7 个人 Agent)三件套,构成一条与 LangGraph/AutoGen 等开源框架完全不同的托管式 Agent 开发路径。Search 信息代理、Universal Commerce Protocol 购物 Agent、WebMCP Chrome 接入层同步上线,Google 正将搜索引擎重构为 AI Agent 编排层。
Agent 生态周报
Article·Google I/O 炸场,Gemini 3.5 今日全量,搜索 25 年最大改版——5 月 19 日 AI 动态
Google I/O 2026 主旨演讲发布 Gemini 3.5 Flash(今日全量)、个人 Agent Spark、Android XR 眼镜(秋季)及搜索 25 年来最大改版;Anthropic 宣布与 KPMG 全公司 26 万人部署 Claude、Managed Agents 新增 MCP Tunnels 和自托管沙箱、DOD 黑名单案开庭;xAI 发布 Grok Skills;OpenAI 加入 SynthID 水印体系,Musk 诉讼最终落幕。
AI 产品日报
Article·AI League — Game Day 4: OpenAI Goes Full Sprint, Google Jumps a Gear
GPT-5.5 surges +16.5% in speed. Gemini 3.1 Pro jumps +25% to 143 t/s. Claude holds #1 (AI Index 61). Qwen3.7 Max enters at 57 / 189 t/s. Full June 1 stats. #AILeague
AIL·Stats Board
Article·AI League — Game Day 2: Claude Holds Court, DeepSeek Clock Ticking
Claude Opus 4.8 holds AI Index #1 (61). DeepSeek promo expires in 24h. Grok posts 158 t/s. Full May 30 post-game stats panel. #AILeague
AIL·Stats Board

Add more perspectives or context around this Post.