
AI League — Game Day 18: Flash and Grok Separated by 0.1 t/s in the Closest Speed Finish of the Season
Gemini 3.5 Flash edges Grok 4.3 by just 0.1 t/s (166.5 vs 166.4) in the tightest speed margin of the season. Grok's Day 17 throne lasted one game. Intelligence board locked at 65 for Day 8. Full June 15 stats. #AILeague
Intelligence board: Day 8 of the 65 era
Speed panel: 0.1 t/s photo finish
| Team | Model | Day 17 | Day 18 | Δ | Speed rank |
|---|---|---|---|---|---|
| xAI | Grok 4.3 | 175.9 t/s | 166.4 t/s | −9.5 (−5.4%) | #2 ↓ |
| Gemini 3.5 Flash | 165.3 t/s | 166.5 t/s | +1.2 (+0.7%) | #1 ↑ | |
| Gemini 3.1 Pro Preview | 135.5 t/s | 132.5 t/s | −3.0 (−2.2%) | #3 ↔ | |
| DeepSeek | DeepSeek V4 Pro | 83.8 t/s | 81.5 t/s | −2.3 (−2.7%) | #4 ↔ |
| OpenAI | GPT-5.5 (xhigh) | 59.9 t/s | 63.8 t/s | +3.9 (+6.5%) | #5 ↑ |
| Anthropic | Claude Fable 5 | 63.8 t/s | 63.8 t/s | ↔ | #6 ↔ |
Pricing war: DeepSeek's moat stays intact
| Team | Blended price / 1M tokens | Intelligence score | Price-to-intelligence ratio |
|---|---|---|---|
| DeepSeek V4 Pro | $0.18 | 52 | Best in class |
| Grok 4.3 | $0.64 | 53 | Strong |
| Gemini 3.1 Pro | $1.74 | 57 | Reasonable |
| Gemini 3.5 Flash | $1.31 | 55 | Reasonable |
| GPT-5.5 (xhigh) | $4.35 | 60 | Premium |
| Claude Fable 5 | $7.70 | 65 | Trophy tier |
Challenger watch
Season speed trend: Grok's volatility vs Flash's grind
Game Day 18 verdict
References
- 1Artificial Analysis Intelligence Index — Claude Fable 5
- 2Artificial Analysis — Grok 4.3 speed data
- 3Artificial Analysis — Gemini 3.5 Flash speed data
- 4Artificial Analysis — DeepSeek V4 Pro pricing
- 5Artificial Analysis — GPT-5.5 pricing
- 6Artificial Analysis — Gemini 3.1 Pro Preview pricing
- 7Artificial Analysis — models leaderboard FAQ
Related content
Picked from other channels by content similarity—find new creators to follow.
Article·AIL Player Card #009 — Gemini 3.5 Flash: The Agentic Sprinter
91 OVR. AS. Terminal-Bench 76.2%. SWE-Bench Pro 55.1%. 4× faster than frontier rivals. $1.50/M input. Google National finally fields a player built for the agentic era — faster, cheaper, and better at multi-agent loops than its own flagship. #AILeague
AIL·Player Card
Article·AIL Player Card #013 — Claude Fable 5: The Mythos Striker
95 OVR. SF. SWE-Bench Pro 80.3% — 20 pts clear of GPT-5.5. FrontierCode Diamond 29.3%. Mythos-class. First Anthropic model you can actually buy. Anthropic FC just cleared their best player for the public pitch. #AILeague
AIL·Player Card
Article·Gemini 3.5 Flash:Google 首个在 Agent 任务上超越旗舰 Pro 的 Flash 模型
Google 在 Google I/O 2026 发布 Gemini 3.5 Flash,这是 Gemini 系列中首个在智能体和编码基准上整体超越自家旗舰 Gemini 3.1 Pro 的 Flash 模型,同时保持 4 倍于其他前沿模型的输出速度。Finance Agent v2 领先 3.1 Pro 达 14.9 个百分点,Terminal-Bench 2.1 领先 5.9 个百分点。定价 $1.50/$9.00 / 百万 token,支持 1M 上下文窗口。
三大公司大模型论文
Article·AIL Player Card #016 — Grok 4.3: The Price-Slashing Sprinter
92 OVR. RP. 207 tokens/sec — fastest reasoning model in the league. $1.25/M input, 12× cheaper than GPT-5.5. Always-on reasoning. Video input debut. Live X data. xAI Dynamo filed their most disciplined card yet. #AILeague
AIL·Player Card
Article·Gemini 3.5 Flash:Flash 系列首次在编码和智能体任务上越过旗舰 Pro
Google 在 I/O 2026 上发布 Gemini 3.5 Flash 并同步公开 Model Card。这是 Flash 系列第一次在编码(Terminal-Bench 2.1: 76.2%)和智能体任务(Finance Agent v2: +14.9pp)上越过前代旗舰 Gemini 3.1 Pro,同时推理速度比同级前沿模型快 4 倍,定价比 Pro 便宜 40%。
三大公司大模型论文
Article·还没有赢家:2026 年 5 月大模型竞争全景
Claude Opus 4.7 在编程 Agent 领域领跑,但综合智能指数第一是 GPT-5.5。Gemini 3.1 Pro 以 40% 的价格交付 80% 的性能,Grok 4.3 靠实时知识打差异化。Anthropic 估值飙至 $1.2 万亿背后是 Claude Code 的企业渗透逻辑,而开源模型正在改变「赢」的定义本身。
Twitter AI 长文精选

Add more perspectives or context around this Post.