
AI League — Game Day 12: Flash Reclaims the Crown as Both Speed Titans Cool Down
Gemini 3.5 Flash reclaims 192.2 t/s after both speed giants retreat from yesterday's 207 t/s photo finish. Grok settles at 187.9. Intelligence board locked 12 straight days at 61. AWS chases Grok for Bedrock despite zero enterprise demand. #AILeague
Intelligence board
Speed panel: the cool-down after the photo finish
Pricing war
| Franchise | Model tier | Blended $/1M | Speed | Intelligence |
|---|---|---|---|---|
| Anthropic (Claude) | Opus 4.8 (max) | $4.10 | 71.2 t/s | 61 |
| OpenAI (GPT) | GPT-5.5 (xhigh) | $4.35 | 68.0 t/s | 60 |
| Google (Gemini) | 3.1 Pro Preview | $1.74 | 140.6 t/s | 57 |
| Google (Gemini) | 3.5 Flash (high) | ~$1.58 | 192.2 t/s | 55 |
| Kimi | K2.6 | $0.70 | 51.8 t/s | 54 |
| xAI (Grok) | 4.3 (high) | $0.64 | 187.9 t/s | 53 |
| DeepSeek | V4 Pro (max) | $0.18 | 61.6 t/s | 52 |
Challenger watch
Off the court: Grok's Bedrock problem
Game Day 12 final scoreboard
| Rank | Franchise | Intelligence ↕ | Speed (t/s) ↕ | Blended $/1M |
|---|---|---|---|---|
| 🥇 1 | Anthropic — Claude Opus 4.8 (max) | 61 ↔ | 71.2 ↑ | $4.10 |
| 🥈 2 | OpenAI — GPT-5.5 (xhigh) | 60 ↔ | 68.0 ↓ | $4.35 |
| 🥉 3 | Google — Gemini 3.1 Pro Preview | 57 ↔ | 140.6 ↔ | $1.74 |
| 4 | Google — Gemini 3.5 Flash (high) | 55 ↔ | 192.2 ↓ | ~$1.58 |
| 5 | Kimi — K2.6 | 54 ↔ | 51.8 ↓ | $0.70 |
| 6 | xAI — Grok 4.3 (high) | 53 ↔ | 187.9 ↓ | $0.64 |
| 7 | DeepSeek — V4 Pro (max) | 52 ↔ | 61.6 ↑ | $0.18 |
References
- 1Artificial Analysis — model rankings
- 2Claude Opus 4.8 detail — Artificial Analysis
- 3GPT-5.5 detail — Artificial Analysis
- 4Gemini 3.5 Flash detail — Artificial Analysis
- 5Grok 4.3 detail — Artificial Analysis
- 6Gemini 3.1 Pro detail — Artificial Analysis
- 7OpenRouter coding model rankings June 2026
- 8Why AWS wants Grok on Bedrock despite weak enterprise demand
Related content
Picked from other channels by content similarity—find new creators to follow.
Article·AIL Player Card #009 — Gemini 3.5 Flash: The Agentic Sprinter
91 OVR. AS. Terminal-Bench 76.2%. SWE-Bench Pro 55.1%. 4× faster than frontier rivals. $1.50/M input. Google National finally fields a player built for the agentic era — faster, cheaper, and better at multi-agent loops than its own flagship. #AILeague
AIL·Player Card
Article·还没有赢家:2026 年 5 月大模型竞争全景
Claude Opus 4.7 在编程 Agent 领域领跑,但综合智能指数第一是 GPT-5.5。Gemini 3.1 Pro 以 40% 的价格交付 80% 的性能,Grok 4.3 靠实时知识打差异化。Anthropic 估值飙至 $1.2 万亿背后是 Claude Code 的企业渗透逻辑,而开源模型正在改变「赢」的定义本身。
Twitter AI 长文精选
Article·AIL Player Card #016 — Grok 4.3: The Price-Slashing Sprinter
92 OVR. RP. 207 tokens/sec — fastest reasoning model in the league. $1.25/M input, 12× cheaper than GPT-5.5. Always-on reasoning. Video input debut. Live X data. xAI Dynamo filed their most disciplined card yet. #AILeague
AIL·Player Card
Article·Gemini 3.5 Flash:Flash 系列首次在编码和智能体任务上越过旗舰 Pro
Google 在 I/O 2026 上发布 Gemini 3.5 Flash 并同步公开 Model Card。这是 Flash 系列第一次在编码(Terminal-Bench 2.1: 76.2%)和智能体任务(Finance Agent v2: +14.9pp)上越过前代旗舰 Gemini 3.1 Pro,同时推理速度比同级前沿模型快 4 倍,定价比 Pro 便宜 40%。
三大公司大模型论文
Article·AIL Player Card #012 — Gemini Omni Flash: The World Model
92 OVR. WM. Conversational video editing. Any input → any output. Physics-grounded world knowledge baked in. Google National just fielded a position nobody in this league has played before. #AILeague
AIL·Player Card
Article·Gemini 3.5 Flash:Google 首个在 Agent 任务上超越旗舰 Pro 的 Flash 模型
Google 在 Google I/O 2026 发布 Gemini 3.5 Flash,这是 Gemini 系列中首个在智能体和编码基准上整体超越自家旗舰 Gemini 3.1 Pro 的 Flash 模型,同时保持 4 倍于其他前沿模型的输出速度。Finance Agent v2 领先 3.1 Pro 达 14.9 个百分点,Terminal-Bench 2.1 领先 5.9 个百分点。定价 $1.50/$9.00 / 百万 token,支持 1M 上下文窗口。
三大公司大模型论文

Add more perspectives or context around this Post.