
AI League — Game Day 16: Grok's Freefall Ends as Flash Opens an 11-Point Lead
Grok 4.3 posts its first positive speed reading in three games (143.5 → 150.8 t/s, +5.1%), ending the back-to-back double-digit drops. Gemini 3.5 Flash surges to 162.1 t/s (+7.4%), rebuilding an 11.3 t/s lead. Gemini 3.1 Pro is the stealth mover at +12.7%. Intelligence board locked at 65 for Day 5. Full June 13 stats. #AILeague
🏆 Intelligence board — Day 5 of the Fable 5 era
| Rank | Model | AA Index | Δ |
|---|---|---|---|
| 1 | Claude Fable 5 (Anthropic) | 65 | ↔ |
| 2 | Claude Opus 4.8 (Anthropic) | 61 | ↔ |
| 3 | GPT-5.5 xhigh (OpenAI) | 60 | ↔ |
| 4 | GPT-5.5 high (OpenAI) | 59 | ↔ |
| 5 | Gemini 3.1 Pro Preview (Google) | 57 | ↔ |
| 6 | Gemini 3.5 Flash (Google) | 55 | ↔ |
| 7 | Grok 4.3 (xAI) | 53 | ↔ |
| 8 | DeepSeek V4 Pro (DeepSeek) | 52 | ↔ |
⚡ Speed board — Flash opens up an 11-point lead
| Model | Speed | Δ vs. Day 15 |
|---|---|---|
| Gemini 3.5 Flash | 162.1 t/s | ↑ +7.4% |
| Grok 4.3 | 150.8 t/s | ↑ +5.1% |
| Gemini 3.1 Pro Preview | 125.4 t/s | ↑ +12.7% |
| DeepSeek V4 Pro | 60.2 t/s | ~flat |
| Claude Fable 5 | 63.8 t/s | — |
| Claude Opus 4.8 | 56.7 t/s | ↓ |
| GPT-5.5 xhigh | 53.0 t/s | ↑ +5.8% |
💰 Pricing war — the budget tier holds
| Model | Input | Output | Blended |
|---|---|---|---|
| DeepSeek V4 Pro | $0.435 | $0.87 | ~$0.18 |
| Grok 4.3 | $1.25 | $2.50 | $0.64 |
| Gemini 3.5 Flash | $1.50 | $9.00 | $1.31 |
| Gemini 3.1 Pro | $2.00 | $12.00 | ~$1.74 |
| GPT-5.5 xhigh | $5.00 | $30.00 | $4.35 |
| Claude Opus 4.8 | $5.00 | $25.00 | ~$4.10 |
| Claude Fable 5 | $10.00 | $50.00 | $7.70 |
🔭 Challenger watch
📊 Game Day 16 — stat line summary
References
Related content
Picked from other channels by content similarity—find new creators to follow.
Article·AIL Player Card #009 — Gemini 3.5 Flash: The Agentic Sprinter
91 OVR. AS. Terminal-Bench 76.2%. SWE-Bench Pro 55.1%. 4× faster than frontier rivals. $1.50/M input. Google National finally fields a player built for the agentic era — faster, cheaper, and better at multi-agent loops than its own flagship. #AILeague
AIL·Player Card
Article·Gemini 3.5 Flash:Google 首个在 Agent 任务上超越旗舰 Pro 的 Flash 模型
Google 在 Google I/O 2026 发布 Gemini 3.5 Flash,这是 Gemini 系列中首个在智能体和编码基准上整体超越自家旗舰 Gemini 3.1 Pro 的 Flash 模型,同时保持 4 倍于其他前沿模型的输出速度。Finance Agent v2 领先 3.1 Pro 达 14.9 个百分点,Terminal-Bench 2.1 领先 5.9 个百分点。定价 $1.50/$9.00 / 百万 token,支持 1M 上下文窗口。
三大公司大模型论文
Article·Gemini 3.5 Flash:Flash 系列首次在编码和智能体任务上越过旗舰 Pro
Google 在 I/O 2026 上发布 Gemini 3.5 Flash 并同步公开 Model Card。这是 Flash 系列第一次在编码(Terminal-Bench 2.1: 76.2%)和智能体任务(Finance Agent v2: +14.9pp)上越过前代旗舰 Gemini 3.1 Pro,同时推理速度比同级前沿模型快 4 倍,定价比 Pro 便宜 40%。
三大公司大模型论文
Article·AIL Player Card #013 — Claude Fable 5: The Mythos Striker
95 OVR. SF. SWE-Bench Pro 80.3% — 20 pts clear of GPT-5.5. FrontierCode Diamond 29.3%. Mythos-class. First Anthropic model you can actually buy. Anthropic FC just cleared their best player for the public pitch. #AILeague
AIL·Player Card
Article·AIL Player Card #016 — Grok 4.3: The Price-Slashing Sprinter
92 OVR. RP. 207 tokens/sec — fastest reasoning model in the league. $1.25/M input, 12× cheaper than GPT-5.5. Always-on reasoning. Video input debut. Live X data. xAI Dynamo filed their most disciplined card yet. #AILeague
AIL·Player Card
Article·AIL Player Card #012 — Gemini Omni Flash: The World Model
92 OVR. WM. Conversational video editing. Any input → any output. Physics-grounded world knowledge baked in. Google National just fielded a position nobody in this league has played before. #AILeague
AIL·Player Card

Add more perspectives or context around this Post.