
AI League — Game Day 2: Claude Holds Court, DeepSeek Clock Ticking
Claude Opus 4.8 holds AI Index #1 (61). DeepSeek promo expires in 24h. Grok posts 158 t/s. Full May 30 post-game stats panel. #AILeague
Final standings — AI Index leaderboard
| Rank | Franchise | Model (Best Variant) | AI Index | Speed (t/s) | Price Blend |
|---|---|---|---|---|---|
| 🥇 1 | Anthropic | Claude Opus 4.8 (Max) | 61 | 57.8 | $4.10/M |
| 🥈 2 | OpenAI | GPT-5.5 (xhigh) | 60 | 54.7 | $4.35/M |
| 🥉 3 | OpenAI | GPT-5.5 (high) | 59 | — | — |
| 4 | Anthropic | Claude Opus 4.7 (Max) | 57 | — | — |
| 5 | Gemini 3.1 Pro Preview | 57 | 112.9 | $1.74/M | |
| — | Gemini 3.5 Flash (high) | 55 | 175.6 | $1.31/M | |
| — | xAI | Grok 4.3 (high) | 53 | 157.9 | $0.64/M |
| — | DeepSeek | V4 Pro (Reasoning Max) | 52 | 46.2 | $0.18/M |
| — | Meta | Llama 4 Scout | 14 | 105.9 | $0.22/M |
Game of the night — Gemini vs Grok: best value in the top bracket
Speed panel — who's running the floor
| Tier | Model | Speed (t/s) | Notes |
|---|---|---|---|
| League-wide | Mercury 2 | 742 | Inception Labs — not in core 6 |
| Fast tier | Gemini 3.5 Flash (high) | 175.6 | Speed leader in top 10 intelligence |
| Fast tier | Grok 4.3 (high) | 157.9 | Surprise: posted 158 t/s at AI Index 53 |
| Mid tier | Gemini 3.1 Pro Preview | 112.9 | Strong dual: 57 index + speed |
| Mid tier | Llama 4 Scout | 105.9 | 10M context window |
| Slow tier | Claude Opus 4.8 (Max) | 57.8 | Speed below average for its price class |
| Slow tier | GPT-5.5 (xhigh) | 54.7 | Also below average; TTFT 67s |
| Slow tier | DeepSeek V4 Pro (Max) | 46.2 | Cheapest; slowest of the group |

Pricing war breakdown
| Franchise | Input (<latex inline="true" source="/1M) | Output (" />/1M) | Blend (7:2:1) | Promo? |
|---|---|---|---|---|
| Anthropic Claude Opus 4.8 | $6.25 | $25.00 | $4.10 | — |
| OpenAI GPT-5.5 (xhigh) | $5.00 | $30.00 | $4.35 | — |
| Google Gemini 3.1 Pro | $2.00 | $12.00 | $1.74 | — |
| Google Gemini 3.5 Flash | $1.50 | $9.00 | $1.31 | — |
| xAI Grok 4.3 (high) | $1.25 | $2.50 | $0.64 | — |
| Kimi K2.6 | $0.95 | $4.00 | $0.70 | — |
| DeepSeek V4 Pro (Max) | $0.435 | $0.87 | $0.18 | ⚠️ Ends May 31 |
| Meta Llama 4 Scout | ~$0.17 | ~$0.66 | $0.22 | — |
Context window chart
| Model | Context Window | Use case edge |
|---|---|---|
| Llama 4 Scout | 10M tokens | League-widest; RAG pipelines, bulk doc ingestion |
| Claude Opus 4.8 | 1M tokens | Multimodal; complex instruction stacking |
| GPT-5.5 (xhigh) | ~922K tokens | Near-parity with Claude |
| Gemini 3.1/3.5 Flash | 1M tokens | Strong multimodal + video input |
| Grok 4.3 | 1M tokens | Competitive parity |
| DeepSeek V4 Pro | 1M tokens | Text-only; no image input |


Challenger watch
- Grok Code Fast 1 — evaluated May 25, 2026. Coding specialist, early access. Designed to complement Grok 4.3 in developer workflows. Pricing and benchmark scores not yet in Artificial Analysis index. 8
Postgame summary
Related content
Picked from other channels by content similarity—find new creators to follow.
Article·还没有赢家:2026 年 5 月大模型竞争全景
Claude Opus 4.7 在编程 Agent 领域领跑,但综合智能指数第一是 GPT-5.5。Gemini 3.1 Pro 以 40% 的价格交付 80% 的性能,Grok 4.3 靠实时知识打差异化。Anthropic 估值飙至 $1.2 万亿背后是 Claude Code 的企业渗透逻辑,而开源模型正在改变「赢」的定义本身。
Twitter AI 长文精选
Article·AIL Player Card #009 — Gemini 3.5 Flash: The Agentic Sprinter
91 OVR. AS. Terminal-Bench 76.2%. SWE-Bench Pro 55.1%. 4× faster than frontier rivals. $1.50/M input. Google National finally fields a player built for the agentic era — faster, cheaper, and better at multi-agent loops than its own flagship. #AILeague
AIL·Player Card
Article·AIL Player Card #016 — Grok 4.3: The Price-Slashing Sprinter
92 OVR. RP. 207 tokens/sec — fastest reasoning model in the league. $1.25/M input, 12× cheaper than GPT-5.5. Always-on reasoning. Video input debut. Live X data. xAI Dynamo filed their most disciplined card yet. #AILeague
AIL·Player Card
Article·四大模型同日登场,五角大楼清洗 AI 供应商 | 5月1日
GPT-5.5、Claude Opus 4.7、DeepSeek V4、Grok 4.3 同日亮相,闭源模型越来越贵、开源竞品越来越能打,剪刀差持续扩大;美国国防部重新划定 AI 供应商名单,Anthropic 因拒绝放松「自主武器」使用限制而被踢出;Meta 收购人形机器人公司 ARI 入局具身 AI,Musk v. Altman 庭审中 xAI 用 OpenAI 模型蒸馏 Grok 的事实被坐实;Salesforce 发布企业 Agent 运维平台,Netomi 完成 1.1 亿美元 C 轮。
AI 日报|量子位风
Article·AIL Player Card #004 — DeepSeek V4 Pro: The Value Engineer
95 OVR. VE. Open-source. 1.6T parameters. $3.48/M output vs $30 for GPT-5.5. Codeforces ELO 3206 — beats the incumbent by 38 points. DeepSeek Athletic just repriced the frontier. #AILeague
AIL·Player Card
Article·Gemini 3.5 Flash:Google 首个在 Agent 任务上超越旗舰 Pro 的 Flash 模型
Google 在 Google I/O 2026 发布 Gemini 3.5 Flash,这是 Gemini 系列中首个在智能体和编码基准上整体超越自家旗舰 Gemini 3.1 Pro 的 Flash 模型,同时保持 4 倍于其他前沿模型的输出速度。Finance Agent v2 领先 3.1 Pro 达 14.9 个百分点,Terminal-Bench 2.1 领先 5.9 个百分点。定价 $1.50/$9.00 / 百万 token,支持 1M 上下文窗口。
三大公司大模型论文

Add more perspectives or context around this Post.