| Stat | Rating |
|---|---|
| OVR | 93 |
| RZN | 91 |
| CRE | 87 |
| SPD | 85 |
| MLT | 95 |
| SAF | 83 |
| VAL | 89 |

93 OVR. MW. Topped LMArena on debut. 1M context, native multimodal, thinking engine built in. Google National finally fields a player who shows up when it matters. #AILeague
| Attribute | Detail |
|---|---|
| Team | Google National |
| Position | MW — Multimodal Wing |
| Season | 2025 |
| Context window | 1,048,576 tokens (1M) |
| Knowledge cutoff | January 2025 |
| Thinking | Native (chain-of-thought before answer) |
| Modalities | Text, image, audio, video, PDF (input); text (output) |

Available as gemini-2.5-pro, with confirmed pricing, production rate limits, and full multi-platform availability (Gemini API, Vertex AI, Google AI Studio).

| Benchmark | Gemini 2.5 Pro | Notes |
|---|---|---|
| LMArena Elo | #1 at launch (lead of ~40 points) | Human preference voting |
| GPQA Diamond | 84.0% | Scientific reasoning, pass@1 |
| AIME 2025 | 86.7% | Math competition, pass@1 |
| AIME 2024 | 92.0% | Math competition, pass@1 |
| SWE-bench Verified | 63.8% | Agentic coding, custom agent |
| MMMU | 81.7% | Visual reasoning, pass@1 |
| Global MMLU (Lite) | 89.8% | Multilingual knowledge |
| MRCR 128k context | 94.5% | Long context retrieval |
| MRCR 1M context | 83.1% | Long context retrieval |
| HLE (no tools) | 18.8% | Expert academic reasoning |

| Stat | Gemini 2.5 Pro | GPT-4o (Card #002) | Claude Sonnet 4 (Card #001) |
|---|---|---|---|
| OVR | 93 | 90 | 91 |
| RZN | 91 | 86 | 90 |
| MLT | 95 | 91 | 80 |
| SPD | 85 | 90 | 88 |
| SAF | 83 | 82 | 93 |
| VAL | 89 | 82 | 86 |
| Context | 1M tokens | 128K tokens | 200K tokens |
| Thinking | Native | No | Optional |
| GPQA Diamond | 84.0% | ~53% | ~80% (3.5 Sonnet) |
| Pricing (input/output, USD per 1M tokens) | $1.25/$10 | $2.50/$10 | $3/$15 |
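
To put the pricing row in per-request terms, here is a minimal sketch of the arithmetic. It assumes the figures in the table are flat list rates in USD per 1M tokens and ignores any tiered or long-prompt pricing a provider may apply. The `PRICES_PER_MILLION` table and the `request_cost` helper are hypothetical names made up for this illustration, not part of any vendor SDK.

```python
# Rates copied from the pricing row above.
# Assumption: (input, output) in USD per 1M tokens, flat list prices.
PRICES_PER_MILLION = {
    "Gemini 2.5 Pro": (1.25, 10.00),
    "GPT-4o": (2.50, 10.00),
    "Claude Sonnet 4": (3.00, 15.00),
}


def request_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request at the flat rates above."""
    input_rate, output_rate = PRICES_PER_MILLION[model]
    total = input_tokens * input_rate + output_tokens * output_rate
    return total / 1_000_000


if __name__ == "__main__":
    # Example workload: a 100K-token prompt with a 2K-token answer.
    for model in PRICES_PER_MILLION:
        print(f"{model}: ${request_cost(model, 100_000, 2_000):.3f}")
```

Under those assumptions, a 100K-token prompt with a 2K-token answer comes to roughly $0.145 on Gemini 2.5 Pro, $0.270 on GPT-4o, and $0.330 on Claude Sonnet 4. On input-heavy workloads the input rate dominates, which is where the gap in the table shows up most.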