
AIL Player Card #011 — MiniMax M3: The Open Frontier
91 OVR. CW. SWE-Bench Pro 59.0% — beats GPT-5.5. 1M context. Native multimodal. Open-weight. $0.30/M input at 5% of Claude Opus's cost. A challenger club just walked into the league carrying a player nobody expected. #AILeague
Scouting report
Stat card
| Dimension | Score | Benchmark anchor |
|---|---|---|
| OVR | 91 | Composite |
| RZN — Reasoning | 79 | BrowseComp 83.5%, GDPval rubrics 74.7% |
| CRE — Creativity | 78 | VIBE V2 50.1, SVG-Bench 63.7 |
| SPD — Speed | 84 | MSA architecture: 9× prefill, 15× decode speedup vs prior gen |
| MLT — Multimodal | 86 | Native Step-0 multimodal training; video input; desktop operation |
| SAF — Safety | 72 | No dedicated safety benchmark disclosed; adequate, not safety-first |
| VAL — Value | 92 | $0.30/M input (promo); open-weight; 5–10% of closed-source frontier cost |
Key plays this season

The open-weight asterisk
Pricing breakdown
| Model | Input ($/M) | Output ($/M) | Notes |
|---|---|---|---|
| MiniMax M3 | $0.30 | $1.20 | 50% promo; regular $0.60/$2.40 |
| Mistral Large 3 | $0.50 | $1.50 | Apache 2.0, open-weight |
| Gemini 3.5 Flash | $1.50 | $9.00 | Closed, fast tier |
| GPT-5.5 | $5.00 | $30.00 | Closed, top tier |
| Claude Opus 4.8 | $5.00 | $25.00 | Closed, safety-first |
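
To make the cost gap concrete, the per-million prices in the table can be turned into a per-job cost. The sketch below is illustrative only: the prices are copied from the table, while the function names (`job_cost`, `cost_ratio`) and the example workload of 1M input plus 1M output tokens are assumptions made for this example, not figures from any provider.

```python
# Prices in USD per million tokens (input, output), copied from the table above.
PRICES = {
    "MiniMax M3 (promo)": (0.30, 1.20),
    "MiniMax M3 (regular)": (0.60, 2.40),
    "Mistral Large 3": (0.50, 1.50),
    "Gemini 3.5 Flash": (1.50, 9.00),
    "GPT-5.5": (5.00, 30.00),
    "Claude Opus 4.8": (5.00, 25.00),
}


def job_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the USD cost of one job at the listed per-million-token prices."""
    input_price, output_price = PRICES[model]
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


def cost_ratio(model: str, baseline: str, input_tokens: int, output_tokens: int) -> float:
    """Return the cost of `model` as a fraction of `baseline` for the same job."""
    return job_cost(model, input_tokens, output_tokens) / job_cost(
        baseline, input_tokens, output_tokens
    )


if __name__ == "__main__":
    # Hypothetical workload: 1M input tokens and 1M output tokens.
    tokens_in, tokens_out = 1_000_000, 1_000_000
    for name in PRICES:
        cost = job_cost(name, tokens_in, tokens_out)
        share = cost_ratio(name, "Claude Opus 4.8", tokens_in, tokens_out)
        print(f"{name:22s} ${cost:6.2f}  {share:6.1%} of Claude Opus 4.8")
```

On this assumed even split, MiniMax M3 at promo pricing costs $1.50 against $30.00 for Claude Opus 4.8, which is about 5%. The ratio shifts with the input/output mix: on input tokens alone it is 6%, and at regular pricing every ratio doubles.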
Head-to-head: CW / open-weight position
| | MiniMax M3 | Llama 4 Maverick | DeepSeek V4 Pro |
|---|---|---|---|
| Position | CW | CW | VE |
| OVR | 91 | 88 | 95 |
| Context | 1M tokens | 1M tokens | 128K tokens |
| Multimodal | Native (Step 0) | Native | Text-only |
| SWE-Bench Pro | 59.0% | ~45% est. | 55.4% |
| API input cost | $0.30/M (promo) | Free tier / $0.19/M | ~$0.14/M |
| Open-weight | Yes (weights) | Yes (weights) | Yes (weights) |
| Full open-source | No | No | No |
League context