
AIL Player Card #014 — Kimi K2.6: The Agentic Swarm
92 OVR. CW. SWE-Bench Pro 58.6% — ties GPT-5.5. AIME 2026 96.4%. 981 tokens/sec on Cerebras. 300-agent swarm built into the model. Open-weight, Modified MIT, $0.95/M input. Moonshot Challengers just fielded the most dangerous open-weight player in the league. #AILeague
Research Brief
📋 Player card
| Field | Value |
|---|---|
| Player | Kimi K2.6 |
| Club | Moonshot Challengers |
| Position | CW — Community Wing |
| Overall | 92 |
| Season | AI League 2026 |
Stat breakdown
| Dimension | Score | What it measures |
|---|---|---|
| RZN (Reasoning) | 91 | AIME 2026 96.4%, GPQA-Diamond 90.5%, HMMT 2026 92.7% |
| CRE (Creativity) | 85 | Code Arena WebDev Elo 1,529 — 6th of 67 models |
| SPD (Speed) | 94 | 981 TPS on Cerebras — fastest trillion-param open model ever clocked |
| MLT (Multimodal) | 82 | MMMU-Pro 79.4%, MathVision 93.2%, 400M MoonViT vision encoder |
| SAF (Safety) | 79 | Hallucination rate 39% (down from K2.5's 65%), approaching Opus 4.7 |
| VAL (Value) | 93 | $0.95/$4.00 per M input/output — 5–6× cheaper than Claude Opus 4.7 |
Position note: CW (Community Wing) is the open-weight / open-source roster pick. It competes on accessibility, cost, and self-hostability, and it is community-first. K2.6 raises the ceiling for the position: the earlier CW archetype was about accessibility, while K2.6 delivers frontier agentic coding performance at community prices.
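To make the SPD and VAL rows concrete, here is a minimal sketch that turns the card's quoted figures ($0.95 and $4.00 per million input/output tokens, 981 tokens/sec on Cerebras) into a per-request cost and a rough decode time. The workload size (200k tokens in, 20k tokens out) is a hypothetical example, and the timing ignores queueing and time-to-first-token, so treat the result as a back-of-envelope estimate rather than a measured number.

```python
# Figures quoted on the card above.
INPUT_PRICE_PER_M = 0.95   # USD per million input tokens
OUTPUT_PRICE_PER_M = 4.00  # USD per million output tokens
CEREBRAS_TPS = 981         # output tokens per second reported on Cerebras


def request_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the API cost in USD for one request at the listed prices."""
    return (input_tokens * INPUT_PRICE_PER_M
            + output_tokens * OUTPUT_PRICE_PER_M) / 1_000_000


def generation_seconds(output_tokens: int, tps: float = CEREBRAS_TPS) -> float:
    """Return decode time in seconds, ignoring queueing and time-to-first-token."""
    return output_tokens / tps


# Hypothetical agentic coding task: 200k tokens read, 20k tokens written.
cost = request_cost(200_000, 20_000)
seconds = generation_seconds(20_000)
print(f"cost: ${cost:.2f}, decode time: {seconds:.1f} s")
```

Under these assumptions the task costs about $0.27 and decodes in roughly 20 seconds; swapping in your own token counts gives a quick feel for what the two stat lines mean in practice.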
What Moonshot just fielded
The benchmark picture
The speed dimension nobody expected
Head-to-head vs same-position rivals
| Model | Club | OVR | SWE-Bench Pro | AIME 2026 | AA Intelligence Index | API price (USD per M input tokens) |
|---|---|---|---|---|---|---|
| Kimi K2.6 | Moonshot Challengers | 92 | 58.6% | 96.4% | 54 | $0.95 |
| MiniMax M3 | MiniMax Challengers | 91 | 59.0% | ~88% | ~52 | $0.30 |
| Llama 4 Maverick | Meta Open | 88 | ~45% | ~84% | ~46 | $0.18 |
| DeepSeek V4 Pro | DeepSeek Athletic | 95 | 55.1% | 93.5% | ~55 | $0.14 |
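The price column is easier to compare as multiples of a baseline. The sketch below uses only the input prices listed in the table and expresses each rival's price relative to Kimi K2.6. The helper name `price_multiple` is illustrative, and output-token prices are left out because the table does not list them for the rivals.

```python
# Input prices (USD per million tokens) copied from the table above.
input_price = {
    "Kimi K2.6": 0.95,
    "MiniMax M3": 0.30,
    "Llama 4 Maverick": 0.18,
    "DeepSeek V4 Pro": 0.14,
}


def price_multiple(model: str, baseline: str = "Kimi K2.6") -> float:
    """Return baseline input price divided by the model's input price."""
    return input_price[baseline] / input_price[model]


# List models from cheapest to most expensive input price.
for name in sorted(input_price, key=input_price.get):
    print(f"{name:<18} ${input_price[name]:.2f}/M  "
          f"Kimi K2.6 price is {price_multiple(name):.1f}x this")
```

This covers input tokens only; a fuller comparison would weight input and output prices by the token mix of the intended workload.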
License, deployment, and the open-weight story
Season highlights
Key numbers at a glance
Scout's verdict