
Best of your X follows: May 28
Today's digest: Claude Opus 4.8 ships with better judgment and unchanged pricing, dynamic workflows let Claude Code run hundreds of parallel agents, a study finds five frontier LLMs only agree on 33% of fact-checks, YouTube starts auto-labeling AI video, Paul Graham explains why he never finishes AI-written emails, and a Microsoft Copilot prompt-injection bug enabled file exfiltration.
Model releases
Claude Opus 4.8 is out — same price, meaningfully better judgment
Developer tools
Claude Code can now run hundreds of parallel agents on a single task
ultracode setting turns it on automatically; be aware it consumes significantly more tokens than a normal Claude Code session.Research
Five frontier LLMs agree on only 33% of fact-check claims
Society and ethics
YouTube will now auto-label AI video — even if you don't disclose it
Paul Graham stopped reading AI-written emails
Security
Microsoft Copilot Cowork leaked files through prompt injection — again
References
Related content
Picked from other channels by content similarity—find new creators to follow.
Article·🚨 BREAKING: Anthropic Drops Claude Opus 4.8 — 4× Less Likely to Lie, Same Price, Hundreds of Parallel Subagents
🚨 BREAKING: Anthropic ships Claude Opus 4.8 — 42 days after Opus 4.7, same $5/$25 price, 4× better at catching its own mistakes. Dynamic Workflows unlocks hundreds of parallel subagents. The safety squad is playing offense now. #AILeague
AIL·Breaking
Article·Claude Opus 4.8: four times fewer silent code flaws, Mythos-level alignment, and 100-agent workflows
On May 28, 2026, Anthropic released Claude Opus 4.8 — an upgrade that cuts unremarked code flaws by 4x versus Opus 4.7, matches the alignment properties of Claude Mythos Preview, and ships dynamic workflows capable of coordinating hundreds of parallel subagents in a single session. Pricing is unchanged. This piece covers the benchmark results, what the alignment numbers actually mean, and how dynamic workflows work in practice.
Anthropic & Claude Deep Tracker
Article·Claude Opus 4.8:当「诚实」成为旗舰模型的核心卖点
Anthropic 在 2026 年 5 月发布的 Claude Opus 4.8,以「诚实性」作为首要叙事方向:代码缺陷未标出率下降 4 倍、首个在关键 Agent 测试上漏报率为零的 Claude 模型。本文深度拆解其核心能力提升、Dynamic Workflows 新功能、benchmark 进退与竞品格局,以及 Mythos 下一代模型的时间线信号。
LLM Release Notes
Audio·Opus 4.8:Anthropic 把旗舰模型做成更稳的代理工人
Anthropic 发布 Claude Opus 4.8,同价升级 Opus,并把努力程度控制、Claude Code 动态工作流和更强调诚实性的评估放到同一条线上。本期解读它为什么指向更长时间、更高自治度的代理工作,而不只是一次跑分提升。
Claude 博客解读播客
Article·AI Coding Tools Weekly: Opus 4.8 lands on three platforms, Copilot's $746 bill shock, and the June 18 Gemini CLI deadline
This week's digest covers 22 confirmed events across 8 tools: Anthropic closed a $65B Series H and shipped Claude Opus 4.8 simultaneously to Copilot, Cursor, and Windsurf. Claude Code's Dynamic Workflows (16 parallel sub-agents, 1,000 total) enabled a 750K-line Zig→Rust migration in 11 days. Cursor v3.6 launched Auto-review mode to keep agents running without constant approval interrupts. Copilot's June 1 usage-based billing is generating sticker shock — community posts document bills of 15–26× current rates under agentic workloads. Devin raised $1B at a $26B valuation with $492M run-rate revenue; async sessions now outnumber interactive ones. Grok Build shipped 7 releases in 7 days. Gemini CLI shuts down June 18. The BARE benchmark finds frontier models succeed on real maintainability tasks less than 23% of the time.
Global AI Coding Tools Update
Article·Claude 三个月迭代全景:从旗舰降价到 AI 安全分水岭
2026 年 2 月至 5 月,Anthropic 在模型、定价、产品、对齐研究四条线同步推进:Opus 4.6/4.7、Sonnet 4.6、Haiku 4.5 密集迭代,旗舰降价 67%,Mythos Preview 引发 AI 安全新关注,agent 编排架构全面成熟。
Claude 全动态追踪

Add more perspectives or context around this Post.