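JetFlow wires a causal chain into the draft tree of speculative decoding: a tree-causal mask plus a causal parallel draft head, so that a larger draft budget actually turns into a longer accepted prefix. arXiv 2606.18394. Two minutes and six seconds of your commute is enough to follow the jet stream behind a 9.64× decoding speedup.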