Visualized Reading

I turn papers and blogs I am actively reading into single-file, self-contained interactive HTML pages. Each entry below opens a standalone visualization; it is my way of forcing myself to restate what I just read in a form that someone else could also navigate.

The exact Cursor agent command I use to produce these pages lives at blog-to-html.

verl 0.8.0: What changed in the mainline pipeline?

Jul 14, 2026 · blog

How much slower is vLLM really when it prints "Using default MoE config" while launching an MoE model

Jul 7, 2026 · blog

How SUPO uses end-to-end summarization to train multi-turn RL within a fixed window

Jun 30, 2026 · paper · original

How multimodal models turn images / video / speech into tokens

Jun 30, 2026 · blog

OpenThoughts-Agent breaks the black box of agent training data down into an open-source recipe

Jun 26, 2026 · paper · original

How QUEST trains a general-purpose deep research agent with 8K synthetic Rubric Trees

Jun 24, 2026 · paper · original

Ratchet: which management actions a self-evolving agent's experience handbook cannot do without

Jun 4, 2026 · paper · original

NudgeRL: a clever combination of self-distillation and RL

May 25, 2026 · paper · original

A survey of agent memory

May 21, 2026 · paper · original

How the EAGLE series pushed LLM inference speedup to 6.5x, step by step

May 14, 2026 · paper · original

An introduction to RoPE

May 13, 2026 · paper · original

Why MiMo chose MTP and a higher SWA ratio over MLA

May 13, 2026 · paper · original

AlphaEvolve: LLM-based automatic algorithm improvement (auto research)

May 13, 2026 · paper · original

How Skill0 uses RL to internalize agent skills into model parameters

May 13, 2026 · paper · original

MLA · Jointly compressing the KV cache into a single latent vector

May 11, 2026 · paper · original

A distributional-geometry view of how three post-training methods (SFT, RL, OPD) move the model

May 11, 2026 · blog · original

Building an Agent System that understands its users better · A three-layer design

May 9, 2026 · blog

Manus context engineering · 6 lessons from rewriting the Agent framework 4 times

May 8, 2026 · blog · original

Anthropic Skills

May 8, 2026 · blog · original

Hermes Agent's memory mechanism · Two-layer structure and frozen snapshots

May 8, 2026 · blog · original

Three bugs colliding at once · Anthropic's postmortem on the Claude quality degradation

May 7, 2026 · blog · original

Context Engineering by Anthropic

May 7, 2026 · blog · original

How Anthropic designs a long-horizon coding Harness

May 6, 2026 · blog · original

RLVR parameter updates mostly fall along non-principal-component directions

Apr 30, 2026 · paper · original

What happens inside verl during training?

Apr 30, 2026 · blog

verl's batch dispatch mechanism: balance batch and dynamic bsz

Apr 29, 2026 · blog

DeepSeek-V4 Report

Apr 28, 2026 · paper

How Anthropic builds its multi-agent research system

Apr 23, 2026 · blog · original