Tighten public-facing copy

wimi321 · wimi321 · commit e0d518f915c9 · 2026-03-19T00:00:17.000+08:00
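
Review note: this commit renames two in-page anchors in both READMEs (`#real-bundles` to `#example-output`, `#format-vs-alternatives` to `#where-it-fits`), so each nav link has to change together with its `<a id="...">` target. Below is a minimal sketch of a check for that, assuming anchors are written as `id="..."` and links as `href="#..."`, as they are in the hunks of this diff. `findBrokenAnchors` is a hypothetical helper and is not part of the task-bundle repository.

```typescript
// Hypothetical helper (not part of task-bundle): lists in-page link targets
// (href="#name") that have no matching id="name" in the same document.
function findBrokenAnchors(markdown: string): string[] {
  // Collect every explicit anchor id, e.g. <a id="example-output"></a>.
  const ids: string[] = [];
  const idPattern = /\sid="([^"]+)"/g;
  let match: RegExpExecArray | null;
  while ((match = idPattern.exec(markdown)) !== null) {
    ids.push(match[1]);
  }

  // Collect every in-page link whose target id was not found above.
  const broken: string[] = [];
  const hrefPattern = /href="#([^"]+)"/g;
  while ((match = hrefPattern.exec(markdown)) !== null) {
    if (ids.indexOf(match[1]) === -1) {
      broken.push(match[1]);
    }
  }
  return broken;
}

// Example: a nav link still points at the old anchor name.
const sample = [
  '<a href="#quickstart">Quick Start</a>',
  '<a href="#example-output">Example Output</a>',
  '<a href="#real-bundles">Real Output</a>',
  '<a id="quickstart"></a>',
  '<a id="example-output"></a>',
].join("\n");

console.log(findBrokenAnchors(sample)); // [ 'real-bundles' ]
```

Running a check like this over `README.md` and `README.zh-CN.md` after a rename would flag any nav link left pointing at a removed anchor. It does not cover anchors that GitHub generates from headings, only the explicit `id` attributes used here.
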
diff --git a/README.md b/README.md
@@ -6,42 +6,37 @@
   <img src="./assets/hero-banner.svg" alt="Task Bundle hero banner" width="100%" />
 </p>
 
-<p align="center"><strong>Turn AI coding runs into portable, replayable, benchmark-ready task bundles.</strong></p>
-<p align="center">A practical format between raw chat logs and heavyweight benchmark platforms.</p>
+<p align="center"><strong>Turn AI coding runs into portable, replayable task bundles.</strong></p>
+<p align="center">Useful when chat logs are too loose and full benchmark platforms are too heavy.</p>
 <p align="center">
   <a href="#quickstart"><strong>Quick Start</strong></a> ·
-  <a href="#real-bundles"><strong>Real Output</strong></a> ·
-  <a href="#format-vs-alternatives"><strong>Why This Format</strong></a> ·
+  <a href="#example-output"><strong>Example Output</strong></a> ·
+  <a href="#where-it-fits"><strong>Where It Fits</strong></a> ·
   <a href="./docs/bundle-format.md"><strong>Bundle Format</strong></a> ·
   <a href="./docs/sample-benchmark-report.md"><strong>Sample Report</strong></a> ·
-  <a href="./ROADMAP.md"><strong>Roadmap</strong></a> ·
-  <a href="./docs/branding.md"><strong>Brand Assets</strong></a>
+  <a href="./ROADMAP.md"><strong>Roadmap</strong></a>
 </p>
 
 [![CI](https://github.com/wimi321/task-bundle/actions/workflows/ci.yml/badge.svg)](https://github.com/wimi321/task-bundle/actions/workflows/ci.yml)
 [![GitHub stars](https://img.shields.io/github/stars/wimi321/task-bundle?style=social)](https://github.com/wimi321/task-bundle/stargazers)
 [![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](./LICENSE)
 
-Task Bundle is a TypeScript + Node.js CLI for teams building agents, evals, coding benchmarks, and reproducible AI workflows.
+Task Bundle is a TypeScript + Node.js CLI for packaging one coding task into a directory you can inspect, compare, archive, validate, and report on.
 
-Package a task once, inspect it later, compare tools on the same starting point, and generate benchmark-style reports from real artifacts.
+Use it to:
+- keep task inputs, summaries, diffs, events, and workspace files together
+- compare Codex, Claude Code, Cursor, or internal tools using metadata, hashes, and outcome fields
+- generate benchmark-style reports from a folder of bundles
+- preserve enough context for reruns without aiming for token-perfect replay
 
-It helps you:
-- turn one AI coding run into a clean, shareable directory instead of leaving it scattered across screenshots, transcripts, or loose patches
-- compare Codex, Claude Code, Cursor, or internal agents using metadata, hashes, and outcome fields
-- generate benchmark-style reports from a folder of bundles without standing up a full evaluation platform first
-- preserve enough context for reruns and comparisons without requiring token-perfect recording
-
-It fits the gap between raw logs and full evaluation systems: light enough for day-to-day work, structured enough for replay and benchmarking.
-
-It is designed for workflows where you want to:
+It works best when you want to:
 - inspect what happened
 - share a task with someone else
 - rerun a task later
 - compare outputs across tools and models
 - grow toward replay and benchmark workflows
 
-It is intentionally not:
+It is not:
 - an agent framework
 - a chat UI
 - a provider router
@@ -60,13 +55,13 @@ npm run build
 npm run dev -- compare ./examples/hello-world-bundle ./examples/hello-world-bundle-claude
 ```
 
-If you want the shortest possible proof that the project already works, this is it.
+This is the fastest way to see the format in action.
 
 ![Task Bundle workflow overview](./assets/workflow-overview.svg)
 
-<a id="real-bundles"></a>
+<a id="example-output"></a>
 
-## See It On Real Bundles
+## Example Output
 
 Inspect a bundle:
 
@@ -110,13 +105,13 @@ Ranking
 2. Fix greeting punctuation | claude-code / claude-sonnet-4 | success | score 0.89
 ```
 
-Browse the committed example report:
+See the committed sample report:
 - [docs/sample-benchmark-report.md](./docs/sample-benchmark-report.md)
 - [docs/sample-benchmark-report.zh-CN.md](./docs/sample-benchmark-report.zh-CN.md)
 
-<a id="format-vs-alternatives"></a>
+<a id="where-it-fits"></a>
 
-## How It Compares To Common Alternatives
+## Where It Fits
 
 | Need | Chat logs | Zip or tarball | Full benchmark platform | Task Bundle |
 | --- | --- | --- | --- | --- |
@@ -161,7 +156,6 @@ See:
 - [docs/bundle-format.zh-CN.md](./docs/bundle-format.zh-CN.md)
 - [docs/design-decisions.md](./docs/design-decisions.md)
 - [docs/replay-contract.md](./docs/replay-contract.md)
-- [docs/branding.md](./docs/branding.md)
 
 ## Five-Minute Demo
 
diff --git a/README.zh-CN.md b/README.zh-CN.md
@@ -6,42 +6,37 @@
   <img src="./assets/hero-banner.svg" alt="Task Bundle hero banner" width="100%" />
 </p>
 
-<p align="center"><strong>把 AI coding 过程变成可分享、可重跑、可比较、可做 benchmark 的任务包。</strong></p>
-<p align="center">它适合放在聊天记录和 benchmark 平台之间，承接真实任务与结果。</p>
+<p align="center"><strong>把 AI coding 过程整理成可分享、可比较、可重跑的任务包。</strong></p>
+<p align="center">适合用在聊天记录不够稳定、完整 benchmark 平台又太重的场景里。</p>
 <p align="center">
   <a href="#quickstart"><strong>快速开始</strong></a> ·
-  <a href="#real-bundles"><strong>真实输出</strong></a> ·
-  <a href="#format-vs-alternatives"><strong>为什么是这个格式</strong></a> ·
+  <a href="#example-output"><strong>示例输出</strong></a> ·
+  <a href="#where-it-fits"><strong>方案对比</strong></a> ·
   <a href="./docs/bundle-format.zh-CN.md"><strong>格式说明</strong></a> ·
   <a href="./docs/sample-benchmark-report.zh-CN.md"><strong>示例报告</strong></a> ·
-  <a href="./ROADMAP.zh-CN.md"><strong>路线图</strong></a> ·
-  <a href="./docs/branding.zh-CN.md"><strong>品牌素材</strong></a>
+  <a href="./ROADMAP.zh-CN.md"><strong>路线图</strong></a>
 </p>
 
 [![CI](https://github.com/wimi321/task-bundle/actions/workflows/ci.yml/badge.svg)](https://github.com/wimi321/task-bundle/actions/workflows/ci.yml)
 [![GitHub stars](https://img.shields.io/github/stars/wimi321/task-bundle?style=social)](https://github.com/wimi321/task-bundle/stargazers)
 [![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](./LICENSE)
 
-Task Bundle 是一个 TypeScript + Node.js CLI，适合 agent、eval、benchmark、可复现实验这类工作流。
+Task Bundle 是一个 TypeScript + Node.js CLI，用来把一次编码任务打包成可以查看、比较、归档、校验、生成报告的目录。
 
-把一次运行整理好之后，就可以 inspect、compare、validate、report，也方便把不同工具放到同一起点上做对照。
+你可以用它来：
+- 把任务输入、执行摘要、diff、事件和工作区文件放在一起
+- 比较 Codex、Claude Code、Cursor 或内部工具的运行结果，并保留元数据、哈希和 outcome 字段
+- 从一组 bundle 生成 benchmark 风格报告
+- 为后续重跑保留足够上下文，而不是追求逐 token 回放
 
-它主要解决这些问题：
-- 把一次 AI coding 任务整理成干净、稳定、可搬运的目录，而不是散落在截图、聊天记录或 patch 里
-- 比较 Codex、Claude Code、Cursor 或内部工具的结果，而且比较依据包括元数据、哈希和 outcome 字段
-- 从一组 bundle 直接生成 benchmark 风格报告，不用先搭完整评测平台
-- 为后续重跑和比较保留足够上下文，而不是依赖逐 token 录制
-
-它适合放在“聊天记录不够稳”和“完整 benchmark 平台太重”之间，作为更轻但足够结构化的方案。
-
-它适合这些场景：
+它比较适合这些场景：
 - 查看一次任务最后到底做了什么
 - 把任务结果分享给别人
 - 在之后重新执行同一个任务
 - 比较不同模型或工具在同一起点上的表现
 - 作为未来 replay / benchmark 工作流的基础层
 
-它明确不做这些事情：
+它不打算解决这些问题：
 - agent 框架
 - 聊天 UI
 - provider 路由器
@@ -60,15 +55,15 @@ npm run build
 npm run dev -- compare ./examples/hello-world-bundle ./examples/hello-world-bundle-claude
 ```
 
-如果你只想先确认“这项目现在到底能不能用”，这组命令就是最短路径。
+这是最快看懂项目在做什么的一组命令。
 
 ![Task Bundle workflow overview](./assets/workflow-overview.svg)
 
-<a id="real-bundles"></a>
+<a id="example-output"></a>
 
-## 看看真实输出
+## 示例输出
 
-先 inspect 一个 bundle：
+先查看一个 bundle：
 
 ```text
 $ npm run dev -- inspect ./examples/hello-world-bundle
@@ -110,31 +105,31 @@ Ranking
 2. Fix greeting punctuation | claude-code / claude-sonnet-4 | success | score 0.89
 ```
 
-你也可以直接点开仓库里提交好的示例报告：
+也可以直接查看仓库里提交好的示例报告：
 - [docs/sample-benchmark-report.zh-CN.md](./docs/sample-benchmark-report.zh-CN.md)
 - [docs/sample-benchmark-report.md](./docs/sample-benchmark-report.md)
 
-<a id="format-vs-alternatives"></a>
+<a id="where-it-fits"></a>
 
-## 和常见替代方案怎么区分
+## 和常见方案对比
 
 | 需求 | 聊天记录 | Zip / tarball | 完整 benchmark 平台 | Task Bundle |
 | --- | --- | --- | --- | --- |
 | 把原始任务和最终结果放在一起分享 | 部分满足 | 可以 | 可以 | 可以 |
-| 在同一起点上比较不同工具 | 很弱 | 很靠手工 | 可以 | 可以 |
+| 在同一起点上比较不同工具 | 较弱 | 很靠手工 | 可以 | 可以 |
 | 携带 artifact 哈希和结果元数据 | 不行 | 不行 | 可以 | 可以 |
 | 足够轻，能融入日常 coding 工作流 | 可以 | 可以 | 不太行 | 可以 |
-| 之后继续长成 replay / benchmark 工作流 | 很弱 | 很弱 | 可以 | 可以 |
+| 之后继续扩展成 replay / benchmark 工作流 | 较弱 | 较弱 | 可以 | 可以 |
 
 ## 为什么值得关注
 
 很多 AI coding 结果最后只留下截图、聊天记录或者一个 patch，后续几乎没法稳定比较。
 
-Task Bundle 想解决的就是这个空档：把一次任务变成一个可以 inspect、archive、compare、validate、report 的稳定单元。它特别适合：
+Task Bundle 主要解决的是这个问题：把一次任务变成一个可以查看、归档、比较、校验、生成报告的稳定单元。它比较适合：
 - 想做可复现实验的 agent 作者
 - 想做任务评测和 benchmark 的团队
 - 想比较 Codex、Claude Code、Cursor 或内部工具的开发者
-- 更关心可重跑，而不是逐 token 回放一致性的人
+- 更关心可重跑，而不是逐 token 一致性的人
 
 ## 这里的 Replay 是什么意思
 
@@ -161,7 +156,6 @@ task-bundle/
 - [docs/bundle-format.md](./docs/bundle-format.md)
 - [docs/design-decisions.md](./docs/design-decisions.md)
 - [docs/replay-contract.md](./docs/replay-contract.md)
-- [docs/branding.zh-CN.md](./docs/branding.zh-CN.md)
 
 ## 五分钟演示
 
diff --git a/docs/branding.md b/docs/branding.md
@@ -1,19 +1,17 @@
 # Branding Assets
 
-Task Bundle includes repository-ready visual assets under `assets/`.
-
-The art direction is intentionally warm-editorial rather than generic SaaS gradients: a calm dark field, paper-toned bundle cards, and a benchmark signal accent that reinforces "portable tasks" plus "measurable outcomes."
+This repository includes the visual assets used in the README and GitHub social preview.
 
 ## Files
 
 - `assets/hero-banner.svg`
-  Embedded at the top of the README to make the repository landing page feel like a product, not just a package listing.
+  Hero image used at the top of the README.
 - `assets/workflow-overview.svg`
-  A second README visual that explains the capture -> inspect -> compare -> report loop in one glance.
+  Workflow diagram used in the README.
 - `assets/social-preview.svg`
   Source artwork for GitHub social preview uploads.
 - `assets/social-preview.png`
-  Recommended raster export for GitHub social preview uploads. Kept in the repository for easy manual upload, but not required in the npm package.
+  Recommended raster export for GitHub social preview uploads.
 
 ## Suggested GitHub Setup
 
@@ -23,4 +21,4 @@ The art direction is intentionally warm-editorial rather than generic SaaS gradi
 
 ## Local Export Tips
 
-If you want to regenerate the PNG on macOS, you can use Quick Look or another SVG-to-PNG tool. The repository artwork is intentionally kept as SVG so it stays editable and versionable, while the committed PNG keeps GitHub social preview setup friction low.
+The SVG files stay in the repository so they remain editable and easy to version.
diff --git a/docs/branding.zh-CN.md b/docs/branding.zh-CN.md
@@ -1,17 +1,17 @@
 # 品牌素材
 
-Task Bundle 在 `assets/` 目录下提供了一套可直接用于仓库展示的视觉素材，让 README、GitHub 首页和分享卡片能保持统一气质。
+这个仓库在 `assets/` 目录下提供了 README 和 GitHub 社交预览图会用到的视觉素材。
 
 ## 文件说明
 
 - `assets/hero-banner.svg`
-  中英文 README 顶部使用的主视觉横幅，可继续编辑。
+  README 顶部使用的主视觉横幅。
 - `assets/workflow-overview.svg`
-  README 里的第二张主视觉，用来一眼解释 capture -> inspect -> compare -> report 这条路径。
+  README 中使用的工作流示意图。
 - `assets/social-preview.svg`
   GitHub 社交预览图的可编辑源文件。
 - `assets/social-preview.png`
-  已导出的上传版本，适合直接放到 GitHub 仓库设置里。
+  GitHub 社交预览图的 PNG 版本。
 
 ## 推荐设置
 
@@ -27,4 +27,4 @@ Task Bundle 在 `assets/` 目录下提供了一套可直接用于仓库展示的
 sips -s format png ./assets/social-preview.svg --out ./assets/social-preview.png
 ```
 
-仓库里保留 SVG，是为了让这套素材更容易继续修改、做版本对比，也更适合长期维护。
+仓库里保留 SVG，是为了继续编辑和版本管理更方便。
diff --git a/docs/sample-benchmark-report.md b/docs/sample-benchmark-report.md
@@ -1,6 +1,6 @@
 # Sample Benchmark Report
 
-This page shows what `taskbundle report` looks like against the example bundles included in this repository.
+This page keeps a saved example of `taskbundle report` generated from the example bundles in this repository.
 
 ## Regenerate Locally
 
@@ -28,8 +28,8 @@ npm run dev -- report ./examples --out ./dist/benchmark-report.md
 | codex | gpt-5 | 1 | 1 | 1 | 0.93 | 0.93 |
 | claude-code | claude-sonnet-4 | 1 | 1 | 1 | 0.89 | 0.89 |
 
-## Why This Matters
+## Why Keep This Page
 
-- It gives the repo a benchmark-shaped artifact without forcing a full benchmark platform.
-- It shows that the example bundles are not toy files with no downstream use.
-- It makes cross-tool comparisons legible for humans before you build dashboards.
+- It gives readers a concrete example of the report output.
+- It shows how the example bundles can be compared without extra tooling.
+- It provides a stable link that can be referenced from the README.
diff --git a/docs/sample-benchmark-report.zh-CN.md b/docs/sample-benchmark-report.zh-CN.md
@@ -1,6 +1,6 @@
 # 示例 Benchmark 报告
 
-这个页面展示的是：把仓库自带的 example bundles 交给 `taskbundle report` 之后，大概会得到什么样的结果。
+这个页面保存了一份基于仓库示例 bundle 生成的 `taskbundle report` 输出，方便直接查看结果长什么样。
 
 ## 本地重新生成
 
@@ -21,15 +21,15 @@ npm run dev -- report ./examples --out ./dist/benchmark-report.md
 | 1 | Fix greeting punctuation | codex | gpt-5 | success | 0.93 | 3 | 1 |
 | 2 | Fix greeting punctuation | claude-code | claude-sonnet-4 | success | 0.89 | 4 | 1 |
 
-## Tool / Model 排行
+## 按工具 / 模型汇总
 
 | Tool | Model | Runs | Scored | Successes | Avg Score | Best Score |
 | --- | --- | --- | --- | --- | --- | --- |
 | codex | gpt-5 | 1 | 1 | 1 | 0.93 | 0.93 |
 | claude-code | claude-sonnet-4 | 1 | 1 | 1 | 0.89 | 0.89 |
 
-## 为什么这个页面有价值
+## 为什么保留这个页面
 
-- 它让仓库直接具备一个 benchmark 风格的可见成果，不需要先做完整平台。
-- 它说明 example bundles 不是摆设，而是真的可以继续拿来分析和比较。
-- 它让“跨工具比较”在没有 dashboard 之前，也已经足够清楚可读。
+- 让读者直接看到报告输出的样子。
+- 说明仓库里的示例 bundle 可以继续拿来比较和分析。
+- README 可以稳定链接到这份示例结果。