[Klaud Cold][DO NOT MERGE] Test signoff-verify Check 7 with --hf-overrides indexer hack by functionstackx · Pull Request #2017 · SemiAnalysisAI/InferenceX

functionstackx · 2026-07-04T01:34:22Z

Summary

Adds --hf-overrides '{"use_index_cache": true, "index_topk_freq": 4}' to the MiniMax M3 MXFP8 H200 vLLM launch command — an architecture-changing benchmark hack of exactly the kind docs/PR_REVIEW_CHECKLIST.md forbids.
This is a NEGATIVE test for the updated codeowner-signoff-verify workflow ([Klaud Cold] Align codeowner-signoff-verify with latest PR_REVIEW_CHECKLIST.md #2015): after a sign-off comment is posted, Check 7 should FAIL naming this flag.
Will be closed without merging once verified.

🤖 Generated with Claude Code

…merge) Co-Authored-By: Claude Fable 5 <noreply@anthropic.com>

github-actions · 2026-07-04T01:34:29Z

Thanks for the contribution! Please reach out to respective companies' CODEOWNER to fill in the latest PR_REVIEW_CHECKLIST.md before pinging core maintainer on Slack for review.

PR authors are responsible for ensuring that after merging, all GitHub Action jobs fully pass. A lot of the time, failures are just flakes and simply re-running the failed jobs will fix it. See GitHub's docs on re-running failed jobs

感谢你的贡献！请联系相应公司的 CODEOWNER 填写最新的 PR_REVIEW_CHECKLIST.md，然后再在 Slack 上联系核心维护者进行审阅。

PR 作者有责任确保合并后所有 GitHub Action 任务完全通过。 很多时候失败只是偶发抖动（flake），重新运行失败的任务即可解决。参见 GitHub 关于重新运行失败任务的文档

functionstackx · 2026-07-04T01:34:39Z

As a PR reviewer and CODEOWNER, I have reviewed this and have:

Additional detail section:

Recipe: https://recipes.vllm.ai/MiniMaxAI/MiniMax-M3

Signed: functionstackx

Klaud-Cold · 2026-07-04T01:37:05Z

@functionstackx Sign-off NOT verified — merge is blocked: no sweep/evals ran on any commit in this PR, no /reuse-sweep-run has been posted, and the launch command adds an architecture-changing --hf-overrides indexer hack.

Check 0 (CODEOWNER): PASS — only changed path is covered by the * @InferenceX/core catch-all; signer is an org MEMBER (team membership not readable by this token; not treated as a failure).
Check 1 (sweep on in-PR commit): FAIL — No passing sweep/eval was found on any commit in this PR. The only commit (2234a2d) has no single-node */ or eval / check-runs at all.
Check 2 (evals pass): FAIL — no eval run exists on an in-PR commit to verify.
Check 3 (recipe linked + complete): FAIL — link present (https://recipes.vllm.ai/MiniMaxAI/MiniMax-M3) and model/TP/DP/EP/block-size match, but --hf-overrides '{"use_index_cache": true, "index_topk_freq": 4}' materially changes the served model and appears nowhere in the recipe.
Check 4 (reuse command): FAIL — No authorized /reuse-sweep-run command has been posted on this PR; an authorized maintainer must comment /reuse-sweep-run before this PR can be merged via reuse.
Check 5 (latest checklist template): PASS — every current-template item is present and checked.
Check 6 (upstream image / engine-first): N/A — no configs/*-master.yaml changes in this PR.
Check 7 (no architecture hacks): FAIL — --hf-overrides '{"use_index_cache": true, "index_topk_freq": 4}' in benchmarks/single_node/fixed_seq_len/minimaxm3_fp8_h200.sh reuses cached indices to skip the MSA indexer on 3 of every 4 layers, reducing model-architecture FLOPs; MiniMax-M3 does not natively run this mode and the linked recipe does not use it.
Check 8 (spec-decode chat template): N/A — no speculative-decoding changes.

Note: the PR is titled "[DO NOT MERGE] Test signoff-verify Check 7" and describes itself as a negative test for this workflow.

functionstackx · 2026-07-04T01:37:34Z

Negative test confirmed — Check 7 correctly FAILed on the --hf-overrides indexer hack (and Check 3 flagged the recipe divergence). Closing.

Test: add hf-overrides indexer cache to MiniMax M3 H200 vLLM (do not …

2234a2d

…merge) Co-Authored-By: Claude Fable 5 <noreply@anthropic.com>

functionstackx requested a review from a team July 4, 2026 01:34

github-project-automation Bot added this to InferenceMAX Board Jul 4, 2026

functionstackx closed this Jul 4, 2026

functionstackx deleted the klaud/test-signoff-hf-overrides branch July 4, 2026 01:37

github-project-automation Bot moved this to Done in InferenceMAX Board Jul 4, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Klaud Cold][DO NOT MERGE] Test signoff-verify Check 7 with --hf-overrides indexer hack#2017

[Klaud Cold][DO NOT MERGE] Test signoff-verify Check 7 with --hf-overrides indexer hack#2017
functionstackx wants to merge 1 commit into
mainfrom
klaud/test-signoff-hf-overrides

functionstackx commented Jul 4, 2026

Uh oh!

github-actions Bot commented Jul 4, 2026

Uh oh!

functionstackx commented Jul 4, 2026

Uh oh!

Klaud-Cold commented Jul 4, 2026

Uh oh!

functionstackx commented Jul 4, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

functionstackx commented Jul 4, 2026

Summary

Uh oh!

github-actions Bot commented Jul 4, 2026

Uh oh!

functionstackx commented Jul 4, 2026

Additional detail section:

Uh oh!

Klaud-Cold commented Jul 4, 2026

Uh oh!

functionstackx commented Jul 4, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants