Bump sglang from 0.5.4 to 0.5.12.post1 #7

Open
dependabot[bot] wants to merge 1 commit into main from dependabot/pip/sglang-0.5.12.post1

dependabot[bot] commented on behalf of github on May 29, 2026

Bumps sglang from 0.5.4 to 0.5.12.post1.

Release notes

Sourced from sglang's releases.

v0.5.12.post1 is a stability patch on top of v0.5.12. It cherry-picks 12 fixes — primarily for DeepSeek V4 — onto the release branch.
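For reviewers checking how the three versions in this bump relate, here is a minimal sketch of the ordering, assuming a simplified subset of PEP 440 (plain numeric releases with an optional `.postN` suffix). The helper `parse_version` is hypothetical and written for illustration only; it is not part of sglang or pip, and it does not handle pre-releases, dev releases, epochs, or trailing-zero normalization.

```python
import re


def parse_version(text: str) -> tuple:
    """Parse 'X.Y.Z' or 'X.Y.Z.postN' into a sortable tuple.

    Hypothetical helper covering only a simplified subset of PEP 440.
    """
    match = re.fullmatch(r"v?(\d+(?:\.\d+)*)(?:\.post(\d+))?", text)
    if match is None:
        raise ValueError(f"unsupported version string: {text!r}")
    # Compare release segments as integers, so that 12 sorts after 4.
    release = tuple(int(part) for part in match.group(1).split("."))
    # A post-release sorts after its base release; -1 marks "no post segment".
    post = int(match.group(2)) if match.group(2) is not None else -1
    return (release, post)


versions = ["0.5.12.post1", "0.5.4", "0.5.12"]
ordered = sorted(versions, key=parse_version)
print(ordered)
```

Comparing the segments as integers matters here: a plain string comparison would place "0.5.12" before "0.5.4". For real dependency tooling, a full PEP 440 implementation is the safer choice than this sketch.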

Bug Fixes

DeepSeek V4

  • DSV4-Pro emits garbled text during single-token decode on B200/B300 (fix deep_gemm UE8M0 scale-packing path by ceiling activation scales before packing): #25733
  • DSV4 + EAGLE/MTP in disaggregation decode crashes around 2000 requests with a SWA allocator assertion (recycled KV pages kept stale sliding-window mappings): #25805
  • DSV4 NSA prefill context-parallel (--enable-nsa-prefill-context-parallel --nsa-prefill-cp-mode round-robin-split) in --disaggregation-mode prefill: scheduler crash at startup: #25396
  • DSV4 HiSparse + SGLANG_OPT_USE_COMPRESSOR_V2=1: GSM8K accuracy restored from 0.825 → 0.960: #25646
  • DSV4 PD disaggregation now works with pipeline parallelism > 1 (removed stale pp_size=1 assertion): #25771
  • DSV4-Flash with --load-format dummy + FlashInfer mxfp4 hits CUDA illegal memory access during CUDA-graph capture (the integer HashTopK.tid2eid lookup table was left uninitialized by dummy load): #25892
  • DSV4 HiCache + SGLANG_OPT_CACHE_SWA_TRANSLATION=1 returns stale translation indices after a cache rebuild, causing OOB writes / wrong outputs: #25889
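The first fix in the list above mentions ceiling activation scales before UE8M0 packing. As a rough illustration of that idea only, the sketch below rounds a positive scale up to the next power of two, encodes it as a biased 8-bit exponent, and packs four such codes into one 32-bit word. It is not the deep_gemm implementation: the function names, the bias of 127 (taken from the OCP MX E8M0 scale format), and the low-byte-first packing order are all assumptions. One plausible reason for rounding up rather than down is that a scale rounded down could push the scaled values past the range of the quantized type.

```python
import math

UE8M0_BIAS = 127  # assumed exponent bias, as in the OCP MX E8M0 scale format


def ceil_to_ue8m0(scale: float) -> int:
    """Return the biased exponent code of the smallest power of two >= scale."""
    if not (scale > 0.0) or math.isinf(scale):
        raise ValueError("scale must be a positive finite float")
    # frexp gives scale == mantissa * 2**exponent with 0.5 <= mantissa < 1.
    mantissa, exponent = math.frexp(scale)
    # An exact power of two has mantissa 0.5 and needs no rounding up.
    power = exponent - 1 if mantissa == 0.5 else exponent
    # Clamp to codes 0..254; code 255 is skipped because E8M0 reserves it for NaN.
    # Scales above 2**127 are clamped down, so the ceiling property ends there.
    return min(max(power + UE8M0_BIAS, 0), 254)


def decode_ue8m0(code: int) -> float:
    """Convert a biased exponent code back to its power-of-two value."""
    return math.ldexp(1.0, code - UE8M0_BIAS)


def pack_ue8m0(codes: list[int]) -> int:
    """Pack four 8-bit codes into one 32-bit word, first code in the low byte."""
    if len(codes) != 4:
        raise ValueError("expected exactly four codes")
    word = 0
    for index, code in enumerate(codes):
        word |= (code & 0xFF) << (8 * index)
    return word


scales = [0.75, 1.0, 3.0, 8.0]
codes = [ceil_to_ue8m0(s) for s in scales]
packed = pack_ue8m0(codes)
print(codes, hex(packed))
```

In this sketch every decoded scale is at least as large as the input scale, which is the property the ceiling step is meant to preserve within the representable range.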

Disaggregation

  • [PD][NIXL] Always send aux on is_last; only expect state when truthy: #25699

Other

  • Fix missing group arg in get_dp_buffer: #25585

Performance

  • DSV4: warm MHC token-count buckets at startup (gated to SGLANG_OPT_DEEPGEMM_HC_PRENORM=1 + SGLANG_OPT_USE_TILELANG_MHC_PRE=1 + hybrid SWA) to eliminate 20–40s cold-bucket forward stalls: #25810
  • DSV4-Pro: precompile a DeepGEMM branch for _dispatch_bf16_fp32_backend to cut runtime JIT compile cost: #25860

Dependencies

  • Use [cu13] extra for nvidia-cutlass-dsl (default to CUDA 13; required for sm_103 / B300): #25576

All PRs included in this release: sgl-project/sglang@v0.5.12...v0.5.12.post1

Full Changelog: sgl-project/sglang@v0.5.12...v0.5.12.post1

v0.5.12

Highlights

  • DeepSeek V4 support: Full inference path for DeepSeek-V4 (#23882), including:

    Day-0 Features: #23882

    • Parallelism: Tensor Parallelism/Expert Parallelism/Context Parallelism/Data Parallel Attention
    • Hardware: Nvidia B300/B200/H200/H100/GB200/GB300, AMD MI35X
    • Prefill-Decode Disaggregation
    • HiSparse for offloading inactive KV cache to CPU memory
    • Reasoning parser and Tool Call Parser
    • DeepGemm and FlashMLA kernels for DeepSeek V4, including MegaMoE

    Post-Day-0 additions:

    • HiCache for DeepSeek V4 under unified Radix Tree [UnifiedTree]: #24691
    • W4A4 MegaMoE kernels — faster speed with negligible accuracy drop: #25052
    • Marlin/FlashInfer W4A8 MoE kernels on Hopper: #24816 #24986
    • Faster V2 fused compression kernels: #24890

... (truncated)

Commits
  • 5a15cde [Cherry-pick to release/v0.5.12] fix: fix deepseek v4 CP error (#25396) (#26...
  • ff700aa [Cherry-pick to release/v0.5.12] ci: self-heal $GITHUB_PATH/$GITHUB_ENV write...
  • d8d60ee [Cherry-pick to release/v0.5.12] [CI] pr-test-extra: add run_all_tests to wor...
  • 39f29f0 [CI] release/v0.5.12: fix base-* suite names dragged in by cherry-picks (#26109)
  • fd10d3e [Cherry-pick to release/v0.5.12] [Fix] DSV4 cached_loc invalidated when SWA m...
  • a573877 [Cherry-pick to release/v0.5.12] Fix SWA double-free in disagg decode with MT...
  • 928c0c7 [Cherry-pick to release/v0.5.12] add git gemm warpper for dispatch_bf16_fp32_...
  • b18120b [Cherry-pick to release/v0.5.12] fix(dsv4): drop stale pp_size=1 guard for V4...
  • ef0fab8 [Cherry-pick to release/v0.5.12] Fix/dsv4 flash eagle dummy ima (#25892) (#26...
  • efe5c83 [Cherry-pick to release/v0.5.12] fix deepseek v4 hisparse (#25646) (#26072)
  • Additional commits viewable in compare view

Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.


Dependabot commands and options

You can trigger Dependabot actions by commenting on this PR:

  • @dependabot rebase will rebase this PR
  • @dependabot recreate will recreate this PR, overwriting any edits that have been made to it
  • @dependabot show <dependency name> ignore conditions will show all of the ignore conditions of the specified dependency
  • @dependabot ignore this major version will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)
  • @dependabot ignore this minor version will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)
  • @dependabot ignore this dependency will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)
    You can disable automated security fix PRs for this repo from the Security Alerts page.

Bumps [sglang](https://github.com/sgl-project/sglang) from 0.5.4 to 0.5.12.post1.
- [Release notes](https://github.com/sgl-project/sglang/releases)
- [Commits](sgl-project/sglang@v0.5.4...v0.5.12.post1)

---
updated-dependencies:
- dependency-name: sglang
  dependency-version: 0.5.12.post1
  dependency-type: direct:production
...

Signed-off-by: dependabot[bot] <support@github.com>
dependabot[bot] added the dependencies and python labels on May 29, 2026
Labels

  • dependencies: Pull requests that update a dependency file
  • python: Pull requests that update python code