antirez / ds4 Public



Pull requests: antirez/ds4


157 Open 138 Closed


Pull requests list

Draft of Inference Sandbox jj-dai.org

#490 opened Jul 3, 2026 by VLADLEVIT • Draft

Server: fix agent-loop cache misses, add cancellation, observability, and robustness fixes

#489 opened Jul 2, 2026 by elkaix

cuda: enable streaming auto cache (implement recommended_working_set_size)

#488 opened Jul 2, 2026 by riccardo-galbani

cuda: fall back to pinned host memory when the model arena runs out of VRAM

#487 opened Jul 2, 2026 by riccardo-galbani

Enable MI300X ROCm support

#484 opened Jul 1, 2026 by ehartford

DSpark B2 rejection sampling + adaptive block sizing

#482 opened Jun 30, 2026 by machiabeli

Add DSpark speculative draft runtime

#480 opened Jun 30, 2026 by audreyt Contributor

feat: add headless browser support with curl fallback for web tools

#479 opened Jun 29, 2026 by J3rr1ck

CUDA: make DeepSeek-V4-Pro correct on the indexed-attention path (top_k 512→1024) + enable decode LUT gate for in_dim>4096

#478 opened Jun 29, 2026 by slackarea

Support DeepSeek V4 Flash 4Expert (top-4)

#474 opened Jun 28, 2026 by yuhai-china

CUDA: scale q8->f16 cache reserve on >=112 GiB cards (fixes session OOM on large models)

#472 opened Jun 28, 2026 by slackarea

Fix CUDA MoE router hardcoded to 256 experts

#466 opened Jun 27, 2026 by slackarea

Fix slow decodes "poisoning" sleep times when using power throttling

#464 opened Jun 27, 2026 by omnomburp

ROCm: discrete GPU memory management

#461 opened Jun 26, 2026 by cattivik66

CUDA: batch gate/up/down uploads for selected expert cache misses

#460 opened Jun 26, 2026 by fmolara

Add served model name option for server discovery

#456 opened Jun 25, 2026 by RiccardoFiorentini

Metal: keep selected-address SSD prefill opt-in by default

#454 opened Jun 25, 2026 by andreaborio • Draft

Fix typo in README

#453 opened Jun 25, 2026 by mwbini

Support SSD streaming for Q4_K routed experts on ROCm

#451 opened Jun 24, 2026 by kmc6042

Protect incoming KV prefix during live miss

#448 opened Jun 23, 2026 by JordiPosthumus

Fix ROCm Q8->F16 cache reserve starving session tensors on large models (q4q2)

#446 opened Jun 23, 2026 by alantsev Contributor

AGENTS.md rename (and server performance improvements?)

#443 opened Jun 21, 2026 by OPS-NeoRetro

Add Quickstart section to README

#438 opened Jun 18, 2026 by sethconvex

cuda: generalize router-select for arbitrary expert count (fixes Pro on CUDA, #427)

#435 opened Jun 17, 2026 by newjordan • Draft

Fix quality-score link after streaming refactor

#434 opened Jun 17, 2026 by andreaborio
