-
Notifications
You must be signed in to change notification settings - Fork 2.5k
Pull requests: NVIDIA/TensorRT-LLM
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[None][infra] Waive 1 failed cases for main in pre-merge 43190
#15352
opened Jun 14, 2026 by
ZhanruiSunCh
Collaborator
Loading…
[None][fix] AutoDeploy: return fused_weight_dims so fused QKV split sizes are rescaled under TP
#15351
opened Jun 14, 2026 by
CodersAcademy006
•
Draft
[#4442][feat] Add client-side A2A (Agent2Agent) protocol support to Scaffolding
#15350
opened Jun 14, 2026 by
Achyuthan-S
Loading…
2 tasks
[https://nvbugs/6293536][fix] order KV cache transfers with overlap scheduler
#15349
opened Jun 14, 2026 by
VALLIS-NERIA
Collaborator
•
Draft
[feat] Enable MLA chunked prefill and KV cache reuse on SM121
#15347
opened Jun 14, 2026 by
CodersAcademy006
Loading…
[https://nvbugs/6293712][fix] Patch GSM8K.EVALUATE_KWARGS with scores_filter="exact_match,strict-match"…
#15346
opened Jun 14, 2026 by
tensorrt-cicd
Collaborator
Loading…
2 tasks done
[TRTLLM-12339][feat] enable TRTLLM cross attention backend
#15345
opened Jun 14, 2026 by
cascade812
Collaborator
•
Draft
[https://nvbugs/6287561][fix] Add
get_sm_version() < 90 check at the top of run_MTP() in…
#15343
opened Jun 13, 2026 by
tensorrt-cicd
Collaborator
Loading…
2 tasks done
[None][test] Waive 8 failed cases for main in QA CI
#15342
opened Jun 13, 2026 by
tensorrt-cicd
Collaborator
•
Draft
[None][test] Waive 2 failed cases for main in QA CI
#15341
opened Jun 13, 2026 by
tensorrt-cicd
Collaborator
•
Draft
[None][test] Waive 9 failed cases for main in QA CI
#15340
opened Jun 13, 2026 by
tensorrt-cicd
Collaborator
•
Draft
[None][fix] fix tinygemm barrier bug
#15338
opened Jun 13, 2026 by
yweng0828
Collaborator
Loading…
1 task done
[None][test] Waive 23 failed cases for main in QA CI
#15337
opened Jun 13, 2026 by
tensorrt-cicd
Collaborator
•
Draft
[None][test] CI-only: probe gb200 deepseek-v32 perf-sanity at 5f106dfa (DO NOT MERGE)
#15336
opened Jun 13, 2026 by
tensorrt-cicd
Collaborator
•
Draft
[TRTLLM-12807][test] Guard thop attention kwarg aliases
#15335
opened Jun 13, 2026 by
yuxianq
Collaborator
Loading…
1 task done
[None][test] Waive 5 failed cases for main in QA CI
#15334
opened Jun 13, 2026 by
tensorrt-cicd
Collaborator
•
Draft
[https://nvbugs/6281014][fix] fix the repeated cute.compile and simpilify the test
#15331
opened Jun 13, 2026 by
JadoTu
Collaborator
Loading…
1 task done
[#15327][feat] Add per-request priority support to OpenAI chat/completions
#15329
opened Jun 12, 2026 by
sopwg612
Loading…
1 task done
[https://nvbugs/6306936][test] Re-enable AutoDeploy disagg tests
#15325
opened Jun 12, 2026 by
govind-ramnarayan
Collaborator
Loading…
1 task done
[TRTLLM-12721][perf] Remove ready-ID transfer gathers
#15324
opened Jun 12, 2026 by
chienchunhung
Collaborator
•
Draft
[None][test] Fix Mamba hybrid transceiver helper
#15323
opened Jun 12, 2026 by
chienchunhung
Collaborator
Loading…
[None][chore] Small cleanups to MultimodalModelMixin
#15322
opened Jun 12, 2026 by
2ez4bz
Collaborator
Loading…
1 task done
[https://nvbugs/6193854][fix] PR #14851 already removed the bad
is_sliding_window/mMaxSeqLenKv logic on…
#15321
opened Jun 12, 2026 by
tensorrt-cicd
Collaborator
Loading…
2 tasks done
Previous Next
ProTip!
What’s not been updated in a month: updated:<2026-05-14.