Pull requests: pytorch/helion

Pull requests list

[Pallas] Fix synchronize_device skipping sync for tuple returns on TPU CLA Signed This label is managed by the Meta Open Source bot.
#2580 opened May 25, 2026 by thcmbs Collaborator
[test] Split cute compiler-pass tests by pass, drop kernel name from filenames CLA Signed
#2579 opened May 25, 2026 by oulgen Contributor
[cute] Rewrite online softmax_two_pass → equivalent 3-pass form CLA Signed
#2578 opened May 25, 2026 by oulgen Contributor
[cute] Optional carried-only gate for the load pipeline pass CLA Signed
#2577 opened May 25, 2026 by oulgen Contributor
[cute] Software-pipeline inner vec loads to hide HBM latency CLA Signed
#2576 opened May 25, 2026 by oulgen Contributor
[cute] Cleaner LICM: alias DCE + FMA-friendly scale hoist CLA Signed
#2575 opened May 25, 2026 by oulgen Contributor
[cute] LICM for reciprocals: hoist 1/divisor out of inner loops CLA Signed
#2574 opened May 25, 2026 by oulgen Contributor
[cute] Warp-per-row layout for softmax-shape reductions CLA Signed
#2573 opened May 25, 2026 by oulgen Contributor
[cute] Merge sibling constexpr V-loops + elide warp-reduce dtype wraps CLA Signed
#2572 opened May 25, 2026 by oulgen Contributor
[cute] V=8 fp16/bf16 via 4+4 split; multi-row investigation tests CLA Signed
#2571 opened May 25, 2026 by oulgen Contributor
[cute] Softmax perf: vec hoist, fuser alias, hoist warp reduce, heuristics CLA Signed
#2570 opened May 25, 2026 by oulgen Contributor
Fix fbcode CI torch.compile fusion with newer PyTorch CLA Signed
#2567 opened May 23, 2026 by choijon5 Contributor
Speed up Helion kernel launches by avoiding repeated Python work CLA Signed
#2565 opened May 23, 2026 by yushangdi Contributor Draft
[Pallas] Fix layernorm example tolerances and split bwd test CLA Signed
#2560 opened May 22, 2026 by thcmbs Collaborator Draft
[Pallas] Propagate inner tile alignment min_size to bounding outer tiles CLA Signed
#2559 opened May 22, 2026 by thcmbs Collaborator
[Pallas] Add support for non zero dim in gather CLA Signed
#2558 opened May 22, 2026 by thcmbs Collaborator
Reject tensor_descriptor indexing when block size exceeds tensor dim (#2555) CLA Signed fb-exported meta-exported
#2555 opened May 22, 2026 by mengluy0125 Contributor
Skip even more Python on repeated identical calls CLA Signed
#2537 opened May 20, 2026 by choijon5 Contributor Draft
Reuse kernel output buffers instead of allocating fresh on every call CLA Signed
#2536 opened May 20, 2026 by choijon5 Contributor Draft
Use the fast launcher during autotuning CLA Signed
#2535 opened May 20, 2026 by choijon5 Contributor Draft
Add a C extension so launches skip more Python frames CLA Signed
#2534 opened May 20, 2026 by choijon5 Contributor Draft
Speed up Helion kernel launches by avoiding repeated Python work CLA Signed
#2533 opened May 20, 2026 by choijon5 Contributor Draft
[WIP] Pallas grid index map fp8 attention CLA Signed
#2530 opened May 20, 2026 by thcmbs Collaborator Draft
[WIP] Fix Pallas grid index BlockSpecs CLA Signed
#2529 opened May 20, 2026 by thcmbs Collaborator Draft
[Pallas] Reclaim HBM between kernels in run_tpu.py sweep CLA Signed
#2495 opened May 20, 2026 by norx1991 Contributor Draft