Skip to content

Pull requests: AI-Hypercomputer/maxtext

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Save run manifest for distillation reproducibility. gemini-review
#3709 opened Apr 21, 2026 by gagika Collaborator Loading…
4 tasks done
Add optional --skip-validation flag to benchmark recipes and XPK workload creation
#3708 opened Apr 21, 2026 by RUEI4341 Contributor Loading…
4 tasks done
[Draft] Onboard ragged gather to moe draft Draft PR
#3707 opened Apr 21, 2026 by NuojCheng Collaborator Draft
4 tasks
draft pull ready
#3703 opened Apr 20, 2026 by shuningjin Collaborator Draft
4 tasks done
Jackyf/qlora
#3702 opened Apr 20, 2026 by RexBearIU Collaborator Draft
4 tasks
distillation: resume + xpk launcher + metrics refactor gemini-review
#3701 opened Apr 20, 2026 by gagika Collaborator Loading…
4 tasks done
Speed up cpu-unit CI gemini-review
#3700 opened Apr 19, 2026 by gagika Collaborator Draft
4 tasks
fix masking error when using mlp_bias=True causing NaN during gradien…
#3699 opened Apr 19, 2026 by snehalv2002 Collaborator Loading…
4 tasks done
update test
#3690 opened Apr 17, 2026 by shuningjin Collaborator Draft
4 tasks
Enable elastic training for RL
#3689 opened Apr 17, 2026 by copybara-service bot Loading…
[Distillation] base learn-to-init llama attention for distillation
#3688 opened Apr 17, 2026 by vlad-karp Collaborator Loading…
4 tasks done
[DRAFT] Vllm fused moe draft Draft PR
#3687 opened Apr 16, 2026 by NuojCheng Collaborator Draft
4 tasks
feat: Add FLOPs calculation for Multi-Token Prediction (MTP) modules gemini-review
#3685 opened Apr 16, 2026 by parambole Collaborator Loading…
4 tasks done
[Inference] Diverse Beam Search Integration
#3681 opened Apr 16, 2026 by yipkingster Loading…
5 tasks done
Add MoE load balancing loss to distillation
#3679 opened Apr 16, 2026 by JamesDeng42 Collaborator Loading…
4 tasks done
fix gemma4 to vllm weight conversion
#3677 opened Apr 15, 2026 by aireenmei Collaborator Draft
4 tasks
test: refactor DeepSeek v3 MTP tests into standard two-step structure pull ready
#3676 opened Apr 15, 2026 by parambole Collaborator Loading…
4 tasks done
Support grain data checkpoint for elastic training
#3673 opened Apr 15, 2026 by aireenmei Collaborator Draft
4 tasks done
Make all links internal (where possible)
#3671 opened Apr 15, 2026 by melissawm Collaborator Loading…
1 task done
ProTip! Follow long discussions with comments:>50.