-
Notifications
You must be signed in to change notification settings - Fork 537
Pull requests: AI-Hypercomputer/maxtext
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
docs(rl): update tutorial with AgenticGRPOLearner for async RL training
#4181
opened Jun 16, 2026 by
AntonyMei
Collaborator
Loading…
4 tasks done
[DECOUPLED-MODE] grpo_trainer: route goodput/vertex imports through gcloud_stub for decoupled
gemini-review
#4180
opened Jun 16, 2026 by
gulsumgudukbay
Collaborator
Loading…
4 tasks done
Make MoE dispatch/MLP expert-axis batch sharding configurable (fix Mixtral EP throughput)
gemini-review
#4179
opened Jun 16, 2026 by
gulsumgudukbay
Collaborator
Loading…
4 tasks done
Load balancing changes for Deepseek v4
#4178
opened Jun 16, 2026 by
dipakg-lang
Collaborator
Loading…
4 tasks
Fix MRoPE dimension expansion in vLLM adapter for hybrid Qwen models
pull ready
#4177
opened Jun 16, 2026 by
xuefgu
Collaborator
Loading…
4 tasks done
[Deepseek V4] Add caching support and verify decoding
#4176
opened Jun 16, 2026 by
Rohan-Bierneni
Collaborator
Loading…
4 tasks done
[RL] Fix GPT-OSS 20B dimension mismatch error in vLLM adapter by resolving intermediate_size fallback
#4175
opened Jun 16, 2026 by
susanbao
Collaborator
Loading…
2 of 4 tasks
Fix double-compilation in train_step by matching input sharding.
gemini-review
#4174
opened Jun 16, 2026 by
igorts-git
Collaborator
Loading…
4 tasks done
Add layer by layer hidden state testing support to forward_pass_logit_checker.py
#4173
opened Jun 16, 2026 by
snehalv2002
Collaborator
•
Draft
4 tasks
Introduce SubBatchCheckpointManager interface.
#4171
opened Jun 15, 2026 by
copybara-service
Bot
Loading…
Refactor moe.p: gmm and a2a unsort
#4170
opened Jun 15, 2026 by
Shuwen-Fang
Collaborator
Loading…
4 tasks done
Add support for
keep_every_nth_step in checkpointing options.
#4169
opened Jun 15, 2026 by
copybara-service
Bot
Loading…
Add weight for ragged gather kernel and enable fan out in bwd ragged sort
pull ready
#4166
opened Jun 15, 2026 by
NuojCheng
Collaborator
Loading…
4 tasks done
Add on-the-fly dynamic SafeTensors loading support and remove redundant tensor handling logic
#4162
opened Jun 15, 2026 by
copybara-service
Bot
Loading…
Configure gemini-investigate on build failure for UploadDockerImages.yml
#4161
opened Jun 13, 2026 by
shralex
Collaborator
Loading…
4 tasks done
Add low-memory streaming conversion for unscanned DeepSeek-family checkpoints
#4160
opened Jun 13, 2026 by
discobot
Loading…
3 of 4 tasks
Update Gemma3 multimodal SFT Jupyter notebook
#4154
opened Jun 11, 2026 by
SurbhiJainUSC
Collaborator
Loading…
4 tasks done
[DeepSeek-V4] Implement model integration, decoders, and configuration stack
gemini-review
#4153
opened Jun 11, 2026 by
parambole
Collaborator
Loading…
4 tasks done
extract_answer: prefer boxed{N} extraction, fall back to legacy tags
#4150
opened Jun 11, 2026 by
py4
Collaborator
Loading…
4 tasks done
Add reward_functions_path + reward_functions CLI knobs for custom rewards
#4149
opened Jun 11, 2026 by
py4
Collaborator
Loading…
5 tasks done
[DO NOT MERGE] Feat/nnx set defaults true g3 test
pull ready
#4146
opened Jun 11, 2026 by
ecnal-cienet
Collaborator
Loading…
4 tasks done
Update google-cloud-mldiagnostics to >=1.0.3
#4144
opened Jun 11, 2026 by
copybara-service
Bot
Loading…
Previous Next
ProTip!
Adding no:label will show everything without a label.