Pinned
- cleanrl (Public, forked from vwxyzjn/cleanrl)
  High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
  Python
- NVIDIA/Megatron-LM (Public)
  Ongoing research training transformer models at scale
- NVIDIA/Model-Optimizer (Public)
  A unified library of SOTA model optimization techniques like quantization, distillation, pruning, neural architecture search, speculative decoding, etc. It compresses deep learning models for downs…
- NVIDIA-NeMo/RL (Public)
  Scalable toolkit for efficient model reinforcement


