-
RL Public
Forked from NVIDIA-NeMo/RLScalable toolkit for efficient model reinforcement
Python Apache License 2.0 UpdatedFeb 20, 2026 -
Megatron-LM Public
Forked from NVIDIA/Megatron-LMOngoing research training transformer models at scale
Python Other UpdatedFeb 20, 2026 -
TransformerEngine Public
Forked from NVIDIA/TransformerEngineA library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilizatio…
Python Apache License 2.0 UpdatedNov 13, 2025 -
batch_invariant_ops Public
Forked from thinking-machines-lab/batch_invariant_opsPython MIT License UpdatedOct 14, 2025 -
mamba Public
Forked from state-spaces/mambaMamba SSM architecture
-
NeMo Public
Forked from NVIDIA-NeMo/NeMoNeMo: a toolkit for conversational AI
Python Apache License 2.0 UpdatedDec 18, 2023 -
NeMo-Megatron-Launcher Public
Forked from NVIDIA/NeMo-Framework-LauncherNeMo Megatron launcher and tools
-
LDDL Public
Forked from NVIDIA/LDDLDistributed preprocessing and data loading for language datasets
Python Other UpdatedMay 30, 2023 -
aws-parallelcluster-post-install-scripts Public
Forked from aws-samples/aws-parallelcluster-post-install-scriptsScripts to customize AWS ParallelCluster
Shell MIT No Attribution UpdatedMay 16, 2023 -
nephele Public
Forked from NVIDIA/nepheleTools to deploy GPU clusters in the Cloud
HCL Apache License 2.0 UpdatedApr 4, 2023 -
torchscale Public
Forked from microsoft/torchscaleTransformers at any scale
Python MIT License UpdatedJan 19, 2023 -
open_clip Public
Forked from mlfoundations/open_clipAn open source implementation of CLIP.
Python Other UpdatedJan 9, 2023 -


