GitHub topics: megatron-lm
shreyansh26/Annotated-ML-Papers
Annotations of the interesting ML papers I read
Size: 315 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 240 - Forks: 23

openpsi-project/ReaLHF 📦
Super-Efficient RLHF Training of LLMs with Parameter Reallocation
Language: Python - Size: 8.75 MB - Last synced at: about 18 hours ago - Pushed at: 14 days ago - Stars: 291 - Forks: 18

alibaba/Megatron-LLaMA Fork of NVIDIA/Megatron-LM
Best practice for training LLaMA models in Megatron-LM
Language: Python - Size: 4.18 MB - Last synced at: 8 days ago - Pushed at: over 1 year ago - Stars: 650 - Forks: 56

GoogleCloudPlatform/nvidia-nemo-on-gke
Training NVIDIA NeMo Megatron Large Language Model (LLM) using NeMo Framework on Google Kubernetes Engine
Language: HCL - Size: 964 KB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 12 - Forks: 7

yanring/Megatron-MoE-ModelZoo
Best practices for testing advanced Mixtral, DeepSeek, and Qwen series MoE models using Megatron Core MoE.
Language: Python - Size: 26.4 KB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 8 - Forks: 1

xrsrke/pipegoose
Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)*
Language: Python - Size: 1.26 MB - Last synced at: 12 days ago - Pushed at: over 1 year ago - Stars: 82 - Forks: 18

janelu9/EasyLLM
Running Large Language Model easily.
Language: Python - Size: 220 MB - Last synced at: 6 days ago - Pushed at: 23 days ago - Stars: 8 - Forks: 0

feifeibear/Odysseus-Transformer
Odysseus: Playground of LLM Sequence Parallelism
Language: Python - Size: 468 KB - Last synced at: about 1 month ago - Pushed at: 11 months ago - Stars: 68 - Forks: 3

0-1CxH/megatron-wrap
Wrapped Megatron: As User-Friendly as HuggingFace, As Powerful as Megatron-LM | Megatron封装:和HuggingFace一样方便,和Megatron-LM一样强大
Language: Python - Size: 2.41 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

MoFHeka/LLaMA-Megatron
A LLaMA1/LLaMA12 Megatron implement.
Language: Python - Size: 288 KB - Last synced at: 6 days ago - Pushed at: over 1 year ago - Stars: 28 - Forks: 2

Beomi/megatronlm_dataset_autotokenizer
Megatron-LM/GPT-NeoX compatible Text Encoder with 🤗Transformers AutoTokenizer.
Language: Python - Size: 498 KB - Last synced at: 1 day ago - Pushed at: over 1 year ago - Stars: 6 - Forks: 1

GJ98/Megatron-LM
Megatron-LM implemented by PyTorch
Language: Python - Size: 9.77 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0
