An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: megatron-lm

shreyansh26/Annotated-ML-Papers

Annotations of the interesting ML papers I read

Size: 315 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 240 - Forks: 23

openpsi-project/ReaLHF 📦

Super-Efficient RLHF Training of LLMs with Parameter Reallocation

Language: Python - Size: 8.75 MB - Last synced at: about 18 hours ago - Pushed at: 14 days ago - Stars: 291 - Forks: 18

alibaba/Megatron-LLaMA Fork of NVIDIA/Megatron-LM

Best practice for training LLaMA models in Megatron-LM

Language: Python - Size: 4.18 MB - Last synced at: 8 days ago - Pushed at: over 1 year ago - Stars: 650 - Forks: 56

GoogleCloudPlatform/nvidia-nemo-on-gke

Training NVIDIA NeMo Megatron Large Language Model (LLM) using NeMo Framework on Google Kubernetes Engine

Language: HCL - Size: 964 KB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 12 - Forks: 7

yanring/Megatron-MoE-ModelZoo

Best practices for testing advanced Mixtral, DeepSeek, and Qwen series MoE models using Megatron Core MoE.

Language: Python - Size: 26.4 KB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 8 - Forks: 1

xrsrke/pipegoose

Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)*

Language: Python - Size: 1.26 MB - Last synced at: 12 days ago - Pushed at: over 1 year ago - Stars: 82 - Forks: 18

janelu9/EasyLLM

Running Large Language Model easily.

Language: Python - Size: 220 MB - Last synced at: 6 days ago - Pushed at: 23 days ago - Stars: 8 - Forks: 0

feifeibear/Odysseus-Transformer

Odysseus: Playground of LLM Sequence Parallelism

Language: Python - Size: 468 KB - Last synced at: about 1 month ago - Pushed at: 11 months ago - Stars: 68 - Forks: 3

0-1CxH/megatron-wrap

Wrapped Megatron: As User-Friendly as HuggingFace, As Powerful as Megatron-LM | Megatron封装:和HuggingFace一样方便,和Megatron-LM一样强大

Language: Python - Size: 2.41 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

MoFHeka/LLaMA-Megatron

A LLaMA1/LLaMA12 Megatron implement.

Language: Python - Size: 288 KB - Last synced at: 6 days ago - Pushed at: over 1 year ago - Stars: 28 - Forks: 2

Beomi/megatronlm_dataset_autotokenizer

Megatron-LM/GPT-NeoX compatible Text Encoder with 🤗Transformers AutoTokenizer.

Language: Python - Size: 498 KB - Last synced at: 1 day ago - Pushed at: over 1 year ago - Stars: 6 - Forks: 1

GJ98/Megatron-LM

Megatron-LM implemented by PyTorch

Language: Python - Size: 9.77 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0