GitHub topics: pipeline-parallelism

Repositories

deepspeedai/DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language: Python - Size: 217 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 39,507 - Forks: 4,485

bigscience-workshop/petals

🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading

Language: Python - Size: 4.06 MB - Last synced at: 1 day ago - Pushed at: 11 months ago - Stars: 9,731 - Forks: 567

hpcaitech/ColossalAI

Making large AI models cheaper, faster and more accessible

Language: Python - Size: 63.2 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 41,044 - Forks: 4,524

Shenggan/awesome-distributed-ml

A curated list of awesome projects and papers for distributed training or inference

Size: 44.9 KB - Last synced at: 7 days ago - Pushed at: 10 months ago - Stars: 239 - Forks: 28

1set-t/ai-model

Industrial-grade weather visualization system that transforms AI model predictions into professional meteorological plots, emphasizing operational forecasting capabilities.

Size: 1.95 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 0 - Forks: 0

gty111/gLLM

gLLM: Global Balanced Pipeline Parallelism System for Distributed LLM Serving with Token Throttling

Language: Python - Size: 1.42 MB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 29 - Forks: 1

ai-decentralized/BloomBee

Decentralized LLMs fine-tuning and inference with offloading

Language: Python - Size: 36.7 MB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 94 - Forks: 16

torchpipe/torchpipe

Serving Inside Pytorch

Language: C++ - Size: 41.6 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 163 - Forks: 13

xrsrke/pipegoose

Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)*

Language: Python - Size: 1.26 MB - Last synced at: 18 days ago - Pushed at: over 1 year ago - Stars: 84 - Forks: 19

InternLM/InternEvo

InternEvo is an open-sourced lightweight training framework aims to support model pre-training without the need for extensive dependencies.

Language: Python - Size: 6.8 MB - Last synced at: 25 days ago - Pushed at: 25 days ago - Stars: 393 - Forks: 69

kakaobrain/torchgpipe

A GPipe implementation in PyTorch

Language: Python - Size: 449 KB - Last synced at: 28 days ago - Pushed at: about 1 year ago - Stars: 843 - Forks: 99

PaddlePaddle/PaddleFleetX

飞桨大模型开发套件，提供大语言模型、跨模态大模型、生物计算大模型等领域的全流程开发工具链。

Language: Python - Size: 637 MB - Last synced at: 28 days ago - Pushed at: about 1 year ago - Stars: 470 - Forks: 165

Oneflow-Inc/libai

LiBai(李白): A Toolbox for Large-Scale Distributed Parallel Training

Language: Python - Size: 34.7 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 406 - Forks: 56

ParCIS/Chimera

Chimera: bidirectional pipeline parallelism for efficiently training large-scale models.

Language: Python - Size: 1.05 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 62 - Forks: 8

alibaba/EasyParallelLibrary

Easy Parallel Library (EPL) is a general and efficient deep learning framework for distributed model training.

Language: Python - Size: 771 KB - Last synced at: 4 months ago - Pushed at: over 2 years ago - Stars: 267 - Forks: 49

AlibabaPAI/DAPPLE

An Efficient Pipelined Data Parallel Approach for Training Large Model

Language: Python - Size: 1.64 MB - Last synced at: 4 months ago - Pushed at: over 4 years ago - Stars: 73 - Forks: 17

Personal Project: MPP-Qwen14B & MPP-Qwen-Next(Multimodal Pipeline Parallel based on Qwen-LM). Support [video/image/multi-image] {sft/conversations}. Don't let the poverty limit your imagination! Train your own 8B/14B LLaVA-training-like MLLM on RTX3090/4090 24GB.

Language: Jupyter Notebook - Size: 73.1 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 420 - Forks: 23