GitHub topics: deepspeed-ulysses
InternLM/InternEvo
InternEvo is an open-source, lightweight training framework that aims to support model pre-training without extensive dependencies.
Language: Python - Size: 6.78 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 386 - Forks: 64

feifeibear/long-context-attention
USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long-Context Transformer Model Training and Inference
Language: Python - Size: 4.6 MB - Last synced at: 18 days ago - Pushed at: 18 days ago - Stars: 475 - Forks: 38
