GitHub topics: stable-llm-pretraining
bluorion-com/ZClip
Official implementation of the paper: "ZClip: Adaptive Spike Mitigation for LLM Pre-Training".
Language: Python - Size: 500 KB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 125 - Forks: 8
