Topic: "sparse-distributed-training"
synxlin/deep-gradient-compression
[ICLR 2018] Deep Gradient Compression: Reducing the Communication Bandwidth for Distributed Training
Language: Python - Size: 316 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 207 - Forks: 45
