Topic: "adaptive-gradient-clipping"
vballoli/nfnets-pytorch
NFNets and Adaptive Gradient Clipping for SGD, implemented in PyTorch. An explanation is available at tourdeml.github.io/blog/
Language: Python - Size: 5.63 MB - Last synced at: 17 days ago - Pushed at: over 1 year ago - Stars: 345 - Forks: 29

VITA-Group/BNN_NoBN
[CVPRW 21] "BNN - BN = ? Training Binary Neural Networks without Batch Normalization", Tianlong Chen, Zhenyu Zhang, Xu Ouyang, Zechun Liu, Zhiqiang Shen, Zhangyang Wang
Language: Python - Size: 310 KB - Last synced at: 7 days ago - Pushed at: over 3 years ago - Stars: 57 - Forks: 10

andreped/GradientAccumulator
Accumulated Gradients for TensorFlow 2
Language: Python - Size: 5.3 MB - Last synced at: 4 days ago - Pushed at: about 1 year ago - Stars: 53 - Forks: 11
