Topic: "model-reusing"
VITA-Group/LiGO
[ICLR 2023] "Learning to Grow Pretrained Models for Efficient Transformer Training" by Peihao Wang, Rameswar Panda, Lucas Torroba Hennigen, Philip Greengard, Leonid Karlinsky, Rogerio Feris, David Cox, Zhangyang Wang, Yoon Kim
Language: Python - Size: 1.55 MB - Last synced at: 8 days ago - Pushed at: about 1 year ago - Stars: 91 - Forks: 10

VITA-Group/Data-Efficient-Scaling
[ICML 2023] "Data Efficient Neural Scaling Law via Model Reusing" by Peihao Wang, Rameswar Panda, Zhangyang Wang
Language: Python - Size: 188 KB - Last synced at: 8 days ago - Pushed at: over 1 year ago - Stars: 14 - Forks: 0
