GitHub topics: fully-sharded-data-parallel
arawxx/FSDP-Distributed-Training-of-ConvNextV2-on-CIFAR10
A script for training the ConvNextV2 on CIFAR10 dataset using the FSDP technique for a distributed training scheme.
Language: Python - Size: 9.77 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

ridwan-salau/transformer-xl Fork of rufaelfekadu/transformer-xl
Fully Sharded Data Parallel (FSDP) implementation of Transformer XL
Language: Python - Size: 768 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0
