GitHub topics: gradient-accumulation
andreped/GradientAccumulator
:dart: Gradient Accumulation for TensorFlow 2
Language: Python - Size: 5.3 MB - Last synced at: 1 day ago - Pushed at: over 1 year ago - Stars: 53 - Forks: 11

rentruewang/koila
Prevent PyTorch's `CUDA error: out of memory` in just 1 line of code.
Language: Python - Size: 4.04 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1,824 - Forks: 64

hkproj/pytorch-transformer-distributed
Distributed training (multi-node) of a Transformer model
Language: Python - Size: 4.03 MB - Last synced at: about 2 months ago - Pushed at: about 1 year ago - Stars: 66 - Forks: 29

Ibzie/VideoPredict-PyTorch-Deep-Learning-Models-for-Video-Frame-Prediction-ConvLSTM-PredRNN-Transform
🎯 Production-ready implementation of video prediction models using PyTorch. Features Enhanced ConvLSTM with temporal attention, PredRNN with spatiotemporal memory, and Transformer-based architecture.
Language: Python - Size: 2.33 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

TanyaChutani/Gradient-Accumulation-Tensorflow2.x
Gradient Accumulation with Tensorflow2.x
Language: Python - Size: 16.6 KB - Last synced at: 4 months ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 1

Behradsadeghi/flower-classification-efficientnet
Classifying images of flowers into 17 categories using EfficientNet-B0 and PyTorch.
Language: Jupyter Notebook - Size: 24.4 KB - Last synced at: 3 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

cankocagil/Low-Memory-Transformer-Finetuning
Implementation of Gradient Accumulation for low-memory language modelling transformer fine tuning.
Language: Jupyter Notebook - Size: 47.9 KB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

deephub-ai/torch-handle
TorchHandle makes your PyTorch development more efficient and make you use PyTorch more comfortable
Language: Python - Size: 81.1 KB - Last synced at: 4 days ago - Pushed at: almost 3 years ago - Stars: 19 - Forks: 5

mvoelk/ssd_detectors Fork of rykov8/ssd_keras
SSD-based object and text detection with Keras, SSD, DSOD, TextBoxes, SegLink, TextBoxes++, CRNN
Language: Jupyter Notebook - Size: 148 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 302 - Forks: 81

xueyouluo/Multi-Passage-BERT
A simple implementation of Multi-passage BERT
Language: Python - Size: 46.9 KB - Last synced at: about 1 year ago - Pushed at: over 4 years ago - Stars: 7 - Forks: 1

jimth001/my-tf-framework-for-nlp-tasks
This project aims to help people implement tensorflow model pipelines quickly for different nlp tasks.
Language: Python - Size: 83 KB - Last synced at: over 2 years ago - Pushed at: over 5 years ago - Stars: 5 - Forks: 0

CyberZHG/keras-gradient-accumulation 📦
Gradient accumulation for Keras
Language: Python - Size: 21.5 KB - Last synced at: 8 months ago - Pushed at: about 4 years ago - Stars: 35 - Forks: 4

shi510/tensorflow-gradient-accumulation
tensorflow2-keras gradient accumulation
Language: Python - Size: 90.8 KB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 4 - Forks: 0
