Topic: "gradient-accumulation"
rentruewang/koila
Prevent PyTorch's `CUDA error: out of memory` in just 1 line of code.
Language: Python - Size: 3.92 MB - Last synced at: 17 days ago - Pushed at: 5 months ago - Stars: 1,822 - Forks: 64

mvoelk/ssd_detectors Fork of rykov8/ssd_keras
SSD-based object and text detection with Keras, SSD, DSOD, TextBoxes, SegLink, TextBoxes++, CRNN
Language: Jupyter Notebook - Size: 148 MB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 302 - Forks: 81

hkproj/pytorch-transformer-distributed
Distributed training (multi-node) of a Transformer model
Language: Python - Size: 4.03 MB - Last synced at: 12 days ago - Pushed at: about 1 year ago - Stars: 66 - Forks: 29

andreped/GradientAccumulator
:dart: Accumulated Gradients for TensorFlow 2
Language: Python - Size: 5.3 MB - Last synced at: about 23 hours ago - Pushed at: over 1 year ago - Stars: 53 - Forks: 11

CyberZHG/keras-gradient-accumulation 📦
Gradient accumulation for Keras
Language: Python - Size: 21.5 KB - Last synced at: 6 months ago - Pushed at: almost 4 years ago - Stars: 35 - Forks: 4

deephub-ai/torch-handle
TorchHandle makes your PyTorch development more efficient and make you use PyTorch more comfortable
Language: Python - Size: 81.1 KB - Last synced at: 22 days ago - Pushed at: over 2 years ago - Stars: 19 - Forks: 5

xueyouluo/Multi-Passage-BERT
A simple implementation of Multi-passage BERT
Language: Python - Size: 46.9 KB - Last synced at: about 1 year ago - Pushed at: over 4 years ago - Stars: 7 - Forks: 1

jimth001/my-tf-framework-for-nlp-tasks
This project aims to help people implement tensorflow model pipelines quickly for different nlp tasks.
Language: Python - Size: 83 KB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 5 - Forks: 0

shi510/tensorflow-gradient-accumulation
tensorflow2-keras gradient accumulation
Language: Python - Size: 90.8 KB - Last synced at: almost 2 years ago - Pushed at: about 4 years ago - Stars: 4 - Forks: 0

TanyaChutani/Gradient-Accumulation-Tensorflow2.x
Gradient Accumulation with Tensorflow2.x
Language: Python - Size: 16.6 KB - Last synced at: 3 months ago - Pushed at: about 3 years ago - Stars: 2 - Forks: 1

Ibzie/VideoPredict-PyTorch-Deep-Learning-Models-for-Video-Frame-Prediction-ConvLSTM-PredRNN-Transform
🎯 Production-ready implementation of video prediction models using PyTorch. Features Enhanced ConvLSTM with temporal attention, PredRNN with spatiotemporal memory, and Transformer-based architecture.
Language: Python - Size: 2.33 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

Behradsadeghi/flower-classification-efficientnet
Classifying images of flowers into 17 categories using EfficientNet-B0 and PyTorch.
Language: Jupyter Notebook - Size: 24.4 KB - Last synced at: about 1 month ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

cankocagil/Low-Memory-Transformer-Finetuning
Implementation of Gradient Accumulation for low-memory language modelling transformer fine tuning.
Language: Jupyter Notebook - Size: 47.9 KB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0
