An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: distributed-deep-learning

intel/BigDL

BigDL: Distributed TensorFlow, Keras and PyTorch on Apache Spark/Flink & Ray

Language: Jupyter Notebook - Size: 356 MB - Last synced at: 19 days ago - Pushed at: 2 months ago - Stars: 2,674 - Forks: 731

ParCIS/Chimera

Chimera: bidirectional pipeline parallelism for efficiently training large-scale models.

Language: Python - Size: 1.05 MB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 62 - Forks: 8

dkeras-project/dkeras

Distributed Keras Engine, Make Keras faster with only one line of code.

Language: Python - Size: 6.48 MB - Last synced at: about 1 month ago - Pushed at: over 5 years ago - Stars: 188 - Forks: 12

intel/e2eAIOK ๐Ÿ“ฆ

Intelยฎ End-to-End AI Optimization Kit

Language: Jupyter Notebook - Size: 220 MB - Last synced at: 21 days ago - Pushed at: 11 months ago - Stars: 31 - Forks: 22

zoranzhao/DeepThings

A Portable C Library for Distributed CNN Inference on IoT Edge Clusters

Language: C - Size: 1.81 MB - Last synced at: 2 months ago - Pushed at: about 5 years ago - Stars: 83 - Forks: 40

ParCIS/Ok-Topk

Ok-Topk is a scheme for distributed training with sparse gradients. Ok-Topk integrates a novel sparse allreduce algorithm (less than 6k communication volume which is asymptotically optimal) with the decentralized parallel Stochastic Gradient Descent (SGD) optimizer, and its convergence is proved theoretically and empirically.

Language: Python - Size: 334 KB - Last synced at: 4 months ago - Pushed at: over 2 years ago - Stars: 24 - Forks: 8

hyunnnchoi/google-t5-fsdp-kubeflow

A foundational repository for setting up distributed training jobs using Kubeflow and PyTorch FSDP.

Language: Python - Size: 82 KB - Last synced at: 2 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

rkhan055/SHADE

SHADE: Enable Fundamental Cacheability for Distributed Deep Learning Training

Language: Python - Size: 28.3 KB - Last synced at: 7 months ago - Pushed at: over 2 years ago - Stars: 29 - Forks: 9

ravenprotocol/ravnest

Decentralized Asynchronous Training on Heterogeneous Devices

Language: Python - Size: 1.21 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 8 - Forks: 0

GuanhuaWang/sensAI

sensAI: ConvNets Decomposition via Class Parallelism for Fast Inference on Live Data

Language: Python - Size: 1.27 MB - Last synced at: about 2 months ago - Pushed at: 11 months ago - Stars: 64 - Forks: 8

StefanoFioravanzo/distributed-deeplearning-kubernetes

Collection of resources for automatic deployment of distributed deep learning jobs on a Kubernetes cluster

Language: Python - Size: 123 KB - Last synced at: 3 months ago - Pushed at: over 6 years ago - Stars: 2 - Forks: 1

ray-project/anyscale-workshop-nyc-2023 ๐Ÿ“ฆ

Scalable NLP model fine-tuning and batch inference with Ray and Anyscale

Language: Jupyter Notebook - Size: 11.2 MB - Last synced at: 12 months ago - Pushed at: about 2 years ago - Stars: 6 - Forks: 0

mma735/TFM-DS

Comparison of distributed machine learning techniques applied to openly available datasets

Language: Jupyter Notebook - Size: 105 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Shigangli/eager-SGD

Eager-SGD is a decentralized asynchronous SGD. It utilizes novel partial collectives operations to accumulate the gradients across all the processes.

Language: Python - Size: 1.31 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 7 - Forks: 0

siddhanthiyer-99/Distributed-Training-of-GANs

Implemented training strategies to help improve bottlenecks and to improve the training speed while maintaining the quality of our GANs.

Language: Python - Size: 121 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

lancelee82/necklace

Distributed deep learning framework based on pytorch/numba/nccl and zeromq.

Language: Python - Size: 235 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 1

dyadxmachina/Applied-Deep-Learning-with-TensorFlow

Learn applied deep learning from zero to deployment using TensorFlow 1.8+

Language: Jupyter Notebook - Size: 3.73 MB - Last synced at: over 1 year ago - Pushed at: almost 7 years ago - Stars: 163 - Forks: 49

gsyang33/Driple

๐Ÿšจ Prediction of the Resource Consumption of Distributed Deep Learning Systems

Language: Python - Size: 3.07 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 13 - Forks: 12

rocketmlhq/rmldnn

RocketML Deep Neural Networks

Language: Jupyter Notebook - Size: 17.7 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 40 - Forks: 11

sotheanithsok/Image-Recognition-using-Distributed-ResNet-Model ๐Ÿ“ฆ

An implementation of a distributed ResNet model for classifying CIFAR-10 and MNIST datasets.

Language: Python - Size: 69.3 KB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

veritas9872/Horovod-Pytorch-Tutorial ๐Ÿ“ฆ

Horovod Tutorial for Pytorch using NVIDIA-Docker.

Language: Python - Size: 10.7 KB - Last synced at: over 2 years ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 0

ch3njust1n/smpl ๐Ÿ“ฆ

Simultaneous Multi-Party Learning Framework

Language: Python - Size: 13.8 MB - Last synced at: 4 days ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0

amirhosein-mesbah/Deep_Learning

This repository contains the implementation of a wide variety of Deep Learning Projects in different applications of computer vision, NLP, federated, and distributed learning. These projects include university projects and projects implemented due to interest in Deep Learning.

Language: Jupyter Notebook - Size: 17.6 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

deepspark/deepspark_java

Java based Convolutional Neural Network package running on Apache Spark framework

Language: Java - Size: 3.74 MB - Last synced at: over 2 years ago - Pushed at: over 8 years ago - Stars: 4 - Forks: 5

thanoskaravangelis/distributed-deep-learning-ntua Fork of John-Atha/distributed-deep-learning-NTUA-2022

Distributed Deep Learning experiments with the BigDL framework over Databricks

Language: Jupyter Notebook - Size: 2.57 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

christianramsey/Tensorflow-for-Distributed-Deep-Learning

TensorFlow (1.8+) Datasets, Feature Columns, Estimators and Distributed Training using Google Cloud Machine Learning Engine

Language: Jupyter Notebook - Size: 524 KB - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 11 - Forks: 4

sqaz91819/Blockchain-NAS

A blockchain based neural architecture search project.

Language: Python - Size: 47.2 MB - Last synced at: over 1 year ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

Shigangli/WAGMA-SGD

WAGMA-SGD is a decentralized asynchronous SGD based on wait-avoiding group model averaging. The synchronization is relaxed by making the collectives externally-triggerable, namely, a collective can be initiated without requiring that all the processes enter it. It partially reduces the data within non-overlapping groups of process, improving the parallel scalability.

Language: Python - Size: 1.11 MB - Last synced at: over 2 years ago - Pushed at: almost 4 years ago - Stars: 6 - Forks: 0

explcre/SHUKUN-Technology-AlgorithmIntern-MultiNodeTraining-for-DLmodels-Horovod-ConfigurationTutorial-Perf

SHUKUN Technology Co.,Ltd Algorithm intern (2020/12-2021/5). Multi-GPU, Multi-node training for deep learning models. Horovod, NVIDIA clara train sdk, configuration tutorial,performance testing.

Language: HTML - Size: 37 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

hkvision/analytics-zoo Fork of intel-analytics/analytics-zoo

Distributed Tensorflow, Keras and BigDL on Apache Spark

Language: Jupyter Notebook - Size: 264 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

AmrMKayid/KayDDRL

Distributed Deep Reinforcement Learning for Large Scale Robotic Simulations ๐Ÿ‘จโ€๐Ÿ’ป๐Ÿค–๐Ÿ•ธ๐Ÿ•น๐Ÿ•ทโค๏ธ๐Ÿ‘จโ€๐Ÿ”ฌ

Language: TeX - Size: 132 MB - Last synced at: over 2 years ago - Pushed at: almost 6 years ago - Stars: 1 - Forks: 0

trilliwon/pytorch-examples

PyTorch Examples for Beginners

Language: Jupyter Notebook - Size: 47.6 MB - Last synced at: 9 days ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 1

pierric/Mnist-Caffe-MPI

mnist, using caffe and openmpi

Language: C++ - Size: 16.6 KB - Last synced at: 8 days ago - Pushed at: almost 8 years ago - Stars: 0 - Forks: 0

jayparks/deepspark_java Fork of deepspark/deepspark_java

Java based Convolutional Neural Network package running on Apache Spark framework

Language: Java - Size: 3.74 MB - Last synced at: 10 days ago - Pushed at: over 8 years ago - Stars: 0 - Forks: 0

Related Keywords
distributed-deep-learning 34 deep-learning 15 pytorch 9 deep-neural-networks 9 tensorflow 7 machine-learning 7 python 7 distributed-machine-learning 5 apache-spark 4 spark 4 neural-networks 3 distributed-systems 3 keras-tensorflow 3 bigdl 3 horovod 3 mxnet 2 federated-learning 2 partial-allreduce 2 cnn 2 docker 2 convolutional-neural-networks 2 cifar10 2 java 2 mnist 2 artificial-intelligence 2 jblas 2 jcublas 2 neural-architecture-search 2 analytics-zoo 2 ray 2 scala 2 distributed 2 multi-gpu-training 1 crowd-counting 1 sgd 1 metaheuristic 1 hypergraph-sgd 1 caffe 1 hypergraph 1 hgsgd 1 gradient-descent 1 openmpi 1 evolutionary-algorithm 1 asynchronous-sgd 1 artificial-neural-networks 1 nvidia-docker 1 horovod-tutorial 1 distributed-computing 1 horovod-pytorch-tutorial 1 horovod-pytorch-example 1 horovod-pytorch 1 horovod-example 1 large-scale-learning 1 scientific-machine-learning 1 multi-node-training 1 clara-train 1 model-averaging 1 blockchain 1 inference 1 cloud 1 jupyter-notebook 1 spark-framework 1 nfs 1 nvidia 1 ssh 1 artificial-general-intelligence 1 deep-reinforcement-learning 1 deepspark-java 1 robotics 1 deeplearning 1 distributed-pytorch 1 transformer 1 segmentation 1 rnn 1 machine-translation 1 lstm 1 image-captioning 1 high-performance-computing 1 cifar100 1 cifar-100 1 cifar-10 1 storage 1 caching 1 kubeflow 1 fsdp 1 topk-sgd 1 sparse-allreduce 1 iot-edge-clusters 1 internet-of-things 1 edge-computing 1 automl 1 tensorflow-models 1 plaidml 1 parallel-computing 1 neural-network 1 keras-neural-networks 1 keras-models 1 keras-classification-models 1 keras 1 distributed-keras-engine 1