An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: efficient-deep-learning

AIoT-MLSys-Lab/Efficient-LLMs-Survey

[TMLR 2024] Efficient Large Language Models: A Survey

Size: 3.96 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 1,164 - Forks: 97

xuyang-liu16/Awesome-Token-level-Model-Compression

📚 Collection of token-level model compression resources.

Size: 1.71 MB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 97 - Forks: 4

Yeez-lee/Data-Selection-and-Reweighting-for-Diffusion-Models

[ICASSP'25] Pruning then Reweighting: Towards Data-Efficient Training of Diffusion Models

Size: 166 KB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 1 - Forks: 0

VainF/Torch-Pruning

[CVPR 2023] DepGraph: Towards Any Structural Pruning; LLMs, Vision Foundation Models, etc.

Language: Python - Size: 10 MB - Last synced at: 11 days ago - Pushed at: about 1 month ago - Stars: 3,018 - Forks: 351

mryab/efficient-dl-systems

Efficient Deep Learning Systems course materials (HSE, YSDA)

Language: Jupyter Notebook - Size: 68.7 MB - Last synced at: 10 days ago - Pushed at: about 2 months ago - Stars: 837 - Forks: 132

thu-nics/FrameFusion

The official code implementation of the paper "Combining Similarity and Importance for Video Token Reduction on Large Visual Language Models"

Language: Python - Size: 19.9 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 40 - Forks: 1

AIoT-MLSys-Lab/Efficient-Diffusion-Model-Survey

[TMLR 2025] Efficient Diffusion Models: A Survey

Size: 320 KB - Last synced at: 10 days ago - Pushed at: about 1 month ago - Stars: 60 - Forks: 3

FrancoisPorcher/awesome-ai-tutorials

The best collection of AI tutorials to make you a boss of Data Science!

Language: Python - Size: 135 MB - Last synced at: 29 days ago - Pushed at: 5 months ago - Stars: 93 - Forks: 21

Efficient-ML/Awesome-Model-Quantization

A list of papers, docs, and code about model quantization. This repo aims to provide information for model quantization research, and we are continuously improving the project. PRs adding works (papers, repositories) that the repo has missed are welcome.

Size: 61.5 MB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 2,084 - Forks: 221

tiingweii-shii/Awesome-Resource-Efficient-LLM-Papers

A curated list of high-quality papers on resource-efficient LLMs 🌱

Size: 336 KB - Last synced at: 29 days ago - Pushed at: 3 months ago - Stars: 117 - Forks: 7

Efficient-ML/Awesome-Efficient-AIGC

A list of papers, docs, and code about efficient AIGC. This repo aims to provide information for efficient AIGC research, covering both language and vision, and we are continuously improving the project. PRs adding works (papers, repositories) that the repo has missed are welcome.

Size: 63.5 KB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 178 - Forks: 11

chaitjo/efficient-gnns

Code and resources on scalable and efficient Graph Neural Networks (TNNLS 2023)

Language: Python - Size: 2.32 MB - Last synced at: 14 days ago - Pushed at: over 2 years ago - Stars: 533 - Forks: 64

MingSun-Tse/Efficient-Deep-Learning

Collection of recent methods on (deep) neural network compression and acceleration.

Size: 700 KB - Last synced at: about 1 month ago - Pushed at: 2 months ago - Stars: 945 - Forks: 131

friendshipkim/overfill

Code for OverFill: Two-Stage Models for Efficient Language Model Decoding

Language: Python - Size: 1.87 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

xuyang-liu16/Awesome-Generation-Acceleration

📚 Collection of awesome generation acceleration resources.

Size: 637 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 215 - Forks: 6

ROIM1998/APT

[ICML'24 Oral] APT: Adaptive Pruning and Tuning Pretrained Language Models for Efficient Training and Inference

Language: Python - Size: 4.08 MB - Last synced at: 6 days ago - Pushed at: about 1 year ago - Stars: 39 - Forks: 2

mlvlab/EfficientViM

[CVPR 25] Official Implementation (PyTorch) of "EfficientViM: Efficient Vision Mamba with Hidden State Mixer-based State Space Duality"

Language: Python - Size: 2.37 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 51 - Forks: 1

VainF/Diff-Pruning

[NeurIPS 2023] Structural Pruning for Diffusion Models

Language: Python - Size: 25.2 MB - Last synced at: about 2 months ago - Pushed at: 11 months ago - Stars: 185 - Forks: 12

tobna/TaylorShift

This repository contains the code for the paper "TaylorShift: Shifting the Complexity of Self-Attention from Squared to Linear (and Back) using Taylor-Softmax"

Language: Python - Size: 98.6 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 8 - Forks: 0

tobna/WhatTransformerToFavor

GitHub repository for the paper "Which Transformer to Favor: A Comparative Analysis of Efficiency in Vision Transformers".

Language: Python - Size: 2.85 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 27 - Forks: 7

LMD0311/DAPT

[CVPR 2024] Dynamic Adapter Meets Prompt Tuning: Parameter-Efficient Transfer Learning for Point Cloud Analysis

Language: Python - Size: 447 KB - Last synced at: 3 months ago - Pushed at: 8 months ago - Stars: 169 - Forks: 6

Ziyang-Yu/Awesome-Resource-Efficient-LLM-Papers (fork of tiingweii-shii/Awesome-Resource-Efficient-LLM-Papers)

A curated list of high-quality papers on resource-efficient LLMs 🌱

Size: 252 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

VITA-Group/Peek-a-Boo

[ICLR 2022] "Peek-a-Boo: What (More) is Disguised in a Randomly Weighted Neural Network, and How to Find It Efficiently", by Xiaohan Chen, Jason Zhang and Zhangyang Wang.

Language: Python - Size: 65.4 KB - Last synced at: about 2 months ago - Pushed at: over 3 years ago - Stars: 6 - Forks: 0

LauzHack/deep-learning-bootcamp

LauzHack Deep Learning Bootcamp

Language: Jupyter Notebook - Size: 74.6 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 35 - Forks: 8

tallamjr/astronet

Efficient Deep Learning for Real-time Classification of Astronomical Transients and Multivariate Time-series

Language: Jupyter Notebook - Size: 623 MB - Last synced at: about 1 month ago - Pushed at: about 2 years ago - Stars: 15 - Forks: 3

yhlleo/VTs-Drloc

[NeurIPS 2021] Official codes for "Efficient Training of Visual Transformers with Small Datasets".

Language: Python - Size: 704 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 140 - Forks: 14

sdc17/CrossGET

[ICML 2024] CrossGET: Cross-Guided Ensemble of Tokens for Accelerating Vision-Language Transformers.

Language: Python - Size: 11.6 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 26 - Forks: 0

negarhdr/skeleton-based-action-recognition

This repository provides implementations of a baseline method and our proposed methods for efficient skeleton-based human action recognition.

Language: Python - Size: 7.92 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 26 - Forks: 6

OSUPCVLab/MobileUNETR

Official Implementation of MobileUNETR: A Lightweight End-To-End Hybrid Vision Transformer For Efficient Medical Image Segmentation (ECCV2024) (Oral)

Language: Python - Size: 12 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 38 - Forks: 4

jerryfeng2003/PointGST

Parameter-Efficient Fine-Tuning in Spectral Domain for Point Cloud Learning

Language: Python - Size: 12.6 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 60 - Forks: 5

Sharath-girish/efficientgaussian

Official implementation of "EAGLES: Efficient Accelerated 3D Gaussians with Lightweight EncodingS"

Language: C++ - Size: 25.4 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 93 - Forks: 2

HayeonLee/MetaD2A

Official PyTorch implementation of "Rapid Neural Architecture Search by Learning to Generate Graphs from Datasets" (ICLR 2021)

Language: Python - Size: 1.12 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 61 - Forks: 10

HayeonLee/HELP

Official PyTorch Implementation of HELP: Hardware-adaptive Efficient Latency Prediction for NAS via Meta-Learning (NeurIPS 2021 Spotlight)

Language: Python - Size: 8.61 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 59 - Forks: 7

want-ai-hpc/want-ai-hpc.github.io

Official Website for the Workshop on Advancing Neural Networks Training: Computational Efficiency, Scalability, and Resource Optimization (WANT@ICML 2024, WANT@NeurIPS 2023)

Language: HTML - Size: 10.7 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 2 - Forks: 0

fscdc/MIT-Learning

My learning record for "TinyML and Efficient Deep Learning Computing"

Language: Jupyter Notebook - Size: 301 KB - Last synced at: 12 months ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

LayneH/GreenMIM

[NeurIPS2022] Official implementation of the paper 'Green Hierarchical Vision Transformer for Masked Image Modeling'.

Language: Python - Size: 1.39 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 163 - Forks: 6

david-knigge/separable-group-convolutional-networks

Code repository of the paper "Exploiting Redundancy: Separable Group Convolutional Networks on Lie Groups" https://proceedings.mlr.press/v162/knigge22a.html

Language: Python - Size: 869 KB - Last synced at: about 1 year ago - Pushed at: almost 3 years ago - Stars: 11 - Forks: 2

OPTML-Group/DeepZero

[ICLR'24] "DeepZero: Scaling up Zeroth-Order Optimization for Deep Model Training" by Aochuan Chen*, Yimeng Zhang*, Jinghan Jia, James Diffenderfer, Jiancheng Liu, Konstantinos Parasyris, Yihua Zhang, Zheng Zhang, Bhavya Kailkhura, Sijia Liu

Language: Python - Size: 2.88 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 24 - Forks: 2

DiffusionMamba/DiM

Official Codebase of "Scaling Diffusion Mamba for Efficient Image Generation"

Size: 4.88 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

BeSpontaneous/FFN-pytorch

Frame Flexible Network (CVPR2023)

Language: Python - Size: 15.5 MB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 52 - Forks: 5

CownowAn/DaSS

Official PyTorch implementation of "Meta-prediction Model for Distillation-Aware NAS on Unseen Datasets" (ICLR 2023 notable top 25%)

Language: Python - Size: 226 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 22 - Forks: 1

AdrianBZG/tinyroberta-distillation-qa-es

Code for "Language Model Knowledge Distillation for Efficient Question Answering in Spanish" (ICLR 2024 Tiny Papers)

Language: Python - Size: 55.7 KB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

EnVision-Research/DDSM

Denoising Diffusion Step-aware Models (ICLR2024)

Language: Python - Size: 229 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 23 - Forks: 0

mmontielpz/jetseg

NeurIPS 2023

Language: Jupyter Notebook - Size: 2.22 MB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 19 - Forks: 3

gmh14/Geo-DEG

[ICML 2023] Hierarchical Grammar-Induced Geometry for Data-Efficient Molecular Property Prediction

Language: Python - Size: 60.9 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 16 - Forks: 3

MingSun-Tse/Awesome-Pruning-at-Initialization

[IJCAI'22 Survey] Recent Advances on Neural Network Pruning at Initialization.

Size: 71.3 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 49 - Forks: 2

gmh14/data_efficient_grammar

[ICLR 2022] Data-Efficient Graph Grammar Learning for Molecular Generation

Language: Jupyter Notebook - Size: 62.6 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 84 - Forks: 23

d-becking/nncodec-icml-2023-demo

This repository is for reproducing the results shown in the NNCodec ICML Workshop paper. Additionally, it includes a demo, prepared for the Neural Compression Workshop (NCW).

Language: Python - Size: 8.19 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

MingSun-Tse/TPP

[ICLR'23] Trainability Preserving Neural Pruning (PyTorch)

Language: Python - Size: 982 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 20 - Forks: 0

MingSun-Tse/Why-the-State-of-Pruning-so-Confusing

[Preprint] Why is the State of Neural Network Pruning so Confusing? On the Fairness, Comparison Setup, and Trainability in Network Pruning

Size: 6.32 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 22 - Forks: 0

MingSun-Tse/Awesome-Efficient-ViT

Recent Advances on Efficient Vision Transformers

Size: 11.7 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 11 - Forks: 0

MingSun-Tse/Smile-Pruning

A generic code base for neural network pruning, especially for pruning at initialization.

Language: Python - Size: 24 MB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 24 - Forks: 1

tonyjo/edlsm_pytorch

PyTorch implementation of the stereo matching method described in the paper "Efficient Deep Learning for Stereo Matching"

Language: Python - Size: 74.2 KB - Last synced at: over 2 years ago - Pushed at: over 6 years ago - Stars: 17 - Forks: 5

datvuthanh/Stereo-Matching

Efficient Deep Learning for Stereo Matching in TensorFlow 2.x

Language: Python - Size: 68.4 MB - Last synced at: over 2 years ago - Pushed at: almost 4 years ago - Stars: 25 - Forks: 5

AIoT-MLSys-Lab/MutualNet (fork of taoyang1122/MutualNet)

[ECCV 2020 Oral] MutualNet: Adaptive ConvNet via Mutual Learning from Network Width and Resolution

Size: 1.18 MB - Last synced at: about 1 year ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

AIoT-MLSys-Lab/MSUNet

[NeurIPS 2019 Google MicroNet Challenge] MSUNet is an efficient model, designed by Yu Zheng, Shen Yan, and Mi Zhang, that won 4th place in the CIFAR-100 Track of the Google MicroNet Challenge hosted at NeurIPS 2019.

Language: Python - Size: 24.8 MB - Last synced at: about 1 year ago - Pushed at: over 4 years ago - Stars: 4 - Forks: 3

Related Keywords
efficient-deep-learning (56), deep-learning (15), model-compression (12), pytorch (8), computer-vision (7), diffusion-models (7), large-language-models (5), machine-learning (5), llm (5), network-pruning (4), transformers (4), pruning (4), knowledge-distillation (4), model-acceleration (4), survey (4), machine-learning-systems (4), generative-ai (4), graph-neural-networks (3), natural-language-processing (3), image-generation (3), neural-architecture-search (3), meta-learning (3), graph-convolutional-networks (2), model-quantization (2), deep-neural-networks (2), transformer (2), 3d-point-clouds (2), point-cloud (2), real-time (2), self-supervised-learning (2), vision-transformer (2), neural-network (2), grammar-learning (2), graph-learning (2), symbolic-representation (2), neural-network-pruning (2), pruning-at-initialization (2), stereo-matching (2), stereo-vision (2), ai (2), mlops (2), aigc (2), awesome (2), equivariance (1), ai4science (1), blackbox-optimization (1), zeroth-order-optimization (1), diffusion-transformer (1), mamba (1), state-space-models (1), action-recognition (1), parameter-efficient (1), video-recognition (1), language-model (1), question-answering (1), spanish (1), iclr2024 (1), model-pruning (1), masked-image-modeling (1), tinyml (1), mit (1), efficient (1), workshop (1), neurips (1), ml-infrastructure (1), hpc (1), hardware-aware (1), lvlm (1), novel-view-synthesis (1), gaussian-splatting (1), computer-graphics (1), compression (1), segmentation (1), lightweight-neural-network (1), nncodec (1), regularization (1), trainability (1), trainability-preserving-pruning (1), sparsity (1), attention-is-all-you-need (1), efficient-vision-transformers (1), fast-attention (1), vision-transformers (1), token-merging (1), token-compression (1), computer-vison (1), siamese-network (1), stereo-visi (1), tensorflow2 (1), dynamic-neural-networks (1), embedded-systems (1), jetson-agx-xavier (1), semantic-segmentation (1), distributed-training (1), cuda (1), molecular-property-prediction (1), vision (1), token-pruning (1), formal-languages (1), molecule-generation (1)