Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: model-compression

rdrachmanto/gace-characterize-pruning

Characterization study repository for model compression method: pruning

Language: Python - Size: 17.6 KB - Last synced: about 3 hours ago - Pushed: about 4 hours ago - Stars: 0 - Forks: 1

hnuzhy/CV_DL_Gather

Gather research papers, corresponding codes (if having), reading notes and any other related materials about Hot🔥🔥🔥 fields in Computer Vision based on Deep Learning.

Size: 36.8 MB - Last synced: about 3 hours ago - Pushed: about 5 hours ago - Stars: 50 - Forks: 5

huawei-noah/Efficient-AI-Backbones

Efficient AI Backbones including GhostNet, TNT and MLP, developed by Huawei Noah's Ark Lab.

Language: Python - Size: 98.4 MB - Last synced: about 12 hours ago - Pushed: 23 days ago - Stars: 3,861 - Forks: 689

alibaba/TinyNeuralNetwork

TinyNeuralNetwork is an efficient and easy-to-use deep learning model compression framework.

Language: Python - Size: 24.9 MB - Last synced: about 16 hours ago - Pushed: about 17 hours ago - Stars: 716 - Forks: 117

Picovoice/picollm

On-device LLM Inference Powered by X-Bit Quantization

Language: Python - Size: 28.7 MB - Last synced: 1 day ago - Pushed: 3 days ago - Stars: 43 - Forks: 0

htqin/awesome-efficient-aigc

A list of papers, docs, codes about efficient AIGC. This repo is aimed to provide the info for efficient AIGC research, including language and vision, we are continuously improving the project. Welcome to PR the works (papers, repositories) that are missed by the repo.

Size: 154 KB - Last synced: 12 days ago - Pushed: about 1 month ago - Stars: 105 - Forks: 10

cedrickchee/awesome-ml-model-compression

Awesome machine learning model compression research papers, tools, and learning material.

Size: 185 KB - Last synced: 13 days ago - Pushed: 23 days ago - Stars: 446 - Forks: 57

wangxb96/Awesome-AI-on-the-Edge

Resources of our survey paper "Enabling AI on Edges: Techniques, Applications and Challenges"

Size: 3.49 MB - Last synced: 11 days ago - Pushed: 14 days ago - Stars: 39 - Forks: 2

HanXinzi-AI/awesome-computer-vision-resources

a collection of computer vision projects&tools. 计算机视觉方向项目和工具集合。

Size: 49.8 MB - Last synced: 9 days ago - Pushed: 11 days ago - Stars: 101 - Forks: 21

microsoft/nni

An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.

Language: Python - Size: 127 MB - Last synced: 4 days ago - Pushed: 21 days ago - Stars: 13,817 - Forks: 1,806

datawhalechina/awesome-compression

模型压缩的小白入门教程

Size: 102 MB - Last synced: 4 days ago - Pushed: 17 days ago - Stars: 54 - Forks: 10

mohd-faizy/Hyperparameter-Tuning-with-Microsoft-Network-Intelligence-Toolkit-NNI

Hyperparameter Tuning with Microsoft NNI to automated machine learning (AutoML) experiments. The tool dispatches and runs trial jobs generated by tuning algorithms to search the best neural architecture and/or hyper-parameters in different environments like local machine, remote servers and cloud.

Language: Python - Size: 3.02 MB - Last synced: 9 days ago - Pushed: 9 days ago - Stars: 2 - Forks: 1

htqin/awesome-model-quantization

A list of papers, docs, codes about model quantization. This repo is aimed to provide the info for model quantization research, we are continuously improving the project. Welcome to PR the works (papers, repositories) that are missed by the repo.

Size: 61.4 MB - Last synced: 9 days ago - Pushed: about 1 month ago - Stars: 1,656 - Forks: 200

dkozlov/awesome-knowledge-distillation

Awesome Knowledge Distillation

Size: 282 KB - Last synced: 9 days ago - Pushed: 5 months ago - Stars: 3,328 - Forks: 483

FLHonker/Awesome-Knowledge-Distillation

Awesome Knowledge-Distillation. 分类整理的知识蒸馏paper(2014-2021)。

Size: 457 KB - Last synced: 9 days ago - Pushed: about 1 year ago - Stars: 2,418 - Forks: 334

haitongli/knowledge-distillation-pytorch

A PyTorch implementation for exploring deep and shallow knowledge distillation (KD) experiments with flexibility

Language: Python - Size: 22.1 MB - Last synced: 10 days ago - Pushed: about 1 year ago - Stars: 1,787 - Forks: 340

AberHu/Knowledge-Distillation-Zoo

Pytorch implementation of various Knowledge Distillation (KD) methods.

Language: Python - Size: 90.8 KB - Last synced: 9 days ago - Pushed: over 2 years ago - Stars: 1,526 - Forks: 261

SforAiDl/KD_Lib

A Pytorch Knowledge Distillation library for benchmarking and extending works in the domains of Knowledge Distillation, Pruning, and Quantization.

Language: Python - Size: 22.2 MB - Last synced: 9 days ago - Pushed: about 1 year ago - Stars: 575 - Forks: 57

huawei-noah/Efficient-Computing

Efficient computing methods developed by Huawei Noah's Ark Lab

Language: Jupyter Notebook - Size: 98.7 MB - Last synced: 10 days ago - Pushed: about 2 months ago - Stars: 1,119 - Forks: 198

he-y/Awesome-Pruning

A curated list of neural network pruning resources.

Size: 605 KB - Last synced: 12 days ago - Pushed: about 2 months ago - Stars: 2,231 - Forks: 327

guan-yuan/awesome-AutoML-and-Lightweight-Models

A list of high-quality (newest) AutoML works and lightweight models including 1.) Neural Architecture Search, 2.) Lightweight Structures, 3.) Model Compression, Quantization and Acceleration, 4.) Hyperparameter Optimization, 5.) Automated Feature Engineering.

Size: 150 KB - Last synced: 13 days ago - Pushed: almost 3 years ago - Stars: 827 - Forks: 160

surajiitd/NVIDIA_Jetson_Inference

This repo contains model compression(using TensorRT) and documentation of running various deep learning models on NVIDIA Jetson Orin, Nano (aarch64 architectures)

Language: Makefile - Size: 1.39 GB - Last synced: 15 days ago - Pushed: 15 days ago - Stars: 6 - Forks: 2

tensorflow/model-optimization

A toolkit to optimize ML models for deployment for Keras and TensorFlow, including quantization and pruning.

Language: Python - Size: 2.17 MB - Last synced: 9 days ago - Pushed: 29 days ago - Stars: 1,472 - Forks: 321

VainF/Torch-Pruning

[CVPR 2023] Towards Any Structural Pruning; LLMs / SAM / Diffusion / Transformers / YOLOv8 / CNNs

Language: Python - Size: 9.58 MB - Last synced: 17 days ago - Pushed: 17 days ago - Stars: 2,344 - Forks: 298

horseee/DeepCache

[CVPR 2024] DeepCache: Accelerating Diffusion Models for Free

Language: Python - Size: 102 MB - Last synced: 17 days ago - Pushed: about 2 months ago - Stars: 620 - Forks: 29

microsoft/NeuronBlocks

NLP DNN Toolkit - Building Your NLP DNN Models Like Playing Lego

Language: Python - Size: 14.9 MB - Last synced: 12 days ago - Pushed: 10 months ago - Stars: 1,441 - Forks: 192

OpenNLG/OpenBA-v2

OpenBA-V2: 3B LLM (Large Language Model) with T5 architecture, utilizing model pruning technique and continuing pretraining from OpenBA-15B.

Language: Python - Size: 15 MB - Last synced: 21 days ago - Pushed: 22 days ago - Stars: 8 - Forks: 0

Tencent/PocketFlow

An Automatic Model Compression (AutoMC) framework for developing smaller and faster AI applications.

Language: Python - Size: 1.13 MB - Last synced: 21 days ago - Pushed: about 1 year ago - Stars: 2,782 - Forks: 499

bupt-ai-club/awesomeProject

记录有意思的AI相关项目

Size: 108 MB - Last synced: 9 days ago - Pushed: about 2 months ago - Stars: 6 - Forks: 2

tianyic/only_train_once

OTOv1-v3, NeurIPS, ICLR, TMLR, DNN Training, Compression, Structured Pruning, Erasing Operators, CNN, Diffusion, LLM

Language: Python - Size: 3.05 MB - Last synced: 25 days ago - Pushed: 25 days ago - Stars: 261 - Forks: 45

microsoft/archai

Accelerate your Neural Architecture Search (NAS) through fast, reproducible and modular research.

Language: Python - Size: 50.2 MB - Last synced: 24 days ago - Pushed: 5 months ago - Stars: 457 - Forks: 90

vi2enne/Neural-Network-Pruning

Language: Python - Size: 60.5 KB - Last synced: 26 days ago - Pushed: over 1 year ago - Stars: 1 - Forks: 1

SqueezeAILab/SqueezeLLM

[ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization

Language: Python - Size: 1.5 MB - Last synced: 29 days ago - Pushed: 30 days ago - Stars: 569 - Forks: 35

zeyneddinoz/subMFL

subMFL: Compatible subModel Generation for Federated Learning in Device Heterogeneous Environment

Language: Jupyter Notebook - Size: 125 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 1 - Forks: 1

mlzxy/qsparse

Train neural networks with joint quantization and pruning on both weights and activations using any pytorch modules

Language: Python - Size: 293 KB - Last synced: 15 days ago - Pushed: over 1 year ago - Stars: 39 - Forks: 2

chenllliang/Model-Compression-For-Speaker-Recognition

Distillation examples. Trying to make Speaker Recognition Faster through different Model Compression techniques

Language: Python - Size: 12.7 KB - Last synced: about 1 month ago - Pushed: almost 4 years ago - Stars: 0 - Forks: 1

lucinezhang/Deep-Slope-Estimation

MSCV 2019 Capstone Project

Language: Python - Size: 21.1 MB - Last synced: about 1 month ago - Pushed: about 5 years ago - Stars: 3 - Forks: 1

ciodar/deep-compression

PyTorch Lightning implementation of the paper Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding. This repository allows to reproduce the main findings of the paper on MNIST and Imagenette datasets.

Language: Jupyter Notebook - Size: 3.84 MB - Last synced: 18 days ago - Pushed: about 1 year ago - Stars: 13 - Forks: 0

gershonc/octopus-ml

A collection of handy ML and data visualization and validation tools. Go ahead and train, evaluate and validate your ML models and data with minimal effort.

Language: Jupyter Notebook - Size: 21.4 MB - Last synced: 3 days ago - Pushed: about 1 year ago - Stars: 19 - Forks: 5

666DZY666/micronet

micronet, a model compression and deploy lib. compression: 1、quantization: quantization-aware-training(QAT), High-Bit(>2b)(DoReFa/Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference)、Low-Bit(≤2b)/Ternary and Binary(TWN/BNN/XNOR-Net); post-training-quantization(PTQ), 8-bit(tensorrt); 2、 pruning: normal、regular and group convolutional channel pruning; 3、 group convolution structure; 4、batch-normalization fuse for quantization. deploy: tensorrt, fp32/fp16/int8(ptq-calibration)、op-adapt(upsample)、dynamic_shape

Language: Python - Size: 6.84 MB - Last synced: about 1 month ago - Pushed: over 2 years ago - Stars: 2,178 - Forks: 477

kssteven418/I-BERT

[ICML'21 Oral] I-BERT: Integer-only BERT Quantization

Language: Python - Size: 6.38 MB - Last synced: about 1 month ago - Pushed: over 1 year ago - Stars: 209 - Forks: 30

jaketae/nn-svd

Neural network compression with SVD

Language: Python - Size: 33.2 KB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 0 - Forks: 0

Shuai-Xie/BiSeNet-Compression

10 variants of original BiSeNet with performance comparison, the faster, the better.

Language: Python - Size: 106 KB - Last synced: about 1 month ago - Pushed: about 5 years ago - Stars: 4 - Forks: 0

ksm26/Quantization-Fundamentals-with-Hugging-Face

Learn linear quantization techniques using the Quanto library and downcasting methods with the Transformers library to compress and optimize generative AI models effectively.

Language: Jupyter Notebook - Size: 205 KB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 0 - Forks: 0

SqueezeAILab/KVQuant

KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization

Language: Python - Size: 19.7 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 187 - Forks: 14

TCLResearchEurope/ptdeco

ptdeco is a library for model optimization by decomposition built on top of PyTorch

Language: Python - Size: 299 KB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 7 - Forks: 1

ChanChiChoi/awesome-model-compression

papers about model compression

Size: 504 KB - Last synced: 9 days ago - Pushed: over 1 year ago - Stars: 166 - Forks: 38

jim-schwoebel/allie

🤖 An automated machine learning framework for audio, text, image, video, or .CSV files (50+ featurizers and 15+ model trainers). Python 3.6 required.

Language: Python - Size: 275 MB - Last synced: 21 days ago - Pushed: 8 months ago - Stars: 139 - Forks: 36

huawei-noah/Pretrained-Language-Model

Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.

Language: Python - Size: 29 MB - Last synced: about 2 months ago - Pushed: 4 months ago - Stars: 2,953 - Forks: 623

rakutentech/iterative_training

Iterative Training: Finding Binary Weight Deep Neural Networks with Layer Binarization

Language: Python - Size: 179 KB - Last synced: about 2 months ago - Pushed: over 2 years ago - Stars: 0 - Forks: 1

minseok0809/transformers-compression-practice

Transformers Compression Practice

Language: Jupyter Notebook - Size: 19.9 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 0 - Forks: 0

MingSun-Tse/Efficient-Deep-Learning

Collection of recent methods on (deep) neural network compression and acceleration.

Size: 618 KB - Last synced: about 2 months ago - Pushed: 3 months ago - Stars: 894 - Forks: 126

ethanhe42/channel-pruning

Channel Pruning for Accelerating Very Deep Neural Networks (ICCV'17)

Language: Python - Size: 544 KB - Last synced: about 2 months ago - Pushed: over 2 years ago - Stars: 1,065 - Forks: 311

kssteven418/LTP

[KDD'22] Learned Token Pruning for Transformers

Language: Python - Size: 40.1 MB - Last synced: about 1 month ago - Pushed: over 1 year ago - Stars: 84 - Forks: 14

microsoft/Moonlit

This is a collection of our research on efficient AI, covering hardware-aware NAS and model compression.

Language: Python - Size: 12 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 54 - Forks: 6

kxytechnologies/kxy-python

A toolkit to boost the productivity of machine learning engineers.

Language: Python - Size: 38.6 MB - Last synced: about 2 months ago - Pushed: almost 2 years ago - Stars: 47 - Forks: 12

VITA-Group/SViTE

[NeurIPS'21] "Chasing Sparsity in Vision Transformers: An End-to-End Exploration" by Tianlong Chen, Yu Cheng, Zhe Gan, Lu Yuan, Lei Zhang, Zhangyang Wang

Language: Python - Size: 615 KB - Last synced: about 2 months ago - Pushed: 6 months ago - Stars: 85 - Forks: 12

ziplab/SPViT

[TPAMI 2024] This is the official repository for our paper: ''Pruning Self-attentions into Convolutional Layers in Single Path''.

Language: Python - Size: 198 KB - Last synced: about 1 month ago - Pushed: 5 months ago - Stars: 99 - Forks: 14

minseok0809/awesome-ai-paper

A curated list of awesome NLP, Computer Vision, Model Compression, XAI, Reinforcement Learning, Security etc Paper

Language: Jupyter Notebook - Size: 2.81 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 0 - Forks: 0

pratyushasharma/laser

The Truth Is In There: Improving Reasoning in Language Models with Layer-Selective Rank Reduction

Language: Python - Size: 2.31 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 318 - Forks: 18

1duo/awesome-ai-infrastructures

Infrastructures™ for Machine Learning Training/Inference in Production.

Size: 11.8 MB - Last synced: 12 days ago - Pushed: about 5 years ago - Stars: 368 - Forks: 71

jchenghu/sharebert

Implementation of the work "ShareBERT: Embeddings Are Capable of Learning Hidden Layers".

Language: Python - Size: 206 KB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 0 - Forks: 0

chester256/Model-Compression-Papers

Papers for deep neural network compression and acceleration

Size: 8.79 KB - Last synced: about 2 months ago - Pushed: almost 3 years ago - Stars: 392 - Forks: 77

cnkuangshi/LightCTR

Lightweight and Scalable framework that combines mainstream algorithms of Click-Through-Rate prediction based computational DAG, philosophy of Parameter Server and Ring-AllReduce collective communication.

Language: C++ - Size: 9.41 MB - Last synced: about 2 months ago - Pushed: almost 5 years ago - Stars: 674 - Forks: 143

marload/aquvitae

Knowledge Distillation Toolkit

Language: Python - Size: 170 MB - Last synced: 27 days ago - Pushed: almost 4 years ago - Stars: 90 - Forks: 10

elphinkuo/distiller

The original experiments code for AAAI 2020 paper, "AutoCompress: An Automatic DNN Structured Pruning Framework for Ultra-High Compression Rates"

Language: Jupyter Notebook - Size: 36.1 MB - Last synced: about 2 months ago - Pushed: 9 months ago - Stars: 1 - Forks: 0

mit-han-lab/amc

[ECCV 2018] AMC: AutoML for Model Compression and Acceleration on Mobile Devices

Language: Python - Size: 17.6 KB - Last synced: 2 months ago - Pushed: 6 months ago - Stars: 416 - Forks: 108

FLHonker/ZAQ-code

CVPR 2021 : Zero-shot Adversarial Quantization (ZAQ)

Language: Python - Size: 188 KB - Last synced: about 2 months ago - Pushed: over 2 years ago - Stars: 64 - Forks: 16

czg1225/SlimSAM

SlimSAM: 0.1% Data Makes Segment Anything Slim

Language: Python - Size: 35.9 MB - Last synced: 2 months ago - Pushed: 2 months ago - Stars: 211 - Forks: 12

Xiuyu-Li/q-diffusion

[ICCV 2023] Q-Diffusion: Quantizing Diffusion Models.

Language: Python - Size: 5.97 MB - Last synced: 2 months ago - Pushed: 2 months ago - Stars: 243 - Forks: 17

Lee-Gihun/MicroNet_OSI-AI

(NeurIPS-2019 MicroNet Challenge - 3rd Winner) Open source code for "SIPA: A simple framework for efficient networks"

Language: Python - Size: 14.8 MB - Last synced: about 1 month ago - Pushed: over 1 year ago - Stars: 18 - Forks: 6

liuziwei7/mobile-id

Deep Face Model Compression

Language: Matlab - Size: 3.62 MB - Last synced: 9 days ago - Pushed: almost 6 years ago - Stars: 195 - Forks: 102

zju-vipa/CMI

[IJCAI-2021] Contrastive Model Inversion for Data-Free Knowledge Distillation

Language: Python - Size: 2.56 MB - Last synced: about 1 month ago - Pushed: about 2 years ago - Stars: 65 - Forks: 15

frankaging/Causal-Distill

The Codebase for Causal Distillation for Language Models (NAACL '22)

Language: Python - Size: 631 KB - Last synced: 9 days ago - Pushed: about 2 years ago - Stars: 24 - Forks: 3

linkedin/QuantEase

QuantEase, a layer-wise quantization framework, frames the problem as discrete-structured non-convex optimization. Our work leverages Coordinate Descent techniques, offering high-quality solutions without the need for matrix inversion or decomposition.

Language: Python - Size: 209 KB - Last synced: about 2 months ago - Pushed: 3 months ago - Stars: 15 - Forks: 1

jongwooko/NASH-Pruning-Official

About Code for the paper "NASH: A Simple Unified Framework of Structured Pruning for Accelerating Encoder-Decoder Language Models" (EMNLP 2023 Findings)

Language: Python - Size: 114 KB - Last synced: about 1 month ago - Pushed: 8 months ago - Stars: 10 - Forks: 0

bloomberg/minilmv2.bb

Our open source implementation of MiniLMv2 (https://aclanthology.org/2021.findings-acl.188)

Language: Python - Size: 30.3 KB - Last synced: about 2 months ago - Pushed: 12 months ago - Stars: 60 - Forks: 6

lhyfst/knowledge-distillation-papers

knowledge distillation papers

Size: 321 KB - Last synced: 3 months ago - Pushed: over 1 year ago - Stars: 720 - Forks: 80

Zhen-Dong/Awesome-Quantization-Papers

List of papers related to neural network quantization in recent AI conferences and journals.

Size: 277 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 245 - Forks: 27

JetRunner/BERT-of-Theseus

⛵️The official PyTorch implementation for "BERT-of-Theseus: Compressing BERT by Progressive Module Replacing" (EMNLP 2020).

Language: Python - Size: 1.04 MB - Last synced: 3 months ago - Pushed: 12 months ago - Stars: 302 - Forks: 39

ismail31416/LumiNet

The official implementation of LumiNet: The Bright Side of Perceptual Knowledge Distillation https://arxiv.org/abs/2310.03669

Language: Jupyter Notebook - Size: 14.3 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 11 - Forks: 1

deep-fry/mayo

Mayo: Auto-generation of hardware-friendly deep neural networks. Dynamic Channel Pruning: Feature Boosting and Suppression.

Language: Python - Size: 33.2 MB - Last synced: about 1 month ago - Pushed: over 4 years ago - Stars: 113 - Forks: 21

htqin/BiBench

This project is the official implementation of our accepted ICML 2023 paper BiBench: Benchmarking and Analyzing Network Binarization.

Language: Python - Size: 110 KB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 41 - Forks: 3

memgonzales/mirror-segmentation

Presented at the 2023 International Conference in Central Europe on Computer Graphics, Visualization and Computer Vision (WSCG 2023). Lightweight mirror segmentation CNN that uses an EfficientNet backbone, employs parallel convolutional layers to capture edge features, and applies filter pruning for model compression

Language: Python - Size: 197 MB - Last synced: about 2 months ago - Pushed: 11 months ago - Stars: 2 - Forks: 0

Neural-Dreamers/Forest-Sound-Analysis-on-Edge

A Comparative Analysis of Sound Data Pre-processing and Deep Learning Model Compression Techniques: A Study on Forest Sound Classification

Language: Jupyter Notebook - Size: 49.8 KB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0

christopheitenberger/VSOL

Versioning System for Online Learning systems (VSOL)

Language: Python - Size: 3.09 MB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0

Sharpiless/Yolov5-distillation-train-inference

Yolov5 distillation training | Yolov5知识蒸馏训练,支持训练自己的数据

Language: Python - Size: 2.36 MB - Last synced: 3 months ago - Pushed: over 1 year ago - Stars: 193 - Forks: 31

musco-ai/musco-pytorch

MUSCO: MUlti-Stage COmpression of neural networks

Language: Jupyter Notebook - Size: 681 KB - Last synced: 4 days ago - Pushed: over 3 years ago - Stars: 73 - Forks: 16

Peterisfar/YOLOV3

yolov3 by pytorch

Language: Python - Size: 17.3 MB - Last synced: 3 months ago - Pushed: almost 2 years ago - Stars: 196 - Forks: 52

VainF/Diff-Pruning

[NeurIPS 2023] Structural Pruning for Diffusion Models

Language: Python - Size: 25.2 MB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 102 - Forks: 6

StijnVerdenius/SNIP-it

This repository is the official implementation of the paper Pruning via Iterative Ranking of Sensitivity Statistics and implements novel pruning / compression algorithms for deep learning / neural networks. Amongst others it implements structured pruning before training, its actual parameter shrinking and unstructured before/during training.

Language: Python - Size: 1.87 MB - Last synced: 4 months ago - Pushed: about 1 year ago - Stars: 28 - Forks: 4

DwangoMediaVillage/keras_compressor

Model Compression CLI Tool for Keras.

Language: Python - Size: 19.5 KB - Last synced: 2 months ago - Pushed: about 5 years ago - Stars: 157 - Forks: 39

ZexinChen/FastPose

pytorch realtime multi person keypoint estimation

Language: Jupyter Notebook - Size: 7.68 MB - Last synced: 3 months ago - Pushed: over 5 years ago - Stars: 39 - Forks: 10

VITA-Group/PrAC-LTH

[ICML 2021] "Efficient Lottery Ticket Finding: Less Data is More" by Zhenyu Zhang*, Xuxi Chen*, Tianlong Chen*, Zhangyang Wang

Language: Python - Size: 562 KB - Last synced: about 2 months ago - Pushed: over 2 years ago - Stars: 25 - Forks: 3

yehuitang/Pruning

Code for "Co-Evolutionary Compression for Unpaired Image Translation" (ICCV 2019), "SCOP: Scientific Control for Reliable Neural Network Pruning" (NeurIPS 2020) and “Manifold Regularized Dynamic Network Pruning” (CVPR 2021).

Language: Python - Size: 1.57 MB - Last synced: 5 months ago - Pushed: almost 3 years ago - Stars: 237 - Forks: 47

iamhankai/Versatile-Filters

Pytorch code for paper: Learning Versatile Filters for Efficient Convolutional Neural Networks (NeurIPS 2018)

Language: Python - Size: 121 KB - Last synced: about 12 hours ago - Pushed: over 4 years ago - Stars: 79 - Forks: 16

NVlabs/condensa

Programmable Neural Network Compression

Language: Python - Size: 16.2 MB - Last synced: 11 days ago - Pushed: about 2 years ago - Stars: 146 - Forks: 26

mit-han-lab/amc-models

[ECCV 2018] AMC: AutoML for Model Compression and Acceleration on Mobile Devices

Language: Python - Size: 37.1 KB - Last synced: 2 months ago - Pushed: over 3 years ago - Stars: 164 - Forks: 27

Z7zuqer/model-compression-and-acceleration-4-DNN

model-compression-and-acceleration-4-DNN

Size: 67.7 MB - Last synced: 4 days ago - Pushed: over 5 years ago - Stars: 21 - Forks: 4

SKKU-ESLAB/Auto-Compression

Automatic DNN compression tool with various model compression and neural architecture search techniques

Language: C - Size: 103 MB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 20 - Forks: 20

Related Keywords
model-compression 231 deep-learning 70 pytorch 60 pruning 59 quantization 42 knowledge-distillation 41 machine-learning 26 deep-neural-networks 16 python 15 computer-vision 15 model-pruning 14 tensorflow 14 distillation 14 bert 12 neural-architecture-search 12 efficient-deep-learning 12 natural-language-processing 11 automl 10 neural-network 10 convolutional-neural-networks 10 network-pruning 10 model-acceleration 10 nlp 10 efficient-inference 9 compression 8 awesome-list 8 channel-pruning 8 llm 7 language-model 7 transformer 7 model-optimization 7 sparsity 7 keras 7 transformers 6 neural-network-pruning 6 large-language-models 6 efficient-model 6 quantization-aware-training 6 neural-networks 6 object-detection 6 efficient-neural-networks 5 cnn 5 diffusion-models 5 filter-pruning 5 image-classification 5 weight-pruning 5 model-quantization 5 vision-transformer 5 data-science 5 hyperparameter-optimization 5 nas 5 model-deployment 4 papers 4 text-classification 4 kd 4 natural-language-understanding 4 knowledge-transfer 4 model-distillation 4 binary-neural-networks 4 super-resolution 4 transfer-learning 4 structured-pruning 4 federated-learning 4 post-training-quantization 4 llama 4 sparsification 4 neural-network-compression 4 edge-computing 4 optimization 3 quantized-neural-networks 3 binarization 3 model-comparison 3 speech 3 acceleration 3 generative-ai 3 data-visualization 3 recurrent-neural-networks 3 ensemble-learning 3 feature-engineering 3 stable-diffusion 3 tensorrt 3 artificial-intelligence 3 face-recognition 3 classification 3 vision-transformers 3 ai 3 domain-adaptation 3 neurips-2019 3 unstructured-pruning 3 micronet-challenge 3 dnn 3 onnx 3 inference 2 efficientnet 2 unit-pruning 2 quantized-training 2 meta-learning 2 competition 2 xnor-net 2 lottery-ticket-hypothesis 2