GitHub topics: efficient-model
microsoft/nn-Meter
A DNN inference latency prediction toolkit for accurately modeling and predicting the latency on diverse edge devices.
Language: Python - Size: 58.3 MB - Last synced at: 1 day ago - Pushed at: 9 months ago - Stars: 351 - Forks: 62

SqueezeAILab/KVQuant
[NeurIPS 2024] KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization
Language: Python - Size: 19.8 MB - Last synced at: 2 days ago - Pushed at: 8 months ago - Stars: 339 - Forks: 30

mit-han-lab/temporal-shift-module
[ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding
Language: Python - Size: 245 KB - Last synced at: 10 days ago - Pushed at: 9 months ago - Stars: 2,104 - Forks: 420

mit-han-lab/proxylessnas
[ICLR 2019] ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware
Language: C++ - Size: 260 MB - Last synced at: 14 days ago - Pushed at: 8 months ago - Stars: 1,436 - Forks: 287

SHI-Labs/Any-Precision-DNNs
Any-Precision Deep Neural Networks (AAAI 2021)
Language: Python - Size: 30.3 KB - Last synced at: 8 days ago - Pushed at: almost 5 years ago - Stars: 60 - Forks: 7

Snigdho8869/DeepDream-StyleTransfer
Explore image transformations with DeepDream Algorithm and Neural Style Transfer in creative image processing.
Language: Jupyter Notebook - Size: 17 MB - Last synced at: 7 days ago - Pushed at: about 1 month ago - Stars: 3 - Forks: 1

kssteven418/I-BERT
[ICML'21 Oral] I-BERT: Integer-only BERT Quantization
Language: Python - Size: 6.38 MB - Last synced at: 16 days ago - Pushed at: about 2 years ago - Stars: 241 - Forks: 34

mit-han-lab/haq
[CVPR 2019, Oral] HAQ: Hardware-Aware Automated Quantization with Mixed Precision
Language: Python - Size: 64.5 KB - Last synced at: about 1 month ago - Pushed at: about 4 years ago - Stars: 380 - Forks: 85

kssteven418/LTP
[KDD'22] Learned Token Pruning for Transformers
Language: Python - Size: 40.1 MB - Last synced at: 15 days ago - Pushed at: about 2 years ago - Stars: 96 - Forks: 18

kssteven418/Q-ASR
[ICASSP'22] Integer-only Zero-shot Quantization for Efficient Speech Recognition
Language: Jupyter Notebook - Size: 41.9 MB - Last synced at: 15 days ago - Pushed at: over 3 years ago - Stars: 31 - Forks: 2

gongzix/Lite-Mind
Official code base for ‘Lite-Mind : Towards Efficient and Robust Brain Representation Learning’
Language: Jupyter Notebook - Size: 22 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 17 - Forks: 0

NoakLiu/GraphSnapShot
GraphSnapShot: Caching Local Structure for Fast Graph Learning
Language: Python - Size: 212 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 23 - Forks: 0

d-li14/HBONet
[ICCV 2019] Harmonious Bottleneck on Two Orthogonal Dimensions, surpassing MobileNetV2
Language: Python - Size: 179 MB - Last synced at: 11 days ago - Pushed at: almost 5 years ago - Stars: 102 - Forks: 16

mit-han-lab/hardware-aware-transformers
[ACL'20] HAT: Hardware-Aware Transformers for Efficient Natural Language Processing
Language: Python - Size: 25.6 MB - Last synced at: 5 months ago - Pushed at: 9 months ago - Stars: 329 - Forks: 50

xvyaward/owq
Code for the AAAI 2024 Oral paper "OWQ: Outlier-Aware Weight Quantization for Efficient Fine-Tuning and Inference of Large Language Models".
Language: Python - Size: 3.03 MB - Last synced at: 5 months ago - Pushed at: about 1 year ago - Stars: 53 - Forks: 5

tuanlda78202/cod
[Thesis'24] Efficient Class Incremental Learning for Object Detection
Language: Python - Size: 194 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 13 - Forks: 0

mit-han-lab/once-for-all
[ICLR 2020] Once for All: Train One Network and Specialize it for Efficient Deployment
Language: Python - Size: 6.83 MB - Last synced at: 11 months ago - Pushed at: over 1 year ago - Stars: 1,841 - Forks: 332

tiangexiang/BiX-NAS
[MICCAI 2021] BiX-NAS: Searching Efficient Bi-directional Architecture for Medical Image Segmentation
Language: Python - Size: 2.9 MB - Last synced at: 12 months ago - Pushed at: over 2 years ago - Stars: 36 - Forks: 8

tarto-dev/tagsy-discord
Tagsy, your friendly Discord bot, designed to enhance server interaction with its intuitive tagging system
Language: Python - Size: 83 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

mit-han-lab/amc
[ECCV 2018] AMC: AutoML for Model Compression and Acceleration on Mobile Devices
Language: Python - Size: 17.6 KB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 416 - Forks: 108

lironui/ABCNet
The semantic segmentation of remote sensing images
Language: Python - Size: 5.29 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 22 - Forks: 2

ozora-ogino/efficient_backbones
Implementation of efficient backbones for computer vision task.
Language: Python - Size: 19.5 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 4 - Forks: 4

mit-han-lab/amc-models
[ECCV 2018] AMC: AutoML for Model Compression and Acceleration on Mobile Devices
Language: Python - Size: 37.1 KB - Last synced at: about 1 year ago - Pushed at: about 4 years ago - Stars: 164 - Forks: 27

amirgholami/ZeroQ
[CVPR'20] ZeroQ: A Novel Zero Shot Quantization Framework
Language: Python - Size: 5.47 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 258 - Forks: 52

Kthyeon/KAIST-AI-NeurIPS2019-MicroNet-2nd-place-solution
NeurIPSCD2019, MicroNet Challenge hosted by Google, Deepmind Researcher, "Efficient Model for Image Classification With Regularization Tricks".
Language: Python - Size: 33.3 MB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 23 - Forks: 7

linksense/EfficientNet.PyTorch
Concise, Modular, Human-friendly PyTorch implementation of EfficientNet with Pre-trained Weights.
Language: Python - Size: 25.4 KB - Last synced at: over 1 year ago - Pushed at: about 5 years ago - Stars: 31 - Forks: 5

youngwanLEE/VoV3D
Efficient 3D Backbone Network for Temporal Modeling
Language: Python - Size: 207 KB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 105 - Forks: 5

01-vyom/melanoma-classification
Melanoma Classification using Semi-supervised learning
Language: Python - Size: 5.17 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 2

xternalz/SDPoint
Stochastic Downsampling for Cost-Adjustable Inference and Improved Regularization in Convolutional Networks
Language: Python - Size: 10.7 KB - Last synced at: about 1 year ago - Pushed at: over 5 years ago - Stars: 18 - Forks: 4

linksense/MixNet-PyTorch
Concise, Modular, Human-friendly PyTorch implementation of MixNet with Pre-trained Weights.
Language: Python - Size: 57.4 MB - Last synced at: over 1 year ago - Pushed at: about 5 years ago - Stars: 16 - Forks: 3

szq0214/S2-BNN
S2-BNN: Bridging the Gap Between Self-Supervised Real and 1-bit Neural Networks via Guided Distribution Calibration (CVPR 2021)
Language: Python - Size: 240 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 53 - Forks: 11

mit-han-lab/neurips-micronet
[JMLR'20] NeurIPS 2019 MicroNet Challenge Efficient Language Modeling, Champion
Language: Jupyter Notebook - Size: 65.6 MB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 40 - Forks: 6

bellthomas/VDQN
Exploring Variational Deep Q Networks. A study undertaken for the University of Cambridge's R244 Computer Science Masters Course. Inspired by https://arxiv.org/abs/1711.11225/.
Language: Python - Size: 12.7 MB - Last synced at: about 1 year ago - Pushed at: about 5 years ago - Stars: 2 - Forks: 1

HolmesShuan/PyTorch-MixNet-SS
Extremely light-weight MixNet with Top-1 75.7% and 2.5M params
Language: Python - Size: 8.79 MB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 6 - Forks: 1
