An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: efficient-model

microsoft/nn-Meter

A DNN inference latency prediction toolkit for accurately modeling and predicting the latency on diverse edge devices.

Language: Python - Size: 58.3 MB - Last synced at: 1 day ago - Pushed at: 9 months ago - Stars: 351 - Forks: 62

SqueezeAILab/KVQuant

[NeurIPS 2024] KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization

Language: Python - Size: 19.8 MB - Last synced at: 2 days ago - Pushed at: 8 months ago - Stars: 339 - Forks: 30

mit-han-lab/temporal-shift-module

[ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding

Language: Python - Size: 245 KB - Last synced at: 10 days ago - Pushed at: 9 months ago - Stars: 2,104 - Forks: 420

mit-han-lab/proxylessnas

[ICLR 2019] ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware

Language: C++ - Size: 260 MB - Last synced at: 14 days ago - Pushed at: 8 months ago - Stars: 1,436 - Forks: 287

SHI-Labs/Any-Precision-DNNs

Any-Precision Deep Neural Networks (AAAI 2021)

Language: Python - Size: 30.3 KB - Last synced at: 8 days ago - Pushed at: almost 5 years ago - Stars: 60 - Forks: 7

Snigdho8869/DeepDream-StyleTransfer

Explore image transformations with DeepDream Algorithm and Neural Style Transfer in creative image processing.

Language: Jupyter Notebook - Size: 17 MB - Last synced at: 7 days ago - Pushed at: about 1 month ago - Stars: 3 - Forks: 1

kssteven418/I-BERT

[ICML'21 Oral] I-BERT: Integer-only BERT Quantization

Language: Python - Size: 6.38 MB - Last synced at: 16 days ago - Pushed at: about 2 years ago - Stars: 241 - Forks: 34

mit-han-lab/haq

[CVPR 2019, Oral] HAQ: Hardware-Aware Automated Quantization with Mixed Precision

Language: Python - Size: 64.5 KB - Last synced at: about 1 month ago - Pushed at: about 4 years ago - Stars: 380 - Forks: 85

kssteven418/LTP

[KDD'22] Learned Token Pruning for Transformers

Language: Python - Size: 40.1 MB - Last synced at: 15 days ago - Pushed at: about 2 years ago - Stars: 96 - Forks: 18

kssteven418/Q-ASR

[ICASSP'22] Integer-only Zero-shot Quantization for Efficient Speech Recognition

Language: Jupyter Notebook - Size: 41.9 MB - Last synced at: 15 days ago - Pushed at: over 3 years ago - Stars: 31 - Forks: 2

gongzix/Lite-Mind

Official code base for ‘Lite-Mind: Towards Efficient and Robust Brain Representation Learning’

Language: Jupyter Notebook - Size: 22 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 17 - Forks: 0

NoakLiu/GraphSnapShot

GraphSnapShot: Caching Local Structure for Fast Graph Learning

Language: Python - Size: 212 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 23 - Forks: 0

d-li14/HBONet

[ICCV 2019] Harmonious Bottleneck on Two Orthogonal Dimensions, surpassing MobileNetV2

Language: Python - Size: 179 MB - Last synced at: 11 days ago - Pushed at: almost 5 years ago - Stars: 102 - Forks: 16

mit-han-lab/hardware-aware-transformers

[ACL'20] HAT: Hardware-Aware Transformers for Efficient Natural Language Processing

Language: Python - Size: 25.6 MB - Last synced at: 5 months ago - Pushed at: 9 months ago - Stars: 329 - Forks: 50

xvyaward/owq

Code for the AAAI 2024 Oral paper "OWQ: Outlier-Aware Weight Quantization for Efficient Fine-Tuning and Inference of Large Language Models".

Language: Python - Size: 3.03 MB - Last synced at: 5 months ago - Pushed at: about 1 year ago - Stars: 53 - Forks: 5

tuanlda78202/cod

[Thesis'24] Efficient Class Incremental Learning for Object Detection

Language: Python - Size: 194 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 13 - Forks: 0

mit-han-lab/once-for-all

[ICLR 2020] Once for All: Train One Network and Specialize it for Efficient Deployment

Language: Python - Size: 6.83 MB - Last synced at: 11 months ago - Pushed at: over 1 year ago - Stars: 1,841 - Forks: 332

tiangexiang/BiX-NAS

[MICCAI 2021] BiX-NAS: Searching Efficient Bi-directional Architecture for Medical Image Segmentation

Language: Python - Size: 2.9 MB - Last synced at: 12 months ago - Pushed at: over 2 years ago - Stars: 36 - Forks: 8

tarto-dev/tagsy-discord

Tagsy, your friendly Discord bot, designed to enhance server interaction with its intuitive tagging system

Language: Python - Size: 83 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

mit-han-lab/amc

[ECCV 2018] AMC: AutoML for Model Compression and Acceleration on Mobile Devices

Language: Python - Size: 17.6 KB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 416 - Forks: 108

lironui/ABCNet

The semantic segmentation of remote sensing images

Language: Python - Size: 5.29 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 22 - Forks: 2

ozora-ogino/efficient_backbones

Implementation of efficient backbones for computer vision tasks.

Language: Python - Size: 19.5 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 4 - Forks: 4

mit-han-lab/amc-models

[ECCV 2018] AMC: AutoML for Model Compression and Acceleration on Mobile Devices

Language: Python - Size: 37.1 KB - Last synced at: about 1 year ago - Pushed at: about 4 years ago - Stars: 164 - Forks: 27

amirgholami/ZeroQ

[CVPR'20] ZeroQ: A Novel Zero Shot Quantization Framework

Language: Python - Size: 5.47 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 258 - Forks: 52

Kthyeon/KAIST-AI-NeurIPS2019-MicroNet-2nd-place-solution

NeurIPS CD 2019, MicroNet Challenge hosted by Google and DeepMind researchers: "Efficient Model for Image Classification With Regularization Tricks".

Language: Python - Size: 33.3 MB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 23 - Forks: 7

linksense/EfficientNet.PyTorch

Concise, Modular, Human-friendly PyTorch implementation of EfficientNet with Pre-trained Weights.

Language: Python - Size: 25.4 KB - Last synced at: over 1 year ago - Pushed at: about 5 years ago - Stars: 31 - Forks: 5

youngwanLEE/VoV3D

Efficient 3D Backbone Network for Temporal Modeling

Language: Python - Size: 207 KB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 105 - Forks: 5

01-vyom/melanoma-classification

Melanoma Classification using Semi-supervised learning

Language: Python - Size: 5.17 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 2

xternalz/SDPoint

Stochastic Downsampling for Cost-Adjustable Inference and Improved Regularization in Convolutional Networks

Language: Python - Size: 10.7 KB - Last synced at: about 1 year ago - Pushed at: over 5 years ago - Stars: 18 - Forks: 4

linksense/MixNet-PyTorch

Concise, Modular, Human-friendly PyTorch implementation of MixNet with Pre-trained Weights.

Language: Python - Size: 57.4 MB - Last synced at: over 1 year ago - Pushed at: about 5 years ago - Stars: 16 - Forks: 3

szq0214/S2-BNN

S2-BNN: Bridging the Gap Between Self-Supervised Real and 1-bit Neural Networks via Guided Distribution Calibration (CVPR 2021)

Language: Python - Size: 240 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 53 - Forks: 11

mit-han-lab/neurips-micronet

[JMLR'20] NeurIPS 2019 MicroNet Challenge Efficient Language Modeling, Champion

Language: Jupyter Notebook - Size: 65.6 MB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 40 - Forks: 6

bellthomas/VDQN

Exploring Variational Deep Q Networks. A study undertaken for the University of Cambridge's R244 Computer Science Masters Course. Inspired by https://arxiv.org/abs/1711.11225/.

Language: Python - Size: 12.7 MB - Last synced at: about 1 year ago - Pushed at: about 5 years ago - Stars: 2 - Forks: 1

HolmesShuan/PyTorch-MixNet-SS

Extremely light-weight MixNet with Top-1 75.7% and 2.5M params

Language: Python - Size: 8.79 MB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 6 - Forks: 1

Related Keywords
efficient-model (34), quantization (7), model-compression (6), pytorch (6), efficient-neural-networks (5), efficient-inference (5), natural-language-processing (5), automl (5), transformer (4), deep-neural-networks (4), efficientnet (4), deep-learning (4), acceleration (3), imagenet (3), on-device-ai (3), hardware-aware (2), video-understanding (2), temporal-modeling (2), specialization (2), bert (2), pruning (2), pretrained-models (2), segmentation (2), semantic-segmentation (2), computer-vision (2), regularization (2), contrastive-learning (2), self-supervised-learning (2), image-classification (2), mixnet (2), neural-architecture-search (2), python (2), compression (2), edge-ai (2), llm (2), large-language-models (2), machine-learning (2), backbone-networks (1), vov3d (1), vovnet (1), augmentation (1), byol (1), byol-pytorch (1), byol-pytorch-lightning (1), latency (1), efficientnet-b5 (1), kaggle (1), kaggle-competition (1), resnet-101 (1), resnext-101 (1), inference (1), 3d-cnn-architecture (1), pretrained-weights (1), efficientseg (1), efficientnet-pytorch (1), efficientnet-pretrained (1), orthonormality (1), neurips2019 (1), neurips-competition (1), neurips-2019 (1), neurips (1), micronet-challenge (1), cifar100 (1), adaptive-label-smoothing (1), quantized-neural-networks (1), computer-vision-tools (1), lightnet (1), mixnet-pytorch (1), mixnets (1), mixseg (1), binary-neural-networks (1), contrastive-loss (1), distillation-loss (1), knowledge-distillation (1), language-modeling (1), ai (1), ddqn (1), dqn (1), dvdqn (1), qlearning (1), variational-inference (1), vdqn (1), siim-melanoma-classification (1), batch-normalization (1), batchnorm (1), convolutional-networks (1), convolutional-neural-networks (1), cost-adjustable (1), deep-learning-algorithms (1), downsampling (1), imagenet-dataset (1), pooling (1), preact-resnet (1), resnet (1), resnets (1), resnext (1), bifpn (1), edge-computing (1), speech-recognition (1), speech (1)