Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: efficient-model

microsoft/nn-Meter

A DNN inference latency prediction toolkit for accurately modeling and predicting the latency on diverse edge devices.

Language: Python - Size: 58.2 MB - Last synced: about 16 hours ago - Pushed: 3 months ago - Stars: 325 - Forks: 56

AIoT-MLSys-Lab/SVD-LLM

Official Code for "SVD-LLM: Truncation-aware Singular Value Decomposition for Large Language Model Compression"

Language: Python - Size: 734 KB - Last synced: 9 days ago - Pushed: 9 days ago - Stars: 49 - Forks: 6

mit-han-lab/temporal-shift-module

[ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding

Language: Python - Size: 238 KB - Last synced: 18 days ago - Pushed: 8 months ago - Stars: 2,022 - Forks: 418

mit-han-lab/once-for-all

[ICLR 2020] Once for All: Train One Network and Specialize it for Efficient Deployment

Language: Python - Size: 6.83 MB - Last synced: 21 days ago - Pushed: 6 months ago - Stars: 1,841 - Forks: 332

mit-han-lab/proxylessnas

[ICLR 2019] ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware

Language: C++ - Size: 260 MB - Last synced: 30 days ago - Pushed: 10 months ago - Stars: 1,412 - Forks: 282

tiangexiang/BiX-NAS

[MICCAI 2021] BiX-NAS: Searching Efficient Bi-directional Architecture for Medical Image Segmentation

Language: Python - Size: 2.9 MB - Last synced: about 1 month ago - Pushed: over 1 year ago - Stars: 36 - Forks: 8

kssteven418/I-BERT

[ICML'21 Oral] I-BERT: Integer-only BERT Quantization

Language: Python - Size: 6.38 MB - Last synced: about 1 month ago - Pushed: over 1 year ago - Stars: 209 - Forks: 30

SqueezeAILab/KVQuant

KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization

Language: Python - Size: 19.7 MB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 187 - Forks: 14

tarto-dev/tagsy-discord

Tagsy, your friendly Discord bot, designed to enhance server interaction with its intuitive tagging system

Language: Python - Size: 83 KB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 1 - Forks: 0

kssteven418/LTP

[KDD'22] Learned Token Pruning for Transformers

Language: Python - Size: 40.1 MB - Last synced: about 1 month ago - Pushed: over 1 year ago - Stars: 84 - Forks: 14

SHI-Labs/Any-Precision-DNNs

Any-Precision Deep Neural Networks (AAAI 2021)

Language: Python - Size: 30.3 KB - Last synced: about 2 months ago - Pushed: about 4 years ago - Stars: 51 - Forks: 5

mit-han-lab/amc

[ECCV 2018] AMC: AutoML for Model Compression and Acceleration on Mobile Devices

Language: Python - Size: 17.6 KB - Last synced: 2 months ago - Pushed: 7 months ago - Stars: 416 - Forks: 108

mit-han-lab/haq

[CVPR 2019, Oral] HAQ: Hardware-Aware Automated Quantization with Mixed Precision

Language: Python - Size: 64.5 KB - Last synced: 2 months ago - Pushed: over 3 years ago - Stars: 350 - Forks: 83

xvyaward/owq

Code for the AAAI 2024 Oral paper "OWQ: Outlier-Aware Weight Quantization for Efficient Fine-Tuning and Inference of Large Language Models".

Language: Python - Size: 3.03 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 30 - Forks: 5

lironui/ABCNet

The semantic segmentation of remote sensing images

Language: Python - Size: 5.29 MB - Last synced: 4 months ago - Pushed: 8 months ago - Stars: 22 - Forks: 2

ozora-ogino/efficient_backbones

Implementation of efficient backbones for computer vision task.

Language: Python - Size: 19.5 KB - Last synced: 5 months ago - Pushed: over 1 year ago - Stars: 4 - Forks: 4

mit-han-lab/amc-models

[ECCV 2018] AMC: AutoML for Model Compression and Acceleration on Mobile Devices

Language: Python - Size: 37.1 KB - Last synced: 2 months ago - Pushed: over 3 years ago - Stars: 164 - Forks: 27

amirgholami/ZeroQ

[CVPR'20] ZeroQ: A Novel Zero Shot Quantization Framework

Language: Python - Size: 5.47 MB - Last synced: 7 months ago - Pushed: over 3 years ago - Stars: 258 - Forks: 52

Snigdho8869/DeepDream-StyleTransfer

Explore image transformations with DeepDream Algorithm and Neural Style Transfer in creative image processing.

Language: Jupyter Notebook - Size: 12.9 MB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 0

Kthyeon/KAIST-AI-NeurIPS2019-MicroNet-2nd-place-solution

NeurIPSCD2019, MicroNet Challenge hosted by Google, Deepmind Researcher, "Efficient Model for Image Classification With Regularization Tricks".

Language: Python - Size: 33.3 MB - Last synced: 8 months ago - Pushed: over 3 years ago - Stars: 23 - Forks: 7

linksense/EfficientNet.PyTorch

Concise, Modular, Human-friendly PyTorch implementation of EfficientNet with Pre-trained Weights.

Language: Python - Size: 25.4 KB - Last synced: 8 months ago - Pushed: over 4 years ago - Stars: 31 - Forks: 5

mit-han-lab/hardware-aware-transformers

[ACL'20] HAT: Hardware-Aware Transformers for Efficient Natural Language Processing

Language: Python - Size: 16.7 MB - Last synced: 7 months ago - Pushed: over 3 years ago - Stars: 312 - Forks: 44

kssteven418/Q-ASR

[ICASSP'22] Integer-only Zero-shot Quantization for Efficient Speech Recognition

Language: Jupyter Notebook - Size: 41.9 MB - Last synced: about 1 month ago - Pushed: over 2 years ago - Stars: 29 - Forks: 2

youngwanLEE/VoV3D

Efficient 3D Backbone Network for Temporal Modeling

Language: Python - Size: 207 KB - Last synced: 7 months ago - Pushed: about 3 years ago - Stars: 105 - Forks: 5

01-vyom/melanoma-classification

Melanoma Classification using Semi-supervised learning

Language: Python - Size: 5.17 MB - Last synced: 9 months ago - Pushed: over 1 year ago - Stars: 1 - Forks: 2

d-li14/HBONet

[ICCV 2019] Harmonious Bottleneck on Two Orthogonal Dimensions, surpassing MobileNetV2

Language: Python - Size: 179 MB - Last synced: 7 months ago - Pushed: about 4 years ago - Stars: 102 - Forks: 16

xternalz/SDPoint

Stochastic Downsampling for Cost-Adjustable Inference and Improved Regularization in Convolutional Networks

Language: Python - Size: 10.7 KB - Last synced: about 2 months ago - Pushed: over 4 years ago - Stars: 18 - Forks: 4

linksense/MixNet-PyTorch

Concise, Modular, Human-friendly PyTorch implementation of MixNet with Pre-trained Weights.

Language: Python - Size: 57.4 MB - Last synced: 8 months ago - Pushed: about 4 years ago - Stars: 16 - Forks: 3

szq0214/S2-BNN

S2-BNN: Bridging the Gap Between Self-Supervised Real and 1-bit Neural Networks via Guided Distribution Calibration (CVPR 2021)

Language: Python - Size: 240 KB - Last synced: over 1 year ago - Pushed: almost 3 years ago - Stars: 53 - Forks: 11

mit-han-lab/neurips-micronet

[JMLR'20] NeurIPS 2019 MicroNet Challenge Efficient Language Modeling, Champion

Language: Jupyter Notebook - Size: 65.6 MB - Last synced: over 1 year ago - Pushed: over 3 years ago - Stars: 40 - Forks: 6

bellthomas/VDQN

Exploring Variational Deep Q Networks. A study undertaken for the University of Cambridge's R244 Computer Science Masters Course. Inspired by https://arxiv.org/abs/1711.11225/.

Language: Python - Size: 12.7 MB - Last synced: 5 months ago - Pushed: over 4 years ago - Stars: 2 - Forks: 1

HolmesShuan/PyTorch-MixNet-SS

Extremely light-weight MixNet with Top-1 75.7% and 2.5M params

Language: Python - Size: 8.79 MB - Last synced: over 1 year ago - Pushed: over 4 years ago - Stars: 6 - Forks: 1

Related Keywords
efficient-model 32 quantization 7 model-compression 6 pytorch 6 efficient-neural-networks 5 automl 5 natural-language-processing 5 efficient-inference 5 deep-learning 4 deep-neural-networks 4 efficientnet 4 transformer 4 on-device-ai 3 acceleration 3 large-language-models 3 imagenet 3 llm 2 mixnet 2 compression 2 image-classification 2 contrastive-learning 2 pruning 2 computer-vision 2 self-supervised-learning 2 bert 2 pretrained-models 2 semantic-segmentation 2 segmentation 2 neural-architecture-search 2 machine-learning 2 python 2 regularization 2 temporal-modeling 2 video-understanding 2 hardware-aware 2 specialization 2 edge-ai 2 siim-melanoma-classification 1 efficientnet-pytorch 1 iccv2019 1 resnext-101 1 resnet-101 1 kaggle-competition 1 efficientseg 1 kaggle 1 efficientnet-b5 1 byol-pytorch-lightning 1 vdqn 1 byol-pytorch 1 byol 1 augmentation 1 pretrained-weights 1 vovnet 1 machine-translation 1 vov3d 1 automatic-speech-recognition 1 jasper 1 quartznet 1 speech 1 speech-recognition 1 3d-cnn-architecture 1 backbone-networks 1 variational-inference 1 qlearning 1 dvdqn 1 dqn 1 ddqn 1 ai 1 language-modeling 1 knowledge-distillation 1 distillation-loss 1 contrastive-loss 1 binary-neural-networks 1 mixseg 1 mixnets 1 mixnet-pytorch 1 lightnet 1 bifpn 1 resnext 1 resnets 1 resnet 1 preact-resnet 1 pooling 1 imagenet-dataset 1 downsampling 1 deep-learning-algorithms 1 cost-adjustable 1 convolutional-neural-networks 1 convolutional-networks 1 batchnorm 1 batch-normalization 1 mobilenetv2 1 efficientnet-pretrained 1 mixed-precision 1 channel-pruning 1 automl-for-compression 1 on-demand 1 any-precision 1 user-friendly 1 tagging 1