An open API service providing repository metadata for many open source software ecosystems.

Topic: "efficient-model"

mit-han-lab/temporal-shift-module

[ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding

Language: Python - Size: 245 KB - Last synced at: about 2 months ago - Pushed at: about 1 year ago - Stars: 2,122 - Forks: 420

mit-han-lab/once-for-all

[ICLR 2020] Once for All: Train One Network and Specialize it for Efficient Deployment

Language: Python - Size: 6.83 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 1,924 - Forks: 343

mit-han-lab/proxylessnas

[ICLR 2019] ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware

Language: C++ - Size: 260 MB - Last synced at: 2 months ago - Pushed at: 11 months ago - Stars: 1,440 - Forks: 287

mit-han-lab/amc

[ECCV 2018] AMC: AutoML for Model Compression and Acceleration on Mobile Devices

Language: Python - Size: 17.6 KB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 441 - Forks: 115

mit-han-lab/haq

[CVPR 2019, Oral] HAQ: Hardware-Aware Automated Quantization with Mixed Precision

Language: Python - Size: 64.5 KB - Last synced at: 2 months ago - Pushed at: over 4 years ago - Stars: 384 - Forks: 85

SqueezeAILab/KVQuant

[NeurIPS 2024] KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization

Language: Python - Size: 19.8 MB - Last synced at: about 11 hours ago - Pushed at: 12 months ago - Stars: 362 - Forks: 32

microsoft/nn-Meter

A DNN inference latency prediction toolkit for accurately modeling and predicting the latency on diverse edge devices.

Language: Python - Size: 58.3 MB - Last synced at: about 17 hours ago - Pushed at: 12 months ago - Stars: 356 - Forks: 63

mit-han-lab/hardware-aware-transformers

[ACL'20] HAT: Hardware-Aware Transformers for Efficient Natural Language Processing

Language: Python - Size: 25.6 MB - Last synced at: about 2 months ago - Pushed at: about 1 year ago - Stars: 334 - Forks: 52

amirgholami/ZeroQ

[CVPR'20] ZeroQ: A Novel Zero Shot Quantization Framework

Language: Python - Size: 5.47 MB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 258 - Forks: 52

kssteven418/I-BERT

[ICML'21 Oral] I-BERT: Integer-only BERT Quantization

Language: Python - Size: 6.38 MB - Last synced at: 2 months ago - Pushed at: over 2 years ago - Stars: 246 - Forks: 36

mit-han-lab/amc-models

[ECCV 2018] AMC: AutoML for Model Compression and Acceleration on Mobile Devices

Language: Python - Size: 37.1 KB - Last synced at: 2 months ago - Pushed at: over 4 years ago - Stars: 167 - Forks: 27

youngwanLEE/VoV3D

Efficient 3D Backbone Network for Temporal Modeling

Language: Python - Size: 207 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 105 - Forks: 5

d-li14/HBONet

[ICCV 2019] Harmonious Bottleneck on Two Orthogonal Dimensions, surpassing MobileNetV2

Language: Python - Size: 179 MB - Last synced at: 4 months ago - Pushed at: about 5 years ago - Stars: 102 - Forks: 16

kssteven418/LTP

[KDD'22] Learned Token Pruning for Transformers

Language: Python - Size: 40.1 MB - Last synced at: 7 days ago - Pushed at: over 2 years ago - Stars: 98 - Forks: 18

xvyaward/owq

Code for the AAAI 2024 Oral paper "OWQ: Outlier-Aware Weight Quantization for Efficient Fine-Tuning and Inference of Large Language Models".

Language: Python - Size: 3.03 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 61 - Forks: 7

SHI-Labs/Any-Precision-DNNs

Any-Precision Deep Neural Networks (AAAI 2021)

Language: Python - Size: 30.3 KB - Last synced at: 3 months ago - Pushed at: about 5 years ago - Stars: 60 - Forks: 7

szq0214/S2-BNN

S2-BNN: Bridging the Gap Between Self-Supervised Real and 1-bit Neural Networks via Guided Distribution Calibration (CVPR 2021)

Language: Python - Size: 240 KB - Last synced at: over 2 years ago - Pushed at: almost 4 years ago - Stars: 53 - Forks: 11

mit-han-lab/neurips-micronet

[JMLR'20] NeurIPS 2019 MicroNet Challenge Efficient Language Modeling, Champion

Language: Jupyter Notebook - Size: 65.6 MB - Last synced at: 20 days ago - Pushed at: over 4 years ago - Stars: 40 - Forks: 8

lironui/ABCNet

The semantic segmentation of remote sensing images

Language: Python - Size: 5.29 MB - Last synced at: 3 months ago - Pushed at: almost 2 years ago - Stars: 36 - Forks: 4

tiangexiang/BiX-NAS

[MICCAI 2021] BiX-NAS: Searching Efficient Bi-directional Architecture for Medical Image Segmentation

Language: Python - Size: 2.9 MB - Last synced at: about 1 year ago - Pushed at: almost 3 years ago - Stars: 36 - Forks: 8

kssteven418/Q-ASR

[ICASSP'22] Integer-only Zero-shot Quantization for Efficient Speech Recognition

Language: Jupyter Notebook - Size: 41.9 MB - Last synced at: 4 months ago - Pushed at: almost 4 years ago - Stars: 31 - Forks: 2

linksense/EfficientNet.PyTorch

Concise, Modular, Human-friendly PyTorch implementation of EfficientNet with Pre-trained Weights.

Language: Python - Size: 25.4 KB - Last synced at: almost 2 years ago - Pushed at: over 5 years ago - Stars: 31 - Forks: 5

NoakLiu/GraphSnapShot

GraphSnapShot: Caching Local Structure for Fast Graph Learning [Efficient ML System]

Language: Python - Size: 212 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 30 - Forks: 1

Kthyeon/KAIST-AI-NeurIPS2019-MicroNet-2nd-place-solution

NeurIPSCD2019, MicroNet Challenge hosted by Google, Deepmind Researcher, "Efficient Model for Image Classification With Regularization Tricks".

Language: Python - Size: 33.3 MB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 23 - Forks: 7

xternalz/SDPoint

Stochastic Downsampling for Cost-Adjustable Inference and Improved Regularization in Convolutional Networks

Language: Python - Size: 10.7 KB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 18 - Forks: 4

gongzix/Lite-Mind

Official code base for ‘Lite-Mind: Towards Efficient and Robust Brain Representation Learning’

Language: Jupyter Notebook - Size: 22 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 17 - Forks: 0

linksense/MixNet-PyTorch

Concise, Modular, Human-friendly PyTorch implementation of MixNet with Pre-trained Weights.

Language: Python - Size: 57.4 MB - Last synced at: almost 2 years ago - Pushed at: over 5 years ago - Stars: 16 - Forks: 3

tuanlda78202/cod

[Thesis'24] Efficient Class Incremental Learning for Object Detection

Language: Python - Size: 194 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 13 - Forks: 0

HolmesShuan/PyTorch-MixNet-SS

Extremely light-weight MixNet with Top-1 75.7% and 2.5M params

Language: Python - Size: 8.79 MB - Last synced at: over 2 years ago - Pushed at: almost 6 years ago - Stars: 6 - Forks: 1

ozora-ogino/efficient_backbones

Implementation of efficient backbones for computer vision tasks.

Language: Python - Size: 19.5 KB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 4 - Forks: 4

Snigdho8869/DeepDream-StyleTransfer

Explore image transformations with DeepDream Algorithm and Neural Style Transfer in creative image processing.

Language: Jupyter Notebook - Size: 17 MB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 3 - Forks: 1

bellthomas/VDQN

Exploring Variational Deep Q Networks. A study undertaken for the University of Cambridge's R244 Computer Science Masters Course. Inspired by https://arxiv.org/abs/1711.11225/.

Language: Python - Size: 12.7 MB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 2 - Forks: 1

tarto-dev/tagsy-discord

Tagsy, your friendly Discord bot, designed to enhance server interaction with its intuitive tagging system

Language: Python - Size: 83 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

01-vyom/melanoma-classification

Melanoma Classification using Semi-supervised learning

Language: Python - Size: 5.17 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 2

aiarnob23/Zen-Easy

Language: TypeScript - Size: 14.8 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 0 - Forks: 0

Related Topics
quantization (7), model-compression (6), pytorch (6), efficient-neural-networks (5), natural-language-processing (5), efficient-inference (5), automl (5), deep-neural-networks (4), deep-learning (4), transformer (4), efficientnet (4), imagenet (3), acceleration (3), on-device-ai (3), contrastive-learning (2), pretrained-models (2), mixnet (2), image-classification (2), regularization (2), self-supervised-learning (2), specialization (2), hardware-aware (2), python (2), compression (2), edge-ai (2), neural-architecture-search (2), machine-learning (2), pruning (2), bert (2), llm (2), computer-vision (2), temporal-modeling (2), video-understanding (2), segmentation (2), semantic-segmentation (2), large-language-models (2), web-development (1), neurips2019 (1), image-processing (1), orthonormality (1), any-precision (1), on-demand (1), mlsys (1), low-latency (1), nvidia-jetson-nano (1), tsm (1), llama (1), localllama (1), localllm (1), mistral (1), small-models (1), style-transfer-algorithms (1), style-transfer (1), neural-style-transfer-tensorflow (1), neural-style-transfer (1), efficientnet-pretrained (1), efficientnet-pytorch (1), efficientseg (1), pretrained-weights (1), adaptive-label-smoothing (1), cifar100 (1), micronet-challenge (1), neurips (1), neurips-2019 (1), neurips-competition (1), neural-style (1), neural-networks-visualization (1), image-manipulation (1), deepdream (1), artificial-intelligence (1), automl-for-compression (1), channel-pruning (1), knowledge-distillation (1), typescript (1), language-modeling (1), mixed-precision (1), aws-s3 (1), tailwindcss (1), scss (1), express (1), infinite-scroll (1), mern-stack (1), mongoose (1), react (1), text-generation (1), brain-decoding (1), brain-retrieval (1), fmri (1), iccv2019 (1), mobilenetv2 (1), flask (1), machine-translation (1), edge-computing (1), inference (1), deeplearning (1), latency (1), onnx-models (1), deepdreamgenerator (1), tensorflow-models (1), deepdream-model (1)