GitHub topics: quantized-neural-networks

Repositories

hailo-ai/hailo_model_zoo

The Hailo Model Zoo includes pre-trained models and a full building and evaluation environment

Language: Python - Size: 5.94 MB - Last synced at: about 4 hours ago - Pushed at: about 6 hours ago - Stars: 488 - Forks: 66

tensorflow/model-optimization

A toolkit to optimize ML models for deployment for Keras and TensorFlow, including quantization and pruning.

Language: Python - Size: 2.23 MB - Last synced at: 2 days ago - Pushed at: 6 days ago - Stars: 1,540 - Forks: 327

larq/larq

An Open-Source Library for Training Binarized Neural Networks

Language: Python - Size: 1020 KB - Last synced at: 6 days ago - Pushed at: 11 months ago - Stars: 719 - Forks: 86

McDonnell-Research-Lab/1-bit-per-weight

Training wide residual networks for deployment using a single bit for each weight - Official Code Repository for ICLR 2018 Published Paper

Language: Jupyter Notebook - Size: 32.8 MB - Last synced at: 15 days ago - Pushed at: about 5 years ago - Stars: 37 - Forks: 10

google/qkeras

QKeras: a quantization deep learning library for Tensorflow Keras

Language: Python - Size: 1.57 MB - Last synced at: 9 days ago - Pushed at: about 1 month ago - Stars: 566 - Forks: 109

fastmachinelearning/qonnx

QONNX: Arbitrary-Precision Quantized Neural Networks in ONNX

Language: Python - Size: 5.51 MB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 150 - Forks: 47

LISTENAI/linger

Training tools for the CSK series, based on PyTorch

Language: Python - Size: 22.7 MB - Last synced at: 11 days ago - Pushed at: about 1 year ago - Stars: 25 - Forks: 6

yashkant/enas-quantized-nets

Efficient Neural Architecture Search coupled with Quantized CNNs to search for resource efficient and accurate architectures.

Language: Python - Size: 15 MB - Last synced at: 3 months ago - Pushed at: over 6 years ago - Stars: 5 - Forks: 2

BUG1989/caffe-int8-convert-tools

Generate a quantization parameter file for ncnn framework int8 inference

Language: Python - Size: 622 KB - Last synced at: 4 months ago - Pushed at: almost 5 years ago - Stars: 519 - Forks: 154

Loki-Silvres/Autonomous-Driving-B3RB-buggy

Autonomous Driving project for exploration of robotic perception, sensor fusion and autonomous navigation.

Language: Python - Size: 426 MB - Last synced at: 5 months ago - Pushed at: 7 months ago - Stars: 1 - Forks: 1

harshshah37/Quantized-NSFW-Classifier

This repository contains code for quantization-based NSFW classification models.

Language: Jupyter Notebook - Size: 86.6 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

akashlevy/RRAM-Stuck-At-NN-Modeling

Modeling stuck-at faults for RRAM inference on popular neural networks after quantization

Language: Python - Size: 40 MB - Last synced at: 5 days ago - Pushed at: almost 6 years ago - Stars: 4 - Forks: 1

egorsmkv/optimized-whisper-intel

Run quantized Whisper models only on CPU with Intel hardware

Language: Python - Size: 409 KB - Last synced at: 3 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

yashkant/quantized-nets

Contains code for Binary, Ternary, N-bit Quantized and Hybrid CNNs for low precision experiments.

Language: Python - Size: 43.9 KB - Last synced at: 3 months ago - Pushed at: over 6 years ago - Stars: 24 - Forks: 7

mtwchuang/CSC3001-QLLMChaining

QLLMChain is an AI Chatbot prototype designed to enable users to interact with data easily. The application leverages quantised locally deployed large language models to power information retrieval and interpretation, turning natural language prompts into SQL statements, text summaries, and Python visualisations.

Language: Python - Size: 1.07 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 1 - Forks: 0

BGUCompSci/CNNQuantizationThroughPDEs

Code repository for the paper Quantized Convolutional Neural Networks Through the Lens of Partial Differential Equations

Language: Python - Size: 219 KB - Last synced at: about 1 year ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

htqin/QuantSR

This project is the official implementation of our accepted NeurIPS 2023 (spotlight) paper QuantSR: Accurate Low-bit Quantization for Efficient Image Super-Resolution.

Language: Python - Size: 9.75 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 31 - Forks: 2

heydarimo/VQ-VAE

In this project, we implemented the VQ-VAE algorithm on both the MNIST and CIFAR10 datasets, using MSELoss and NLLLoss.

Language: Jupyter Notebook - Size: 2.95 MB - Last synced at: 10 months ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 0

stracini-git/qnn

Training neural nets with quantized weights on arbitrarily specified bit-depth

Language: Python - Size: 1.2 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 1

Enderdead/Pytorch_Quantize_impls

Some recent Quantizing techniques on PyTorch

Language: Python - Size: 190 KB - Last synced at: about 1 year ago - Pushed at: almost 6 years ago - Stars: 72 - Forks: 10

timeless-spark/CNN-accelerator

Exercises on HW acceleration of quantized neural networks for the course Integrated Systems Architecture at PoliTo

Language: Python - Size: 107 MB - Last synced at: 8 months ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 0

sefaburakokcu/quantized-yolov5

Low-precision (quantized) YOLOv5

Language: Python - Size: 9.38 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 24 - Forks: 5

Zhen-Dong/HAWQ

Quantization library for PyTorch. Support low-precision and mixed-precision quantization, with hardware implementation through TVM.

Language: Python - Size: 691 KB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 361 - Forks: 80

amirgholami/ZeroQ

[CVPR'20] ZeroQ: A Novel Zero Shot Quantization Framework

Language: Python - Size: 5.47 MB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 258 - Forks: 52

EEESlab/CMix-NN

CMix-NN: Mixed Low-Precision CNN Library for Memory-Constrained Edge Devices

Language: C - Size: 166 KB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 32 - Forks: 8

tca19/near-lossless-binarization

This repository contains source code to binarize any real-value word embeddings into binary vectors.

Language: C - Size: 165 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 45 - Forks: 8

cedard234/Grayscale_Verilog_Converter

A python-based utility to convert a grayscale image into verilog code.

Language: Python - Size: 55.7 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

mrusci/training-mixed-precision-quantized-networks

This repository contains the PyTorch scripts to train mixed-precision networks for microcontroller deployment, based on the memory constraints of the target device.

Language: Python - Size: 30.3 KB - Last synced at: over 1 year ago - Pushed at: about 6 years ago - Stars: 45 - Forks: 12

EEESlab/mobilenet_v1_stm32_cmsis_nn

Mobilenet v1 trained on Imagenet for STM32 using extended CMSIS-NN with INT-Q quantization support

Language: C - Size: 1.56 MB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 81 - Forks: 29

YukeWang96/APNN-TC_SC21 (fork of BoyuanFeng/APNN-TC)

Artifact for SC21: APNN-TC: Accelerating Arbitrary Precision Neural Networks on Ampere GPU Tensor Cores.

Size: 67.6 MB - Last synced at: almost 2 years ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 0

Related topics

quantized-neural-networks (42), deep-learning (10), quantization (10), tensorflow (8), pytorch (7), machine-learning (5), keras (5), fpga (4), edge-ai (4), cnn (4), binarized-neural-networks (4), mixed-precision (3), stm32 (3), edge-computing (3), model-compression (3), imagenet (3), neural-network (3), stm32h7 (2), binary-neural-networks (2), binarization (2), artificial-intelligence (2), onnx (2), quantization-aware-training (2), image-processing (2), neural-networks (2), mobilenet (2), deep-neural-networks (2), cortex-m7 (2), convolutional-neural-networks (2), qnn (2), low-power-mcu (2), tensorcore (2), computer-vision (2), compression (2), efficient-neural-networks (2), quantized-networks (2), arm-cortex-m7 (2), adversarial-attacks (2), inference (2), python (2), accelerator (2), cmsis-nn (2), iot (1), tvm (1), efficient-model (1), integer-arithmetic (1), stm32f4 (1), verilog (1), arm (1), arm-cortex-m4 (1), stm32f7 (1), cmsis (1), grayscale (1), cv (1), wordembeddings (1), word-embeddings (1), binary-word-vectors (1), binary-word-embeddings (1), autoencoder (1), tinyml (1), stm32l4 (1), mixed-precision-training (1), multiprocessing (1), pre-processing (1), text (1), quantize-resnet (1), resnet-101 (1), resnet-18 (1), resnet-50 (1), memory (1), pgd (1), gru-neural-networks (1), lstm-neural-networks (1), pruning-optimization (1), robustness (1), image-signal-processing (1), mp-spdz (1), mpc (1), secure (1), deeplearning (1), imagenet-classifier (1), pythorch (1), quantized-neural-network (1), dnn (1), gpu (1), cublaslt (1), mlp (1), cubemxai (1), edge-classification (1), mcu (1), stm32h743zi (1), stmcubemx (1), tflite (1), best-practices (1), engg (1), mlflow (1), ai-accelerators (1), caffe (1), deeplearning-ai (1), int8-inference (1)