An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: model-inference

wangxb96/Awesome-EdgeAI

Resources of our survey paper "Optimizing Edge AI: A Comprehensive Survey on Data, Model, and System Strategies"

Size: 3.64 MB - Last synced at: 1 day ago - Pushed at: 8 months ago - Stars: 92 - Forks: 9

bentoml/OpenLLM

Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.

Language: Python - Size: 41.1 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 11,738 - Forks: 763

hnthap/cat-or-dog

A web application that uses a pre-trained machine learning model to classify images as either a cat or a dog. The project leverages OpenVINO Model Server for inference, a Node.js backend for preprocessing and API handling, and a React-based frontend for user interaction.

Language: TypeScript - Size: 14.2 MB - Last synced at: 22 days ago - Pushed at: 22 days ago - Stars: 0 - Forks: 0

bentoml/CLIP-API-service

CLIP as a service - Embed image and sentences, object recognition, visual reasoning, image classification and reverse image search

Language: Jupyter Notebook - Size: 1.01 MB - Last synced at: 3 days ago - Pushed at: about 1 month ago - Stars: 64 - Forks: 4

EmbeddedLLM/embeddedllm

EmbeddedLLM: API server for Embedded Device Deployment. Currently support CUDA/OpenVINO/IpexLLM/DirectML/CPU

Language: Python - Size: 12.6 MB - Last synced at: 6 days ago - Pushed at: 11 months ago - Stars: 42 - Forks: 1

akrisanov/inference-engineering-journey

A personal journey into model inference engineering — learning, building, and sharing along the way.

Language: Jupyter Notebook - Size: 18.3 MB - Last synced at: 28 days ago - Pushed at: 28 days ago - Stars: 1 - Forks: 0

Keval10github/Vehicle-Detection

This vehicle identification project utilizes the YOLOv5 deep learning model for detecting and classifying vehicles from images, videos, and live streams. It supports real-time inference, saving outputs with bounding boxes, confidence scores, and class labels, making it ideal for traffic monitoring and smart surveillance systems.

Language: Python - Size: 670 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

AlvinHon/distributed-model-inference

Example distributed system for ML model inference by using Kafka, including spring boot REST+JPA server with Java consumer program

Language: Java - Size: 1.41 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

DAVIDNYARKO123/edge-tpu-silva

Streamlining the process for seamless execution of PyCoral in running TensorFlow Lite models on an Edge TPU USB.

Language: Python - Size: 24.7 MB - Last synced at: 18 days ago - Pushed at: over 1 year ago - Stars: 30 - Forks: 3

hegongshan/Storage-for-AI-Paper

Accelerating AI Training and Inference from Storage Perspective (Must-read Papers on Storage for AI)

Size: 28.3 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 30 - Forks: 3

Koldim2001/Image_captioning

Генерация описаний к изображениям с помощью различных архитектур нейронных сетей

Language: Jupyter Notebook - Size: 34 MB - Last synced at: 18 days ago - Pushed at: over 2 years ago - Stars: 18 - Forks: 0

SayamAlt/Symptoms-Disease-Text-Classification

Successfully developed a fine-tuned BERT transformer model which can accurately classify symptoms to their corresponding diseases upto an accuracy of 89%.

Language: Jupyter Notebook - Size: 860 KB - Last synced at: 4 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

debajitadhikary/EmoVision

😊📸 Real-Time Facial Emotion Recognition using Deep Learning 🤖🧠

Language: Python - Size: 32.5 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 3 - Forks: 2

Harshitha0706/object-detection-yolov5

Object Detection Using YOLOv5: A machine learning project that leverages YOLOv5 for real-time object detection. This project covers dataset preprocessing, model training, and image inference using OpenCV and PyTorch."

Size: 16.6 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

SayamAlt/Cyberbullying-Classification-using-fine-tuned-DistilBERT

Successfully fine-tuned a pretrained DistilBERT transformer model that can classify social media text data into one of 4 cyberbullying labels i.e. ethnicity/race, gender/sexual, religion and not cyberbullying with a remarkable accuracy of 99%.

Language: Jupyter Notebook - Size: 7.24 MB - Last synced at: 4 months ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

SayamAlt/Financial-News-Sentiment-Analysis

Successfully developed a fine-tuned DistilBERT transformer model which can accurately predict the overall sentiment of a piece of financial news up to an accuracy of nearly 81.5%.

Language: Jupyter Notebook - Size: 745 KB - Last synced at: 4 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

ChaitanyaC22/Udacity-AWS-MLE-ND-Project2-Build-a-ML-Workflow-For-Scones-Unlimited-On-Amazon-SageMaker

The primary objective of this project was to build and deploy an image classification model for Scones Unlimited, a scone-delivery-focused logistic company, using AWS SageMaker.

Language: HTML - Size: 1.07 MB - Last synced at: 5 months ago - Pushed at: over 2 years ago - Stars: 4 - Forks: 1

kwame-mintah/gcp-cloud-run-function-model-inference

A cloud run function to invoke a prediction against a machine learning model that has been trained outside of a cloud provider.

Language: Python - Size: 134 KB - Last synced at: 5 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

SayamAlt/Mental-Health-Classification-using-fine-tuned-DistilBERT

Successfully established a multiclass text classification model by fine-tuning pretrained DistilBERT transformer model to classify several distinct types of mental health statuses such as anxiety, stress, personality disorder, etc. with an accuracy of 77%.

Language: Jupyter Notebook - Size: 2.07 MB - Last synced at: 6 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

SayamAlt/Luxury-Apparel-Product-Category-Classification-using-fine-tuned-DistilBERT

Successfully developed a multiclass text classification model by fine-tuning pretrained DistilBERT transformer model to classify various distinct types of luxury apparels into their respective categories i.e. pants, accessories, underwear, shoes, etc.

Language: Jupyter Notebook - Size: 3.7 MB - Last synced at: 6 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

SayamAlt/Natural-Scenes-Image-Classification-using-CNNs

Successfully established an image classification model using PyTorch to classify the images of several distinct natural sceneries such as mountains, glaciers, forests, seas, streets and buildings with an accuracy of 86%.

Language: Jupyter Notebook - Size: 11.4 MB - Last synced at: 4 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

SayamAlt/Oral-Disease-Classification-using-CNN

Successfully developed an image classification model using PyTorch to classify two types of oral diseases, namely caries and gingivitis.

Language: Jupyter Notebook - Size: 77.6 MB - Last synced at: 5 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

C-bianc/NER-task

Token classification for named entities

Language: Jupyter Notebook - Size: 3.37 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

SayamAlt/Global-Equity-Forecasting-using-LSTM

Successfully established an LSTM model to effectively forecast global equity based on over 20+ years of historical data of global equity.

Language: Jupyter Notebook - Size: 509 KB - Last synced at: 5 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

SayamAlt/Wine-Cultivator-Classification-using-ANN

Successfully established an ANN model which can classify wine cultivators based on several characteristics of distinct wines.

Language: Jupyter Notebook - Size: 74.2 KB - Last synced at: about 2 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

itancio/churn

Language: Python - Size: 326 MB - Last synced at: 5 months ago - Pushed at: 10 months ago - Stars: 1 - Forks: 0

santosh/image-classifier

POC of image classification using scikit-learn.

Language: Python - Size: 834 KB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

SayamAlt/English-to-Spanish-Language-Translation-using-Seq2Seq-and-Attention

Successfully established a Seq2Seq with attention model which can perform English to Spanish language translation up to an accuracy of almost 97%.

Language: Jupyter Notebook - Size: 1.18 MB - Last synced at: 4 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

SayamAlt/Global-News-Headlines-Text-Summarization

Successfully established a text summarization model using Seq2Seq modeling with Luong Attention, which can give a short and concise summary of the global news headlines.

Language: Jupyter Notebook - Size: 513 KB - Last synced at: 4 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

vinit714/--Deep-Learning-for-Fashion-MNIST--Accessory-Classification-Project

This repository contains Python code to classify fashion items using a Convolutional Neural Network (CNN) implemented with TensorFlow and Keras. It includes data preprocessing, model building, training, evaluation, and visualization of results.

Language: Jupyter Notebook - Size: 61.5 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

KrajShuffle/Classifying_SpeechAudio_CNN

CNN Based Approach for Audio File Classification. Contains Notebooks Illustrating Data Preprocessing, Feature Extraction, Model Training, & Model Inference Workflows & Overall Pipeline

Language: Jupyter Notebook - Size: 37.1 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

GauravG-20/Udacity-AWS-MLE-ND-Project2-Build-a-ML-Workflow-For-Scones-Unlimited-On-Amazon-SageMaker

The primary objective of this project was to build and deploy an image classification model for Scones Unlimited, a scone-delivery-focused logistic company, using AWS SageMaker.

Language: HTML - Size: 1.11 MB - Last synced at: 4 months ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

brian-kipkoech-tanui/sagemaker-ML-workflow

Image Classifiers are used in the field of computer vision to identify the content of an image and it is used across a broad variety of industries, from advanced technologies like autonomous vehicles and augmented reality, to eCommerce platforms, and even in diagnostic medicine.

Language: HTML - Size: 978 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

Related Keywords
model-inference 33 model-training-and-evaluation 11 deep-learning 10 text-tokenization 7 natural-language-processing 7 pytorch 6 machine-learning 6 multiclass-classification 6 text-preprocessing 6 image-classification 5 exploratory-data-analysis 4 computer-vision 4 model-architecture-and-implementation 4 python 4 text-classification 4 distilbert-model 4 data-preprocessing 4 aws 3 convolutional-neural-networks 3 aws-lambda 3 model-evaluation 3 aws-s3 3 data-exploration-and-preprocessing 3 json 3 sagemaker-studio 3 fine-tune-bert-tensorflow 3 hugging-face-transformers 3 sagemaker-deployment 3 model-deployment 3 llm 3 object-detection 3 yolov5 2 website 2 model-training 2 fine-tuning-bert 2 jupyter-notebook 2 aws-ec2 2 cnn 2 edge-computing 2 aws-sagemaker 2 aws-statemachine 2 aws-step-functions 2 endpoint 2 image-transformations 2 llm-serving 2 mistral 2 mlops 2 open-source-llm 2 text-generation 2 python3 2 model-testing 2 luong-attention 2 streamlit 2 data-augmentation 2 llm-inference 2 image-preprocessing 2 lstm 2 llama 2 attention-mechanism 2 tensorflow 2 distilbert-fine-tuning 2 lambda-functions 1 deployment 1 aws-iam 1 sentiment-analysis 1 data-exploration 1 cyberbullying-detection 1 deep-learning-frameworks 1 data-handling-with-pandas 1 bounding-box-annotation 1 real-time 1 opencv 1 nvidia 1 neural-networks 1 haarcascade 1 gpu-computing 1 git 1 facial-expression-recognition 1 classification 1 parking-lot 1 scikit-learn 1 attention-is-all-you-need 1 attention-model 1 bert-transformer 1 language-translation 1 neural-machine-translation 1 seq2seq-modeling 1 seq2seq-model 1 text-summarization 1 evaluation-metrics 1 fashion-mnist 1 normalization 1 visualization 1 feature-engineering 1 feature-extraction 1 metrics-visualization 1 speech-classification 1 aws-endpoint 1 aws-state-machine 1 udacity-machine-learning-fundamentals 1