An open API service providing repository metadata for many open source software ecosystems.

Topic: "model-inference"

bentoml/OpenLLM

Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.

Language: Python - Size: 41.1 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 11,240 - Forks: 719

wangxb96/Awesome-EdgeAI

Resources of our survey paper "Optimizing Edge AI: A Comprehensive Survey on Data, Model, and System Strategies"

Size: 3.64 MB - Last synced at: 7 days ago - Pushed at: 4 months ago - Stars: 83 - Forks: 8

bentoml/CLIP-API-service

CLIP as a service - Embed image and sentences, object recognition, visual reasoning, image classification and reverse image search

Language: Jupyter Notebook - Size: 945 KB - Last synced at: 10 days ago - Pushed at: over 1 year ago - Stars: 61 - Forks: 4

array2d/deepx

Large-scale Auto-Distributed Training/Inference Unified Framework | Memory-Compute-Control Decoupled Architecture | Multi-language SDK & Heterogeneous Hardware Support

Language: C++ - Size: 1.82 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 42 - Forks: 4

EmbeddedLLM/embeddedllm

EmbeddedLLM: API server for Embedded Device Deployment. Currently support CUDA/OpenVINO/IpexLLM/DirectML/CPU

Language: Python - Size: 12.6 MB - Last synced at: 4 days ago - Pushed at: 7 months ago - Stars: 38 - Forks: 1

DAVIDNYARKO123/edge-tpu-silva

Streamlining the process for seamless execution of PyCoral in running TensorFlow Lite models on an Edge TPU USB.

Language: Python - Size: 24.7 MB - Last synced at: 19 days ago - Pushed at: about 1 year ago - Stars: 29 - Forks: 3

kdeps/kdeps

Kdeps is an all-in-one AI framework for building Dockerized full-stack AI applications (FE and BE) that includes open-source LLM models out-of-the-box.

Language: Go - Size: 4.26 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 21 - Forks: 1

hegongshan/Storage-for-AI-Paper

Accelerating AI Training and Inference from Storage Perspective (Must-read Papers on Storage for AI)

Size: 17.6 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 17 - Forks: 2

Koldim2001/Image_captioning

Генерация описаний к изображениям с помощью различных архитектур нейронных сетей

Language: Jupyter Notebook - Size: 34 MB - Last synced at: 21 days ago - Pushed at: about 2 years ago - Stars: 17 - Forks: 0

ChaitanyaC22/Udacity-AWS-MLE-ND-Project2-Build-a-ML-Workflow-For-Scones-Unlimited-On-Amazon-SageMaker

The primary objective of this project was to build and deploy an image classification model for Scones Unlimited, a scone-delivery-focused logistic company, using AWS SageMaker.

Language: HTML - Size: 1.07 MB - Last synced at: about 2 months ago - Pushed at: over 2 years ago - Stars: 4 - Forks: 1

debajitadhikary/EmoVision

😊📸 Real-Time Facial Emotion Recognition using Deep Learning 🤖🧠

Language: Python - Size: 32.5 MB - Last synced at: 22 days ago - Pushed at: 22 days ago - Stars: 3 - Forks: 2

Harshitha0706/object-detection-yolov5

Object Detection Using YOLOv5: A machine learning project that leverages YOLOv5 for real-time object detection. This project covers dataset preprocessing, model training, and image inference using OpenCV and PyTorch."

Size: 16.6 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

itancio/churn

Language: Python - Size: 326 MB - Last synced at: about 1 month ago - Pushed at: 7 months ago - Stars: 1 - Forks: 0

AlvinHon/distributed-model-inference

Example distributed system for ML model inference by using Kafka, including spring boot REST+JPA server with Java consumer program

Language: Java - Size: 1.41 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0

SayamAlt/Financial-News-Sentiment-Analysis

Successfully developed a fine-tuned DistilBERT transformer model which can accurately predict the overall sentiment of a piece of financial news up to an accuracy of nearly 81.5%.

Language: Jupyter Notebook - Size: 745 KB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

kwame-mintah/gcp-cloud-run-function-model-inference

A cloud run function to invoke a prediction against a machine learning model that has been trained outside of a cloud provider.

Language: Python - Size: 134 KB - Last synced at: about 2 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

SayamAlt/Mental-Health-Classification-using-fine-tuned-DistilBERT

Successfully established a multiclass text classification model by fine-tuning pretrained DistilBERT transformer model to classify several distinct types of mental health statuses such as anxiety, stress, personality disorder, etc. with an accuracy of 77%.

Language: Jupyter Notebook - Size: 2.07 MB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

SayamAlt/Luxury-Apparel-Product-Category-Classification-using-fine-tuned-DistilBERT

Successfully developed a multiclass text classification model by fine-tuning pretrained DistilBERT transformer model to classify various distinct types of luxury apparels into their respective categories i.e. pants, accessories, underwear, shoes, etc.

Language: Jupyter Notebook - Size: 3.7 MB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

SayamAlt/Natural-Scenes-Image-Classification-using-CNNs

Successfully established an image classification model using PyTorch to classify the images of several distinct natural sceneries such as mountains, glaciers, forests, seas, streets and buildings with an accuracy of 86%.

Language: Jupyter Notebook - Size: 11.4 MB - Last synced at: 3 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

SayamAlt/Oral-Disease-Classification-using-CNN

Successfully developed an image classification model using PyTorch to classify two types of oral diseases, namely caries and gingivitis.

Language: Jupyter Notebook - Size: 77.6 MB - Last synced at: about 1 month ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

C-bianc/NER-task

Token classification for named entities

Language: Jupyter Notebook - Size: 3.37 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

SayamAlt/Global-Equity-Forecasting-using-LSTM

Successfully established an LSTM model to effectively forecast global equity based on over 20+ years of historical data of global equity.

Language: Jupyter Notebook - Size: 509 KB - Last synced at: about 1 month ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

SayamAlt/Wine-Cultivator-Classification-using-ANN

Successfully established an ANN model which can classify wine cultivators based on several characteristics of distinct wines.

Language: Jupyter Notebook - Size: 74.2 KB - Last synced at: about 2 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

santosh/image-classifier

POC of image classification using scikit-learn.

Language: Python - Size: 834 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

SayamAlt/Cyberbullying-Classification-using-fine-tuned-DistilBERT

Successfully fine-tuned a pretrained DistilBERT transformer model that can classify social media text data into one of 4 cyberbullying labels i.e. ethnicity/race, gender/sexual, religion and not cyberbullying with a remarkable accuracy of 99%.

Language: Jupyter Notebook - Size: 7.24 MB - Last synced at: 3 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

SayamAlt/English-to-Spanish-Language-Translation-using-Seq2Seq-and-Attention

Successfully established a Seq2Seq with attention model which can perform English to Spanish language translation up to an accuracy of almost 97%.

Language: Jupyter Notebook - Size: 1.18 MB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

SayamAlt/Global-News-Headlines-Text-Summarization

Successfully established a text summarization model using Seq2Seq modeling with Luong Attention, which can give a short and concise summary of the global news headlines.

Language: Jupyter Notebook - Size: 513 KB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

SayamAlt/Symptoms-Disease-Text-Classification

Successfully developed a fine-tuned BERT transformer model which can accurately classify symptoms to their corresponding diseases upto an accuracy of 89%.

Language: Jupyter Notebook - Size: 860 KB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

vinit714/--Deep-Learning-for-Fashion-MNIST--Accessory-Classification-Project

This repository contains Python code to classify fashion items using a Convolutional Neural Network (CNN) implemented with TensorFlow and Keras. It includes data preprocessing, model building, training, evaluation, and visualization of results.

Language: Jupyter Notebook - Size: 61.5 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

KrajShuffle/Classifying_SpeechAudio_CNN

CNN Based Approach for Audio File Classification. Contains Notebooks Illustrating Data Preprocessing, Feature Extraction, Model Training, & Model Inference Workflows & Overall Pipeline

Language: Jupyter Notebook - Size: 37.1 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

GauravG-20/Udacity-AWS-MLE-ND-Project2-Build-a-ML-Workflow-For-Scones-Unlimited-On-Amazon-SageMaker

The primary objective of this project was to build and deploy an image classification model for Scones Unlimited, a scone-delivery-focused logistic company, using AWS SageMaker.

Language: HTML - Size: 1.11 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

brian-kipkoech-tanui/sagemaker-ML-workflow

Image Classifiers are used in the field of computer vision to identify the content of an image and it is used across a broad variety of industries, from advanced technologies like autonomous vehicles and augmented reality, to eCommerce platforms, and even in diagnostic medicine.

Language: HTML - Size: 978 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

Related Topics
model-training-and-evaluation 11 deep-learning 9 text-tokenization 7 natural-language-processing 7 text-preprocessing 6 multiclass-classification 6 machine-learning 5 pytorch 5 image-classification 4 exploratory-data-analysis 4 llm 4 text-classification 4 data-preprocessing 4 distilbert-model 4 model-architecture-and-implementation 4 json 3 mlops 3 aws-s3 3 aws-lambda 3 mistral 3 aws 3 model-evaluation 3 convolutional-neural-networks 3 sagemaker-studio 3 computer-vision 3 sagemaker-deployment 3 python 3 llama 3 data-exploration-and-preprocessing 3 hugging-face-transformers 3 fine-tune-bert-tensorflow 3 model-deployment 2 llm-inference 2 cnn 2 nvidia 2 image-transformations 2 llmops 2 fine-tuning-bert 2 object-detection 2 fine-tuning 2 luong-attention 2 attention-mechanism 2 text-generation 2 aws-sagemaker 2 open-source-llm 2 llm-serving 2 lstm 2 python3 2 model-training 2 edge-computing 2 distilbert-fine-tuning 2 aws-ec2 2 aws-statemachine 2 aws-step-functions 2 endpoint 2 model-testing 2 tensorflow 2 streamlit 2 model-serving 2 npu 1 openvino 1 ipexllm 1 openvino-inference-engine 1 phi-3 1 gemma 1 data-augmentation 1 recurrent-neural-networks 1 time-series-datasets 1 time-series-forecasting 1 xavier-initialization 1 aws-iam 1 deployment 1 lambda-functions 1 binary-classification 1 data-loader 1 cuda-acceleration 1 deep-learning-framework 1 distributed-training 1 heterogeneous-computing 1 high-performance-computing 1 simd-optimization 1 data-visualization 1 multiclass-text-classification 1 aipc 1 cpu 1 directml 1 directx-12 1 word-embeddings 1 checkpoint 1 data-loading 1 data-preparation 1 data-storage 1 dataloader 1 mlsys 1 model-storage 1 storage-for-ai 1 storage-system 1 ai 1 cudnn 1 emotion-detection 1