GitHub topics: ai-inference
blace-ai/blace-ai
Cross-platform c++ sdk & model hub for easy ai inference
Language: C++ - Size: 2.48 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 51 - Forks: 2

uxlfoundation/scikit-learn-intelex
Extension for Scikit-learn is a seamless way to speed up your Scikit-learn application
Language: Python - Size: 41.3 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 1,306 - Forks: 183

ayutaz/uPiper
Unity TTS plugin: Piper neural synthesis + OpenJTalk Japanese + Unity AI Inference Engine. Windows/Mac/Linux/Android ready. High-quality voices for games & apps.
Language: C# - Size: 408 MB - Last synced at: 2 days ago - Pushed at: 3 days ago - Stars: 2 - Forks: 0

nndeploy/nndeploy
Your Local AI Workflow | 你本地的AI工作流
Language: C++ - Size: 67.7 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 1,165 - Forks: 143

bentoml/BentoML
The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!
Language: Python - Size: 98.3 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 8,028 - Forks: 871

uxlfoundation/oneDAL
oneAPI Data Analytics Library (oneDAL)
Language: C++ - Size: 87.9 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 639 - Forks: 225

tinyBigGAMES/JetInfero
Local LLM Inference Library
Language: Pascal - Size: 10.2 MB - Last synced at: 11 days ago - Pushed at: 7 months ago - Stars: 12 - Forks: 3

redbco/infermesh
GPU-aware inference mesh for large-scale AI serving
Language: Rust - Size: 376 KB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 6 - Forks: 0

intel/dffml 📦
The easiest way to use Machine Learning. Mix and match underlying ML libraries and data set sources. Generate new datasets or modify existing ones with ease.
Language: Python - Size: 576 MB - Last synced at: 22 days ago - Pushed at: about 1 year ago - Stars: 255 - Forks: 138

arbitrary-number/arbitrary-number
Arbitrary Numbers
Language: Python - Size: 76.2 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

open-vela/apps_mlearning_tflite-micro
Customed version of Google's tflite-micro
Language: C++ - Size: 31.2 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 2 - Forks: 2

PREMSAI3717/professional-nano-vllm-enterprise
Professional nano-vLLM Enterprise enhances the original nano-vLLM, transforming it into a robust, production-ready LLM engine. Explore its features on GitHub! 🚀✨
Language: Python - Size: 81.1 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

hanax-ai/Citadel-Beta
Citadel AI OS – Enterprise AI Runtime Environment for Inference, Agents, and Business Operations
Language: JavaScript - Size: 342 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

calinux-py/UniUi
UniUi uses AI to allow you to talk directly to your system.
Language: JavaScript - Size: 229 KB - Last synced at: 28 days ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

dhanushk-offl/ai-inference-backend-boilerplate
A powerful, faster, scalable full-stack boilerplace for AI inference using Node.js, Python, Redis, and Docker
Language: JavaScript - Size: 0 Bytes - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

philips-software/go-hsdp-api 📦
Client library to interact with various APIs used within Philips in a simple and uniform way
Language: Go - Size: 2.96 MB - Last synced at: 10 days ago - Pushed at: 6 months ago - Stars: 32 - Forks: 11

superjamie/rocswap
llama.cpp + ROCm + llama-swap
Language: Dockerfile - Size: 28.3 KB - Last synced at: 5 months ago - Pushed at: 7 months ago - Stars: 14 - Forks: 1

valvebara/valvebara
No more Hugging Face cost leaks.
Language: TypeScript - Size: 2.29 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 3 - Forks: 1
