An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: ai-inference

blace-ai/blace-ai

Cross-platform C++ SDK & model hub for easy AI inference

Language: C++ - Size: 2.48 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 51 - Forks: 2

uxlfoundation/scikit-learn-intelex

Extension for Scikit-learn is a seamless way to speed up your Scikit-learn application

Language: Python - Size: 41.3 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 1,306 - Forks: 183

ayutaz/uPiper

Unity TTS plugin: Piper neural synthesis + OpenJTalk Japanese + Unity AI Inference Engine. Windows/Mac/Linux/Android ready. High-quality voices for games & apps.

Language: C# - Size: 408 MB - Last synced at: 2 days ago - Pushed at: 3 days ago - Stars: 2 - Forks: 0

nndeploy/nndeploy

Your Local AI Workflow

Language: C++ - Size: 67.7 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 1,165 - Forks: 143

bentoml/BentoML

The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!

Language: Python - Size: 98.3 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 8,028 - Forks: 871

uxlfoundation/oneDAL

oneAPI Data Analytics Library (oneDAL)

Language: C++ - Size: 87.9 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 639 - Forks: 225

tinyBigGAMES/JetInfero

Local LLM Inference Library

Language: Pascal - Size: 10.2 MB - Last synced at: 11 days ago - Pushed at: 7 months ago - Stars: 12 - Forks: 3

redbco/infermesh

GPU-aware inference mesh for large-scale AI serving

Language: Rust - Size: 376 KB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 6 - Forks: 0

intel/dffml 📦

The easiest way to use Machine Learning. Mix and match underlying ML libraries and data set sources. Generate new datasets or modify existing ones with ease.

Language: Python - Size: 576 MB - Last synced at: 22 days ago - Pushed at: about 1 year ago - Stars: 255 - Forks: 138

arbitrary-number/arbitrary-number

Arbitrary Numbers

Language: Python - Size: 76.2 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

open-vela/apps_mlearning_tflite-micro

Customized version of Google's tflite-micro

Language: C++ - Size: 31.2 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 2 - Forks: 2

PREMSAI3717/professional-nano-vllm-enterprise

Professional nano-vLLM Enterprise enhances the original nano-vLLM, transforming it into a robust, production-ready LLM engine. Explore its features on GitHub! 🚀✨

Language: Python - Size: 81.1 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

hanax-ai/Citadel-Beta

Citadel AI OS – Enterprise AI Runtime Environment for Inference, Agents, and Business Operations

Language: JavaScript - Size: 342 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

calinux-py/UniUi

UniUi uses AI to allow you to talk directly to your system.

Language: JavaScript - Size: 229 KB - Last synced at: 28 days ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

dhanushk-offl/ai-inference-backend-boilerplate

A powerful, faster, scalable full-stack boilerplate for AI inference using Node.js, Python, Redis, and Docker

Language: JavaScript - Size: 0 Bytes - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

philips-software/go-hsdp-api 📦

Client library to interact with various APIs used within Philips in a simple and uniform way

Language: Go - Size: 2.96 MB - Last synced at: 10 days ago - Pushed at: 6 months ago - Stars: 32 - Forks: 11

superjamie/rocswap

llama.cpp + ROCm + llama-swap

Language: Dockerfile - Size: 28.3 KB - Last synced at: 5 months ago - Pushed at: 7 months ago - Stars: 14 - Forks: 1

valvebara/valvebara

No more Hugging Face cost leaks.

Language: TypeScript - Size: 2.29 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 3 - Forks: 1