An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: ai-inference

bentoml/BentoML

The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!

Language: Python - Size: 97.6 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 7,895 - Forks: 855

uxlfoundation/scikit-learn-intelex

Extension for Scikit-learn is a seamless way to speed up your Scikit-learn application

Language: Python - Size: 40.7 MB - Last synced at: 4 days ago - Pushed at: 6 days ago - Stars: 1,298 - Forks: 183

uxlfoundation/oneDAL

oneAPI Data Analytics Library (oneDAL)

Language: C++ - Size: 87.7 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 636 - Forks: 224

PREMSAI3717/professional-nano-vllm-enterprise

Professional nano-vLLM Enterprise enhances the original nano-vLLM, transforming it into a robust, production-ready LLM engine. Explore its features on GitHub! 🚀✨

Language: Python - Size: 81.1 KB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 0 - Forks: 0

hanax-ai/Citadel-Beta

Citadel AI OS – Enterprise AI Runtime Environment for Inference, Agents, and Business Operations

Language: JavaScript - Size: 342 KB - Last synced at: 18 days ago - Pushed at: 19 days ago - Stars: 0 - Forks: 0

intel/dffml 📦

The easiest way to use Machine Learning. Mix and match underlying ML libraries and data set sources. Generate new datasets or modify existing ones with ease.

Language: Python - Size: 576 MB - Last synced at: 1 day ago - Pushed at: 11 months ago - Stars: 253 - Forks: 138

calinux-py/UniUi

UniUi uses AI to allow you to talk directly to your system.

Language: JavaScript - Size: 0 Bytes - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

dhanushk-offl/ai-inference-backend-boilerplate

A powerful, fast, scalable full-stack boilerplate for AI inference using Node.js, Python, Redis, and Docker

Language: JavaScript - Size: 0 Bytes - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

open-vela/apps_mlearning_tflite-micro

Customized version of Google's tflite-micro

Language: C++ - Size: 31.2 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 2 - Forks: 2

blace-ai/blace-ai

Cross-platform C++ SDK & model hub for easy AI inference

Language: C++ - Size: 1.82 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 4 - Forks: 1

tinyBigGAMES/JetInfero

Local LLM Inference Library

Language: Pascal - Size: 10.2 MB - Last synced at: 14 days ago - Pushed at: 6 months ago - Stars: 12 - Forks: 3

philips-software/go-hsdp-api 📦

Client library to interact with various APIs used within Philips in a simple and uniform way

Language: Go - Size: 2.96 MB - Last synced at: 14 days ago - Pushed at: 4 months ago - Stars: 32 - Forks: 11

superjamie/rocswap

llama.cpp + ROCm + llama-swap

Language: Dockerfile - Size: 28.3 KB - Last synced at: 3 months ago - Pushed at: 6 months ago - Stars: 14 - Forks: 1

valvebara/valvebara

No more Hugging Face cost leaks.

Language: TypeScript - Size: 2.29 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 3 - Forks: 1