GitHub / InftyAI / llmaz
☸️ Easy, advanced inference platform for large language models on Kubernetes. 🌟 Star to support our work!
JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/InftyAI%2Fllmaz
Stars: 125
Forks: 20
Open issues: 41
License: apache-2.0
Language: Go
Size: 6.46 MB
Dependencies parsed at: Pending
Created at: over 1 year ago
Updated at: about 2 hours ago
Pushed at: about 2 hours ago
Last synced at: 38 minutes ago
Topics: huggingface, inference, inference-platform, kubernetes, llamacpp, llm, modelscope, ollama, sglang, text-generation-inference, vllm