GitHub / elotl / GenAI-infra-stack
Deployment of RAG + LLM model serving on multiple K8s cloud clusters
JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/elotl%2FGenAI-infra-stack
PURL: pkg:github/elotl/GenAI-infra-stack
Fork of loftyoutcome/k8s-rag-llm
Stars: 8
Forks: 0
Open issues: 14
License: None
Language: Python
Size: 789 KB
Dependencies parsed at: Pending
Created at: about 1 year ago
Updated at: 3 months ago
Pushed at: about 2 months ago
Last synced at: about 2 months ago
Topics: autoscaling, genai, genai-chatbot, llm-inference, llm-ops, luna