GitHub / Lizonghang / prima.cpp
prima.cpp: Speeding up 70B-scale LLM inference on low-resource everyday home clusters
JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Lizonghang%2Fprima.cpp
PURL: pkg:github/Lizonghang/prima.cpp
Stars: 979
Forks: 67
Open issues: 5
License: mit
Language: C++
Size: 54.8 MB
Dependencies parsed at: Pending
Created at: 11 months ago
Updated at: about 2 months ago
Pushed at: about 2 months ago
Last synced at: about 2 months ago
Topics: distributed-ai, distributed-inference, llama-cpp, llm-inference, on-device-llms