GitHub / efeslab / Nanoflow
A throughput-oriented high-performance serving framework for LLMs
JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/efeslab%2FNanoflow
PURL: pkg:github/efeslab/Nanoflow
Stars: 816
Forks: 37
Open issues: 12
License: None
Language: C++
Size: 32.6 MB
Dependencies parsed at: Pending
Created at: 12 months ago
Updated at: 2 months ago
Pushed at: 2 months ago
Last synced at: 2 months ago
Commit Stats
Commits: 38
Authors: 10
Mean commits per author: 3.8
Development Distribution Score: 0.684
More commit stats: https://commits.ecosyste.ms/hosts/GitHub/repositories/efeslab/Nanoflow
Topics: cuda, inference, llama2, llm, llm-serving, model-serving