GitHub / VectorInstitute / vector-inference
Efficient LLM inference on Slurm clusters using vLLM.
JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/VectorInstitute%2Fvector-inference
Stars: 58
Forks: 10
Open issues: 8
License: mit
Language: Python
Size: 2.59 MB
Dependencies parsed at: Pending
Created at: about 1 year ago
Updated at: 6 days ago
Pushed at: 3 days ago
Last synced at: 3 days ago
Topics: inference, llm, llm-inference, reward-modeling, text-embedding, vllm, vlm
Loading...