Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub / NVIDIA / TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

JSON API: https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/NVIDIA%2FTensorRT-LLM

Stars: 7,030
Forks: 751
Open Issues: 625

License: apache-2.0
Language: C++
Repo Size: 272 MB
Dependencies: 206

Created: 10 months ago
Updated: about 22 hours ago
Last pushed: about 18 hours ago
Last synced: about 18 hours ago

Files
    Loading...
    Readme
    Loading...
    Dependencies
    setup.py pypi