Ecosyste.ms: Repos
An open API service providing repository metadata for many open source software ecosystems.
GitHub / NVIDIA 342 repositories
NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
Language: C++ - Size: 260 MB - Last synced: about 4 hours ago - Pushed: about 5 hours ago - Stars: 6,743 - Forks: 703
NVIDIA/gdrcopy
A fast GPU memory copy library based on NVIDIA GPUDirect RDMA technology
Language: C++ - Size: 696 KB - Last synced: about 8 hours ago - Pushed: about 8 hours ago - Stars: 781 - Forks: 139
NVIDIA/knavigator
knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes.
Language: Go - Size: 343 KB - Last synced: about 9 hours ago - Pushed: about 10 hours ago - Stars: 22 - Forks: 5
NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
Language: Python - Size: 8.13 MB - Last synced: about 14 hours ago - Pushed: about 15 hours ago - Stars: 8,771 - Forks: 1,963
NVIDIA/NeMo-Framework-Launcher
NeMo Megatron launcher and tools
Language: Python - Size: 27.3 MB - Last synced: about 15 hours ago - Pushed: about 16 hours ago - Stars: 397 - Forks: 115
NVIDIA/MatX
An efficient C++17 GPU numerical computing library with Python-like syntax
Language: C++ - Size: 32.8 MB - Last synced: about 15 hours ago - Pushed: about 16 hours ago - Stars: 1,116 - Forks: 73
NVIDIA/NeMo-Curator
Scalable toolkit for data curation
Language: Python - Size: 470 KB - Last synced: about 15 hours ago - Pushed: about 16 hours ago - Stars: 245 - Forks: 23
NVIDIA/workbench-example-hybrid-rag
An NVIDIA AI Workbench example project for Retrieval Augmented Generation (RAG)
Language: Python - Size: 22.7 MB - Last synced: about 15 hours ago - Pushed: about 16 hours ago - Stars: 40 - Forks: 98
NVIDIA/earth2studio
Open-source deep-learning framework for exploring, building and deploying AI weather/climate workflows.
Language: Python - Size: 103 MB - Last synced: about 16 hours ago - Pushed: about 17 hours ago - Stars: 15 - Forks: 4
NVIDIA/holodeck
Holodeck is a project to create test environments optimised for GPU projects.
Language: Go - Size: 14.5 MB - Last synced: about 2 hours ago - Pushed: about 17 hours ago - Stars: 5 - Forks: 3
NVIDIA/trt-llm-rag-windows
A developer reference project for creating Retrieval Augmented Generation (RAG) chatbots on Windows using TensorRT-LLM
Language: Python - Size: 33.4 MB - Last synced: about 6 hours ago - Pushed: about 7 hours ago - Stars: 2,356 - Forks: 252
NVIDIA/stdexec
`std::execution`, the proposed C++ framework for asynchronous and parallel programming.
Language: C++ - Size: 10.1 MB - Last synced: about 9 hours ago - Pushed: about 19 hours ago - Stars: 1,281 - Forks: 140
NVIDIA/JAX-Toolbox
JAX-Toolbox
Language: Python - Size: 4.01 MB - Last synced: about 6 hours ago - Pushed: about 6 hours ago - Stars: 179 - Forks: 34
NVIDIA/TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.
Language: Python - Size: 4.29 MB - Last synced: about 19 hours ago - Pushed: about 20 hours ago - Stars: 1,462 - Forks: 227
NVIDIA/NV-Kernels
Ubuntu kernels which are optimized for NVIDIA server systems
Language: C - Size: 2.89 GB - Last synced: about 21 hours ago - Pushed: about 22 hours ago - Stars: 6 - Forks: 3
NVIDIA/mods-kernel-driver
Linux driver for diagnostic software
Language: C - Size: 407 KB - Last synced: 1 day ago - Pushed: 1 day ago - Stars: 15 - Forks: 3
NVIDIA/TensorRT-Model-Optimizer
TensorRT Model Optimizer is a unified library of state-of-the-art model optimization techniques such as quantization and sparsity. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM or TensorRT to optimize inference speed on NVIDIA GPUs.
Language: Python - Size: 11.8 MB - Last synced: 1 day ago - Pushed: 1 day ago - Stars: 136 - Forks: 6
NVIDIA/gontainer
Dependency Injection container for Golang projects.
Language: Go - Size: 647 KB - Last synced: 1 day ago - Pushed: 1 day ago - Stars: 26 - Forks: 2
NVIDIA/k8s-kata-manager
Language: Go - Size: 8.87 MB - Last synced: 1 day ago - Pushed: 1 day ago - Stars: 15 - Forks: 2
NVIDIA/cloudai
CloudAI Benchmark Framework
Language: Python - Size: 113 KB - Last synced: about 1 hour ago - Pushed: about 1 hour ago - Stars: 6 - Forks: 5
NVIDIA/jetson-gpio
A Python library that enables the use of Jetson's GPIOs
Language: Python - Size: 189 KB - Last synced: 32 minutes ago - Pushed: 3 months ago - Stars: 863 - Forks: 251
NVIDIA/cuCollections
Language: C++ - Size: 5.24 MB - Last synced: 1 day ago - Pushed: 2 days ago - Stars: 419 - Forks: 73
NVIDIA/k8s-test-infra
K8s-test-infra
Language: Go - Size: 12.9 MB - Last synced: about 9 hours ago - Pushed: 2 days ago - Stars: 2 - Forks: 3
NVIDIA/mig-parted
MIG Partition Editor for NVIDIA GPUs
Language: Go - Size: 10.8 MB - Last synced: 2 days ago - Pushed: 2 days ago - Stars: 147 - Forks: 34
NVIDIA/edk2-redfish-client
NVIDIA fork of tianocore/edk2-redfish-client
Language: C - Size: 40.7 MB - Last synced: 2 days ago - Pushed: 2 days ago - Stars: 1 - Forks: 0
NVIDIA/open-gpu-kernel-modules
NVIDIA Linux open GPU kernel module source
Language: C - Size: 49.4 MB - Last synced: 2 days ago - Pushed: 5 days ago - Stars: 13,975 - Forks: 1,137
NVIDIA/MinkowskiEngine
Minkowski Engine is an auto-diff neural network library for high-dimensional sparse tensors
Language: Python - Size: 14.6 MB - Last synced: 1 day ago - Pushed: 2 months ago - Stars: 2,309 - Forks: 337
NVIDIA/openbmc Fork of openbmc/openbmc
OpenBMC Distribution
Size: 151 MB - Last synced: 3 days ago - Pushed: 3 days ago - Stars: 1 - Forks: 0
NVIDIA/modulus
Open-source deep-learning framework for building, training, and fine-tuning deep learning models using state-of-the-art Physics-ML methods
Language: Python - Size: 74.7 MB - Last synced: 3 days ago - Pushed: 3 days ago - Stars: 682 - Forks: 142
NVIDIA/NVFlare
NVIDIA Federated Learning Application Runtime Environment
Language: Python - Size: 38.1 MB - Last synced: 3 days ago - Pushed: 3 days ago - Stars: 544 - Forks: 145
NVIDIA/edk2
NVIDIA fork of tianocore/edk2
Language: C - Size: 291 MB - Last synced: 4 days ago - Pushed: 4 days ago - Stars: 17 - Forks: 13
NVIDIA/GenerativeAIExamples
Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.
Language: Python - Size: 22.1 MB - Last synced: 4 days ago - Pushed: 4 days ago - Stars: 1,570 - Forks: 251
NVIDIA/gpu-driver-container
The NVIDIA GPU driver container allows the provisioning of the NVIDIA driver through the use of containers.
Language: Shell - Size: 5.64 MB - Last synced: 4 days ago - Pushed: 4 days ago - Stars: 44 - Forks: 24
NVIDIA/ais-k8s
Kubernetes Operator, ansible playbooks, and production scripts for large-scale AIStore deployments on Kubernetes.
Language: Go - Size: 6.73 MB - Last synced: 4 days ago - Pushed: 4 days ago - Stars: 60 - Forks: 19
NVIDIA/framework-reproducibility
Providing reproducibility in deep learning frameworks
Language: Python - Size: 1.19 MB - Last synced: 2 days ago - Pushed: 7 months ago - Stars: 419 - Forks: 38
NVIDIA/air_agent
A Python agent for receiving instructions from the NVIDIA Air platform
Language: Python - Size: 267 KB - Last synced: 4 days ago - Pushed: 4 days ago - Stars: 1 - Forks: 3
NVIDIA/open-gpu-doc
Documentation of NVIDIA chip/hardware interfaces
Language: C - Size: 2.96 MB - Last synced: 4 days ago - Pushed: 4 days ago - Stars: 1,207 - Forks: 89
NVIDIA/cuQuantum
Home for cuQuantum Python & NVIDIA cuQuantum SDK C++ samples
Language: Jupyter Notebook - Size: 4.06 MB - Last synced: 4 days ago - Pushed: 7 days ago - Stars: 310 - Forks: 63
NVIDIA/DIGITS
Deep Learning GPU Training System
Language: HTML - Size: 48.8 MB - Last synced: 1 day ago - Pushed: 12 months ago - Stars: 4,114 - Forks: 1,378
NVIDIA/swift Fork of openstack/swift
OpenStack Storage (Swift). Mirror of code maintained at opendev.org.
Language: Python - Size: 70.8 MB - Last synced: 5 days ago - Pushed: 6 days ago - Stars: 8 - Forks: 4
NVIDIA/k8s-dra-driver
Dynamic Resource Allocation (DRA) for NVIDIA GPUs in Kubernetes
Language: Go - Size: 11.9 MB - Last synced: 5 days ago - Pushed: 6 days ago - Stars: 169 - Forks: 29
NVIDIA/dcgm-exporter
NVIDIA GPU metrics exporter for Prometheus leveraging DCGM
Language: Go - Size: 3.93 MB - Last synced: 5 days ago - Pushed: 5 days ago - Stars: 663 - Forks: 123
NVIDIA/DALI_deps
3rd party dependencies for DALI project
Language: Shell - Size: 288 KB - Last synced: 5 days ago - Pushed: 5 days ago - Stars: 10 - Forks: 21
NVIDIA/spark-rapids-tools
User tools for Spark RAPIDS
Language: Scala - Size: 12 MB - Last synced: 5 days ago - Pushed: 5 days ago - Stars: 40 - Forks: 33
NVIDIA/vgpu-device-manager
NVIDIA vGPU Device Manager manages NVIDIA vGPU devices on top of Kubernetes
Language: Go - Size: 10.9 MB - Last synced: 13 days ago - Pushed: 17 days ago - Stars: 76 - Forks: 13
NVIDIA/edk2-edkrepo-manifest
NVIDIA fork of tianocore/edk2-edkrepo-manifest
Size: 37.1 KB - Last synced: 5 days ago - Pushed: 5 days ago - Stars: 5 - Forks: 4
NVIDIA/mlperf-common
NVIDIA's launch, startup, and logging scripts used by our MLPerf Training and HPC submissions
Language: Shell - Size: 42 KB - Last synced: 5 days ago - Pushed: 5 days ago - Stars: 19 - Forks: 8
NVIDIA/k8s-driver-manager
The NVIDIA Driver Manager is a Kubernetes component which assist in seamless upgrades of NVIDIA Driver on each node of the cluster.
Language: Shell - Size: 10.5 MB - Last synced: 5 days ago - Pushed: 5 days ago - Stars: 28 - Forks: 7
NVIDIA/k8s-cc-manager
The NVIDIA CC Manager is a Kubernetes component that will enable required CC mode on supported NVIDIA GPUs
Language: Shell - Size: 11 MB - Last synced: 5 days ago - Pushed: 5 days ago - Stars: 5 - Forks: 4
NVIDIA/flownet2-pytorch
Pytorch implementation of FlowNet 2.0: Evolution of Optical Flow Estimation with Deep Networks
Language: Python - Size: 6.14 MB - Last synced: 5 days ago - Pushed: 12 months ago - Stars: 3,064 - Forks: 737
NVIDIA/kubectl-nv
Kubectl NV plugin, a tool for managing NVIDIA objects on a kubernetes cluster.
Language: Go - Size: 10.8 MB - Last synced: 5 days ago - Pushed: 5 days ago - Stars: 3 - Forks: 1
NVIDIA/enroot
A simple yet powerful tool to turn traditional container/OS images into unprivileged sandboxes.
Language: Shell - Size: 446 KB - Last synced: 5 days ago - Pushed: 6 days ago - Stars: 554 - Forks: 90
NVIDIA/edk2-nvidia
NVIDIA EDK2 platform support
Language: C - Size: 8.39 MB - Last synced: 4 days ago - Pushed: 5 days ago - Stars: 67 - Forks: 32
NVIDIA/edk2-nvidia-non-osi
NVIDIA EDK2 non-OSI licensed content
Language: BitBake - Size: 877 KB - Last synced: 5 days ago - Pushed: 5 days ago - Stars: 2 - Forks: 2
NVIDIA/spdm
Implementation of the SPDM protocol
Language: C++ - Size: 828 KB - Last synced: 6 days ago - Pushed: 11 days ago - Stars: 1 - Forks: 0
NVIDIA/remote-media
Remotely mount images for the host through the BMC
Language: C++ - Size: 36.1 KB - Last synced: 6 days ago - Pushed: 11 days ago - Stars: 1 - Forks: 0
NVIDIA/nvidia-tal
Telemetry abstraction layer
Language: C++ - Size: 38.1 KB - Last synced: 6 days ago - Pushed: 11 days ago - Stars: 1 - Forks: 0
NVIDIA/nvidia-code-mgmt
Non-PLDM firmware update infrastructure
Language: C++ - Size: 213 KB - Last synced: 6 days ago - Pushed: 7 days ago - Stars: 1 - Forks: 0
NVIDIA/libnvme
Implementation of the NVMe protocol
Language: C - Size: 6.28 MB - Last synced: 6 days ago - Pushed: 11 days ago - Stars: 1 - Forks: 0
NVIDIA/nvidia-ipmi-oem
Implementation of Nvidia OEM IPMI commands
Language: C++ - Size: 327 KB - Last synced: 6 days ago - Pushed: 11 days ago - Stars: 1 - Forks: 0
NVIDIA/cper-decoder
Converts CPERs to JSON
Language: C++ - Size: 216 KB - Last synced: 6 days ago - Pushed: 11 days ago - Stars: 1 - Forks: 0
NVIDIA/nvbmc-docs
Documentation for Nvidia OpenBMC stack
Language: TeX - Size: 4.11 MB - Last synced: 6 days ago - Pushed: 7 days ago - Stars: 1 - Forks: 0
NVIDIA/thrust 📦
[ARCHIVED] The C++ parallel algorithms library. See https://github.com/NVIDIA/cccl
Language: C++ - Size: 17 MB - Last synced: about 1 month ago - Pushed: 3 months ago - Stars: 4,845 - Forks: 757
NVIDIA/cudnn-frontend
cudnn_frontend provides a c++ wrapper for the cudnn backend API and samples on how to use it
Language: C++ - Size: 35 MB - Last synced: 5 days ago - Pushed: 7 days ago - Stars: 312 - Forks: 65
NVIDIA/waveglow
A Flow-based Generative Network for Speech Synthesis
Language: Python - Size: 427 KB - Last synced: 5 days ago - Pushed: 7 months ago - Stars: 2,222 - Forks: 527
NVIDIA/go-gpuallocator
Go Abstraction for Allocating NVIDIA GPUs with Custom Policies
Language: Go - Size: 748 KB - Last synced: 6 days ago - Pushed: 6 days ago - Stars: 95 - Forks: 21
NVIDIA/warp
A Python framework for high performance GPU simulation and graphics
Language: Python - Size: 36.4 MB - Last synced: 6 days ago - Pushed: 6 days ago - Stars: 1,697 - Forks: 142
NVIDIA/cutlass
CUDA Templates for Linear Algebra Subroutines
Language: C++ - Size: 41.7 MB - Last synced: 6 days ago - Pushed: 6 days ago - Stars: 4,610 - Forks: 800
NVIDIA/libcudacxx 📦
[ARCHIVED] The C++ Standard Library for your entire system. See https://github.com/NVIDIA/cccl
Language: C++ - Size: 11.9 MB - Last synced: about 15 hours ago - Pushed: 3 months ago - Stars: 2,288 - Forks: 188
NVIDIA/cub 📦
[ARCHIVED] Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl
Language: Cuda - Size: 17.5 MB - Last synced: 6 days ago - Pushed: 7 months ago - Stars: 1,650 - Forks: 444
NVIDIA/TensorRT
NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.
Language: C++ - Size: 113 MB - Last synced: 7 days ago - Pushed: 7 days ago - Stars: 9,159 - Forks: 1,978
NVIDIA/nvidia-installer
NVIDIA driver installer
Language: C - Size: 2.03 MB - Last synced: 5 days ago - Pushed: 5 days ago - Stars: 124 - Forks: 27
NVIDIA/egl-wayland
The EGLStream-based Wayland external platform
Language: C - Size: 331 KB - Last synced: 5 days ago - Pushed: 25 days ago - Stars: 262 - Forks: 41
NVIDIA/go-nvml
Go Bindings for the NVIDIA Management Library (NVML)
Language: C - Size: 367 KB - Last synced: 13 days ago - Pushed: 18 days ago - Stars: 252 - Forks: 55
NVIDIA/jitify
A single-header C++ library for simplifying the use of CUDA Runtime Compilation (NVRTC).
Language: C++ - Size: 1010 KB - Last synced: 6 days ago - Pushed: 6 days ago - Stars: 500 - Forks: 64
NVIDIA/numbast
Numbast is a tool to build an automated pipeline that converts CUDA APIs into Numba bindings.
Language: Python - Size: 5.64 MB - Last synced: 6 days ago - Pushed: 6 days ago - Stars: 12 - Forks: 3
NVIDIA/gpu-operator
NVIDIA GPU Operator creates/configures/manages GPUs atop Kubernetes
Language: Go - Size: 83.7 MB - Last synced: 6 days ago - Pushed: 7 days ago - Stars: 1,178 - Forks: 227
NVIDIA/build-system-archive-import-examples
Examples for importing precompiled binary tarball and zip archives into various build and packaging systems
Language: Python - Size: 32.2 KB - Last synced: 7 days ago - Pushed: 7 days ago - Stars: 8 - Forks: 5
NVIDIA/pldm Fork of openbmc/pldm
Size: 3.73 MB - Last synced: 7 days ago - Pushed: 7 days ago - Stars: 1 - Forks: 0
NVIDIA/kata-containers Fork of kata-containers/kata-containers
Kata containers is an implementation of lightweight Virtual Machines (VMs) that feel and perform like containers, but provide the workload isolation and security advantages of VMs.
Size: 80.3 MB - Last synced: 7 days ago - Pushed: 7 days ago - Stars: 2 - Forks: 1
NVIDIA/go-nvlib
A collection of useful Go libraries for use with NVIDIA GPU management tools
Language: Go - Size: 1.12 MB - Last synced: 13 days ago - Pushed: 16 days ago - Stars: 17 - Forks: 9
NVIDIA/nvidia-docker 📦
Build and run Docker containers leveraging NVIDIA GPUs
Size: 18.9 MB - Last synced: 5 days ago - Pushed: 5 months ago - Stars: 17,093 - Forks: 2,031
NVIDIA/spark-rapids-jni
RAPIDS Accelerator JNI For Apache Spark
Language: Cuda - Size: 2.18 MB - Last synced: 9 days ago - Pushed: 10 days ago - Stars: 30 - Forks: 54
NVIDIA/spark-rapids
Spark RAPIDS plugin - accelerate Apache Spark with GPUs
Language: Scala - Size: 52.4 MB - Last synced: 9 days ago - Pushed: 11 days ago - Stars: 722 - Forks: 214
NVIDIA/linux Fork of openbmc/linux
OpenBMC Linux kernel source tree
Size: 4.82 GB - Last synced: 7 days ago - Pushed: 7 days ago - Stars: 1 - Forks: 0
NVIDIA/webui-vue Fork of openbmc/webui-vue
Web-based user interface built on Vue.js for managing OpenBMC systems
Size: 5.42 MB - Last synced: 8 days ago - Pushed: 8 days ago - Stars: 1 - Forks: 0
NVIDIA/bios-settings-mgr Fork of openbmc/bios-settings-mgr
Size: 59.6 KB - Last synced: 8 days ago - Pushed: 8 days ago - Stars: 1 - Forks: 0
NVIDIA/cuda-python
CUDA Python Low-level Bindings
Language: Python - Size: 6.45 MB - Last synced: 5 days ago - Pushed: 2 months ago - Stars: 775 - Forks: 59
NVIDIA/Q2RTX
NVIDIA’s implementation of RTX ray-tracing in Quake II
Language: C - Size: 9.68 MB - Last synced: 8 days ago - Pushed: 12 days ago - Stars: 1,200 - Forks: 176
NVIDIA/nvbench
CUDA Kernel Benchmarking Library
Language: Cuda - Size: 1010 KB - Last synced: 9 days ago - Pushed: 10 days ago - Stars: 419 - Forks: 59
NVIDIA/RAD-MMM
A TTS model that makes a speaker speak new languages
Language: Roff - Size: 16.9 MB - Last synced: 8 days ago - Pushed: 8 days ago - Stars: 66 - Forks: 6
NVIDIA/NeMo-text-processing
NeMo text processing for ASR and TTS
Language: Python - Size: 24.5 MB - Last synced: about 19 hours ago - Pushed: about 20 hours ago - Stars: 217 - Forks: 67
NVIDIA/NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Language: Python - Size: 241 MB - Last synced: 12 days ago - Pushed: 12 days ago - Stars: 10,121 - Forks: 2,154
NVIDIA/libnvidia-container
NVIDIA container runtime library
Language: C - Size: 2.21 GB - Last synced: 9 days ago - Pushed: 9 days ago - Stars: 726 - Forks: 178
NVIDIA/nv-cloud-function-helpers
Functions that simplify common tasks with NVIDIA Cloud Functions
Language: Python - Size: 9.35 MB - Last synced: 11 days ago - Pushed: 11 days ago - Stars: 6 - Forks: 2
NVIDIA/clara-viz
NVIDIA Clara Viz is a platform for visualization of 2D/3D medical imaging data
Language: C++ - Size: 62.2 MB - Last synced: 3 days ago - Pushed: about 1 month ago - Stars: 58 - Forks: 13
NVIDIA/nvidia-container-toolkit
Build and run containers leveraging NVIDIA GPUs
Language: Go - Size: 9.42 MB - Last synced: 13 days ago - Pushed: 16 days ago - Stars: 1,515 - Forks: 174
NVIDIA/k8s-device-plugin
NVIDIA device plugin for Kubernetes
Language: Go - Size: 61.8 MB - Last synced: 13 days ago - Pushed: 14 days ago - Stars: 2,416 - Forks: 566
NVIDIA/cloud-native-docs
Documentation repository for NVIDIA Cloud Native Technologies
Language: CSS - Size: 23.8 MB - Last synced: 13 days ago - Pushed: 13 days ago - Stars: 11 - Forks: 12
NVIDIA/CUDALibrarySamples
CUDA Library Samples
Language: Cuda - Size: 21 MB - Last synced: 11 days ago - Pushed: 12 days ago - Stars: 1,233 - Forks: 275