NVIDIA | GitHub owners | Ecosyste.ms: Repos

NVIDIA/TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

Language: C++ - Size: 260 MB - Last synced: about 4 hours ago - Pushed: about 5 hours ago - Stars: 6,743 - Forks: 703

NVIDIA/gdrcopy

A fast GPU memory copy library based on NVIDIA GPUDirect RDMA technology

Language: C++ - Size: 696 KB - Last synced: about 8 hours ago - Pushed: about 8 hours ago - Stars: 781 - Forks: 139

NVIDIA/knavigator

knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes.

Language: Go - Size: 343 KB - Last synced: about 9 hours ago - Pushed: about 10 hours ago - Stars: 22 - Forks: 5

NVIDIA/Megatron-LM

Ongoing research training transformer models at scale

Language: Python - Size: 8.13 MB - Last synced: about 14 hours ago - Pushed: about 15 hours ago - Stars: 8,771 - Forks: 1,963

NVIDIA/NeMo-Framework-Launcher

NeMo Megatron launcher and tools

Language: Python - Size: 27.3 MB - Last synced: about 15 hours ago - Pushed: about 16 hours ago - Stars: 397 - Forks: 115

NVIDIA/MatX

An efficient C++17 GPU numerical computing library with Python-like syntax

Language: C++ - Size: 32.8 MB - Last synced: about 15 hours ago - Pushed: about 16 hours ago - Stars: 1,116 - Forks: 73

NVIDIA/NeMo-Curator

Scalable toolkit for data curation

Language: Python - Size: 470 KB - Last synced: about 15 hours ago - Pushed: about 16 hours ago - Stars: 245 - Forks: 23

NVIDIA/workbench-example-hybrid-rag

An NVIDIA AI Workbench example project for Retrieval Augmented Generation (RAG)

Language: Python - Size: 22.7 MB - Last synced: about 15 hours ago - Pushed: about 16 hours ago - Stars: 40 - Forks: 98

NVIDIA/earth2studio

Open-source deep-learning framework for exploring, building and deploying AI weather/climate workflows.

Language: Python - Size: 103 MB - Last synced: about 16 hours ago - Pushed: about 17 hours ago - Stars: 15 - Forks: 4

NVIDIA/holodeck

Holodeck is a project to create test environments optimised for GPU projects.

Language: Go - Size: 14.5 MB - Last synced: about 2 hours ago - Pushed: about 17 hours ago - Stars: 5 - Forks: 3

NVIDIA/trt-llm-rag-windows

A developer reference project for creating Retrieval Augmented Generation (RAG) chatbots on Windows using TensorRT-LLM

Language: Python - Size: 33.4 MB - Last synced: about 6 hours ago - Pushed: about 7 hours ago - Stars: 2,356 - Forks: 252

NVIDIA/stdexec

`std::execution`, the proposed C++ framework for asynchronous and parallel programming.

Language: C++ - Size: 10.1 MB - Last synced: about 9 hours ago - Pushed: about 19 hours ago - Stars: 1,281 - Forks: 140

NVIDIA/JAX-Toolbox

JAX-Toolbox

Language: Python - Size: 4.01 MB - Last synced: about 6 hours ago - Pushed: about 6 hours ago - Stars: 179 - Forks: 34

NVIDIA/TransformerEngine

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.

Language: Python - Size: 4.29 MB - Last synced: about 19 hours ago - Pushed: about 20 hours ago - Stars: 1,462 - Forks: 227

NVIDIA/NV-Kernels

Ubuntu kernels which are optimized for NVIDIA server systems

Language: C - Size: 2.89 GB - Last synced: about 21 hours ago - Pushed: about 22 hours ago - Stars: 6 - Forks: 3

NVIDIA/mods-kernel-driver

Linux driver for diagnostic software

Language: C - Size: 407 KB - Last synced: 1 day ago - Pushed: 1 day ago - Stars: 15 - Forks: 3

NVIDIA/TensorRT-Model-Optimizer

TensorRT Model Optimizer is a unified library of state-of-the-art model optimization techniques such as quantization and sparsity. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM or TensorRT to optimize inference speed on NVIDIA GPUs.

Language: Python - Size: 11.8 MB - Last synced: 1 day ago - Pushed: 1 day ago - Stars: 136 - Forks: 6

NVIDIA/gontainer

Dependency Injection container for Golang projects.

Language: Go - Size: 647 KB - Last synced: 1 day ago - Pushed: 1 day ago - Stars: 26 - Forks: 2

NVIDIA/k8s-kata-manager

Language: Go - Size: 8.87 MB - Last synced: 1 day ago - Pushed: 1 day ago - Stars: 15 - Forks: 2

NVIDIA/cloudai

CloudAI Benchmark Framework

Language: Python - Size: 113 KB - Last synced: about 1 hour ago - Pushed: about 1 hour ago - Stars: 6 - Forks: 5

NVIDIA/jetson-gpio

A Python library that enables the use of Jetson's GPIOs

Language: Python - Size: 189 KB - Last synced: 32 minutes ago - Pushed: 3 months ago - Stars: 863 - Forks: 251

NVIDIA/cuCollections

Language: C++ - Size: 5.24 MB - Last synced: 1 day ago - Pushed: 2 days ago - Stars: 419 - Forks: 73

NVIDIA/k8s-test-infra

K8s-test-infra

Language: Go - Size: 12.9 MB - Last synced: about 9 hours ago - Pushed: 2 days ago - Stars: 2 - Forks: 3

NVIDIA/mig-parted

MIG Partition Editor for NVIDIA GPUs

Language: Go - Size: 10.8 MB - Last synced: 2 days ago - Pushed: 2 days ago - Stars: 147 - Forks: 34

NVIDIA/edk2-redfish-client

NVIDIA fork of tianocore/edk2-redfish-client

Language: C - Size: 40.7 MB - Last synced: 2 days ago - Pushed: 2 days ago - Stars: 1 - Forks: 0

NVIDIA/open-gpu-kernel-modules

NVIDIA Linux open GPU kernel module source

Language: C - Size: 49.4 MB - Last synced: 2 days ago - Pushed: 5 days ago - Stars: 13,975 - Forks: 1,137

NVIDIA/MinkowskiEngine

Minkowski Engine is an auto-diff neural network library for high-dimensional sparse tensors

Language: Python - Size: 14.6 MB - Last synced: 1 day ago - Pushed: 2 months ago - Stars: 2,309 - Forks: 337

NVIDIA/openbmc Fork of openbmc/openbmc

OpenBMC Distribution

Size: 151 MB - Last synced: 3 days ago - Pushed: 3 days ago - Stars: 1 - Forks: 0

NVIDIA/modulus

Open-source deep-learning framework for building, training, and fine-tuning deep learning models using state-of-the-art Physics-ML methods

Language: Python - Size: 74.7 MB - Last synced: 3 days ago - Pushed: 3 days ago - Stars: 682 - Forks: 142

NVIDIA/NVFlare

NVIDIA Federated Learning Application Runtime Environment

Language: Python - Size: 38.1 MB - Last synced: 3 days ago - Pushed: 3 days ago - Stars: 544 - Forks: 145

NVIDIA/edk2

NVIDIA fork of tianocore/edk2

Language: C - Size: 291 MB - Last synced: 4 days ago - Pushed: 4 days ago - Stars: 17 - Forks: 13

NVIDIA/GenerativeAIExamples

Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.

Language: Python - Size: 22.1 MB - Last synced: 4 days ago - Pushed: 4 days ago - Stars: 1,570 - Forks: 251

NVIDIA/gpu-driver-container

The NVIDIA GPU driver container allows the provisioning of the NVIDIA driver through the use of containers.

Language: Shell - Size: 5.64 MB - Last synced: 4 days ago - Pushed: 4 days ago - Stars: 44 - Forks: 24

NVIDIA/ais-k8s

Kubernetes Operator, Ansible playbooks, and production scripts for large-scale AIStore deployments on Kubernetes.

Language: Go - Size: 6.73 MB - Last synced: 4 days ago - Pushed: 4 days ago - Stars: 60 - Forks: 19

NVIDIA/framework-reproducibility

Providing reproducibility in deep learning frameworks

Language: Python - Size: 1.19 MB - Last synced: 2 days ago - Pushed: 7 months ago - Stars: 419 - Forks: 38

NVIDIA/air_agent

A Python agent for receiving instructions from the NVIDIA Air platform

Language: Python - Size: 267 KB - Last synced: 4 days ago - Pushed: 4 days ago - Stars: 1 - Forks: 3

NVIDIA/open-gpu-doc

Documentation of NVIDIA chip/hardware interfaces

Language: C - Size: 2.96 MB - Last synced: 4 days ago - Pushed: 4 days ago - Stars: 1,207 - Forks: 89

NVIDIA/cuQuantum

Home for cuQuantum Python & NVIDIA cuQuantum SDK C++ samples

Language: Jupyter Notebook - Size: 4.06 MB - Last synced: 4 days ago - Pushed: 7 days ago - Stars: 310 - Forks: 63

NVIDIA/DIGITS

Deep Learning GPU Training System

Language: HTML - Size: 48.8 MB - Last synced: 1 day ago - Pushed: 12 months ago - Stars: 4,114 - Forks: 1,378

NVIDIA/swift Fork of openstack/swift

OpenStack Storage (Swift). Mirror of code maintained at opendev.org.

Language: Python - Size: 70.8 MB - Last synced: 5 days ago - Pushed: 6 days ago - Stars: 8 - Forks: 4

NVIDIA/k8s-dra-driver

Dynamic Resource Allocation (DRA) for NVIDIA GPUs in Kubernetes

Language: Go - Size: 11.9 MB - Last synced: 5 days ago - Pushed: 6 days ago - Stars: 169 - Forks: 29

NVIDIA/dcgm-exporter

NVIDIA GPU metrics exporter for Prometheus leveraging DCGM

Language: Go - Size: 3.93 MB - Last synced: 5 days ago - Pushed: 5 days ago - Stars: 663 - Forks: 123

NVIDIA/DALI_deps

3rd party dependencies for DALI project

Language: Shell - Size: 288 KB - Last synced: 5 days ago - Pushed: 5 days ago - Stars: 10 - Forks: 21

NVIDIA/spark-rapids-tools

User tools for Spark RAPIDS

Language: Scala - Size: 12 MB - Last synced: 5 days ago - Pushed: 5 days ago - Stars: 40 - Forks: 33

NVIDIA/vgpu-device-manager

NVIDIA vGPU Device Manager manages NVIDIA vGPU devices on top of Kubernetes

Language: Go - Size: 10.9 MB - Last synced: 13 days ago - Pushed: 17 days ago - Stars: 76 - Forks: 13

NVIDIA/edk2-edkrepo-manifest

NVIDIA fork of tianocore/edk2-edkrepo-manifest

Size: 37.1 KB - Last synced: 5 days ago - Pushed: 5 days ago - Stars: 5 - Forks: 4

NVIDIA/mlperf-common

NVIDIA's launch, startup, and logging scripts used by our MLPerf Training and HPC submissions

Language: Shell - Size: 42 KB - Last synced: 5 days ago - Pushed: 5 days ago - Stars: 19 - Forks: 8

NVIDIA/k8s-driver-manager

The NVIDIA Driver Manager is a Kubernetes component that assists in seamless upgrades of the NVIDIA driver on each node of the cluster.

Language: Shell - Size: 10.5 MB - Last synced: 5 days ago - Pushed: 5 days ago - Stars: 28 - Forks: 7

NVIDIA/k8s-cc-manager

The NVIDIA CC Manager is a Kubernetes component that enables the required CC mode on supported NVIDIA GPUs

Language: Shell - Size: 11 MB - Last synced: 5 days ago - Pushed: 5 days ago - Stars: 5 - Forks: 4

NVIDIA/flownet2-pytorch

PyTorch implementation of FlowNet 2.0: Evolution of Optical Flow Estimation with Deep Networks

Language: Python - Size: 6.14 MB - Last synced: 5 days ago - Pushed: 12 months ago - Stars: 3,064 - Forks: 737

NVIDIA/kubectl-nv

Kubectl NV plugin, a tool for managing NVIDIA objects on a Kubernetes cluster.

Language: Go - Size: 10.8 MB - Last synced: 5 days ago - Pushed: 5 days ago - Stars: 3 - Forks: 1

NVIDIA/enroot

A simple yet powerful tool to turn traditional container/OS images into unprivileged sandboxes.

Language: Shell - Size: 446 KB - Last synced: 5 days ago - Pushed: 6 days ago - Stars: 554 - Forks: 90

NVIDIA/edk2-nvidia

NVIDIA EDK2 platform support

Language: C - Size: 8.39 MB - Last synced: 4 days ago - Pushed: 5 days ago - Stars: 67 - Forks: 32

NVIDIA/edk2-nvidia-non-osi

NVIDIA EDK2 non-OSI licensed content

Language: BitBake - Size: 877 KB - Last synced: 5 days ago - Pushed: 5 days ago - Stars: 2 - Forks: 2

NVIDIA/spdm

Implementation of the SPDM protocol

Language: C++ - Size: 828 KB - Last synced: 6 days ago - Pushed: 11 days ago - Stars: 1 - Forks: 0

NVIDIA/remote-media

Remotely mount images for the host through the BMC

Language: C++ - Size: 36.1 KB - Last synced: 6 days ago - Pushed: 11 days ago - Stars: 1 - Forks: 0

NVIDIA/nvidia-tal

Telemetry abstraction layer

Language: C++ - Size: 38.1 KB - Last synced: 6 days ago - Pushed: 11 days ago - Stars: 1 - Forks: 0

NVIDIA/nvidia-code-mgmt

Non-PLDM firmware update infrastructure

Language: C++ - Size: 213 KB - Last synced: 6 days ago - Pushed: 7 days ago - Stars: 1 - Forks: 0

NVIDIA/libnvme

Implementation of the NVMe protocol

Language: C - Size: 6.28 MB - Last synced: 6 days ago - Pushed: 11 days ago - Stars: 1 - Forks: 0

NVIDIA/nvidia-ipmi-oem

Implementation of NVIDIA OEM IPMI commands

Language: C++ - Size: 327 KB - Last synced: 6 days ago - Pushed: 11 days ago - Stars: 1 - Forks: 0

NVIDIA/cper-decoder

Converts CPERs to JSON

Language: C++ - Size: 216 KB - Last synced: 6 days ago - Pushed: 11 days ago - Stars: 1 - Forks: 0

NVIDIA/nvbmc-docs

Documentation for the NVIDIA OpenBMC stack

Language: TeX - Size: 4.11 MB - Last synced: 6 days ago - Pushed: 7 days ago - Stars: 1 - Forks: 0

NVIDIA/thrust 📦

[ARCHIVED] The C++ parallel algorithms library. See https://github.com/NVIDIA/cccl

Language: C++ - Size: 17 MB - Last synced: about 1 month ago - Pushed: 3 months ago - Stars: 4,845 - Forks: 757

NVIDIA/cudnn-frontend

cudnn_frontend provides a C++ wrapper for the cuDNN backend API and samples on how to use it

Language: C++ - Size: 35 MB - Last synced: 5 days ago - Pushed: 7 days ago - Stars: 312 - Forks: 65

NVIDIA/waveglow

A Flow-based Generative Network for Speech Synthesis

Language: Python - Size: 427 KB - Last synced: 5 days ago - Pushed: 7 months ago - Stars: 2,222 - Forks: 527

NVIDIA/go-gpuallocator

Go Abstraction for Allocating NVIDIA GPUs with Custom Policies

Language: Go - Size: 748 KB - Last synced: 6 days ago - Pushed: 6 days ago - Stars: 95 - Forks: 21

NVIDIA/warp

A Python framework for high performance GPU simulation and graphics

Language: Python - Size: 36.4 MB - Last synced: 6 days ago - Pushed: 6 days ago - Stars: 1,697 - Forks: 142

NVIDIA/cutlass

CUDA Templates for Linear Algebra Subroutines

Language: C++ - Size: 41.7 MB - Last synced: 6 days ago - Pushed: 6 days ago - Stars: 4,610 - Forks: 800

NVIDIA/libcudacxx 📦

[ARCHIVED] The C++ Standard Library for your entire system. See https://github.com/NVIDIA/cccl

Language: C++ - Size: 11.9 MB - Last synced: about 15 hours ago - Pushed: 3 months ago - Stars: 2,288 - Forks: 188

NVIDIA/cub 📦

[ARCHIVED] Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl

Language: Cuda - Size: 17.5 MB - Last synced: 6 days ago - Pushed: 7 months ago - Stars: 1,650 - Forks: 444

NVIDIA/TensorRT

NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.

Language: C++ - Size: 113 MB - Last synced: 7 days ago - Pushed: 7 days ago - Stars: 9,159 - Forks: 1,978

NVIDIA/nvidia-installer

NVIDIA driver installer

Language: C - Size: 2.03 MB - Last synced: 5 days ago - Pushed: 5 days ago - Stars: 124 - Forks: 27

NVIDIA/egl-wayland

The EGLStream-based Wayland external platform

Language: C - Size: 331 KB - Last synced: 5 days ago - Pushed: 25 days ago - Stars: 262 - Forks: 41

NVIDIA/go-nvml

Go Bindings for the NVIDIA Management Library (NVML)

Language: C - Size: 367 KB - Last synced: 13 days ago - Pushed: 18 days ago - Stars: 252 - Forks: 55

NVIDIA/jitify

A single-header C++ library for simplifying the use of CUDA Runtime Compilation (NVRTC).

Language: C++ - Size: 1010 KB - Last synced: 6 days ago - Pushed: 6 days ago - Stars: 500 - Forks: 64

NVIDIA/numbast

Numbast is a tool to build an automated pipeline that converts CUDA APIs into Numba bindings.

Language: Python - Size: 5.64 MB - Last synced: 6 days ago - Pushed: 6 days ago - Stars: 12 - Forks: 3

NVIDIA/gpu-operator

NVIDIA GPU Operator creates/configures/manages GPUs atop Kubernetes

Language: Go - Size: 83.7 MB - Last synced: 6 days ago - Pushed: 7 days ago - Stars: 1,178 - Forks: 227

NVIDIA/build-system-archive-import-examples

Examples for importing precompiled binary tarball and zip archives into various build and packaging systems

Language: Python - Size: 32.2 KB - Last synced: 7 days ago - Pushed: 7 days ago - Stars: 8 - Forks: 5

NVIDIA/pldm Fork of openbmc/pldm

Size: 3.73 MB - Last synced: 7 days ago - Pushed: 7 days ago - Stars: 1 - Forks: 0

NVIDIA/kata-containers Fork of kata-containers/kata-containers

Kata Containers is an implementation of lightweight Virtual Machines (VMs) that feel and perform like containers, but provide the workload isolation and security advantages of VMs.

Size: 80.3 MB - Last synced: 7 days ago - Pushed: 7 days ago - Stars: 2 - Forks: 1

NVIDIA/go-nvlib

A collection of useful Go libraries for use with NVIDIA GPU management tools

Language: Go - Size: 1.12 MB - Last synced: 13 days ago - Pushed: 16 days ago - Stars: 17 - Forks: 9

NVIDIA/nvidia-docker 📦

Build and run Docker containers leveraging NVIDIA GPUs

Size: 18.9 MB - Last synced: 5 days ago - Pushed: 5 months ago - Stars: 17,093 - Forks: 2,031

NVIDIA/spark-rapids-jni

RAPIDS Accelerator JNI For Apache Spark

Language: Cuda - Size: 2.18 MB - Last synced: 9 days ago - Pushed: 10 days ago - Stars: 30 - Forks: 54

NVIDIA/spark-rapids

Spark RAPIDS plugin - accelerate Apache Spark with GPUs

Language: Scala - Size: 52.4 MB - Last synced: 9 days ago - Pushed: 11 days ago - Stars: 722 - Forks: 214

NVIDIA/linux Fork of openbmc/linux

OpenBMC Linux kernel source tree

Size: 4.82 GB - Last synced: 7 days ago - Pushed: 7 days ago - Stars: 1 - Forks: 0

NVIDIA/webui-vue Fork of openbmc/webui-vue

Web-based user interface built on Vue.js for managing OpenBMC systems

Size: 5.42 MB - Last synced: 8 days ago - Pushed: 8 days ago - Stars: 1 - Forks: 0

NVIDIA/bios-settings-mgr Fork of openbmc/bios-settings-mgr

Size: 59.6 KB - Last synced: 8 days ago - Pushed: 8 days ago - Stars: 1 - Forks: 0

GitHub / NVIDIA 342 repositories