An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: gpu-memory

NVIDIA/gdrcopy

A fast GPU memory copy library based on NVIDIA GPUDirect RDMA technology

Language: C++ - Size: 811 KB - Last synced at: 25 days ago - Pushed at: 3 months ago - Stars: 1,242 - Forks: 174

eyalroz/cuda-api-wrappers

Thin, unified, C++-flavored wrappers for the CUDA APIs

Language: C++ - Size: 2.88 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 860 - Forks: 84

joe0731/hf_vram_calc

A CLI tool for estimating GPU VRAM requirements for Hugging Face models, supporting various data types, parallelization strategies, and fine-tuning scenarios like LoRA.

Language: Python - Size: 232 KB - Last synced at: 29 days ago - Pushed at: 29 days ago - Stars: 1 - Forks: 2

Lin-Mao/DrGPUM

A memory profiler for NVIDIA GPUs to explore memory inefficiencies in GPU-accelerated applications.

Language: Python - Size: 248 KB - Last synced at: 24 days ago - Pushed at: about 1 year ago - Stars: 26 - Forks: 3

obisin/dgls Fork of tdrussell/diffusion-pipe

Dynamic GPU Layer Swapping: Train large models on consumer GPUs with intelligent memory management

Language: Python - Size: 7.83 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 2 - Forks: 0

0-u-0/nvidia-gpu-monitor

Language: C++ - Size: 18.6 KB - Last synced at: about 2 months ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 0

LiyuanLucasLiu/Torch-Scope

A Toolkit for Training, Tracking, Saving Models and Syncing Results

Language: Python - Size: 111 KB - Last synced at: 2 months ago - Pushed at: over 5 years ago - Stars: 62 - Forks: 6

JonSnow1807/gradient-cache

GPU memory-efficient training for PyTorch - 90%+ memory savings through gradient compression

Language: Python - Size: 27.3 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

parasj/checkmate

Training neural networks in TensorFlow 2.0 with 5x less memory

Language: Python - Size: 560 KB - Last synced at: 7 months ago - Pushed at: over 3 years ago - Stars: 130 - Forks: 15

Alex188dot/GPU-VRAM-Calculator

A simple tool to find out GPU VRAM requirements for running LLMs

Language: HTML - Size: 7.81 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

jonlamb-gh/rpi3-rust-fel4-workspace

Rust embedded things running on the seL4 microkernel for the Raspberry Pi 3

Language: Rust - Size: 134 KB - Last synced at: 7 months ago - Pushed at: almost 7 years ago - Stars: 6 - Forks: 2

sina-masnadi/nvidia-mg

📊 A command line monitoring tool (graph) for NVIDIA GPUs

Language: Python - Size: 72.3 KB - Last synced at: over 2 years ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

pvjosue/OpenCV-Spout

OpenCV & Spout C++ library. Shared GPU memory and processing within reach.

Language: C++ - Size: 1.89 MB - Last synced at: over 2 years ago - Pushed at: over 5 years ago - Stars: 7 - Forks: 2

ImperialStranger/Python-GPU_benchmark

Python-GPU_benchmark is a module that provides all the information about your graphics card

Language: Python - Size: 28.3 KB - Last synced at: over 2 years ago - Pushed at: almost 7 years ago - Stars: 2 - Forks: 0

mis-wut/feathergpu

Language: C++ - Size: 1.69 MB - Last synced at: over 2 years ago - Pushed at: over 8 years ago - Stars: 13 - Forks: 0

hongshibao/kubernetes Fork of kubernetes/kubernetes

A fork of Kubernetes that supports NVIDIA GPU memory as a schedulable resource

Language: Go - Size: 554 MB - Last synced at: over 2 years ago - Pushed at: about 7 years ago - Stars: 2 - Forks: 0

Fangyh09/gpustatus

A tiny, useful command-line tool that shows each user's GPU usage and the PIDs running on each GPU, providing more detail than nvidia-smi/gpustat

Language: Shell - Size: 8.79 KB - Last synced at: over 1 year ago - Pushed at: about 6 years ago - Stars: 5 - Forks: 0

eklitzke/tf-slice

Demonstration of generating mini-batches in TensorFlow from GPU memory.

Language: Python - Size: 4.88 KB - Last synced at: over 2 years ago - Pushed at: over 8 years ago - Stars: 3 - Forks: 1