GitHub topics: gpu-profiler
RightNow-AI/gpu-profiler
Open-source web-based GPU performance visualization tool that transforms NVIDIA profiling data into interactive insights for CUDA engineers. Features timeline views, flame graphs, heatmaps, and AI-powered bottleneck detection.
Language: TypeScript - Size: 272 KB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 1 - Forks: 1

ROCm/omnitrace
Omnitrace: Application Profiling, Tracing, and Analysis
Language: C++ - Size: 6.14 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 321 - Forks: 28

MolSSI-Education/gpu_programming_beginner
Fundamentals of heterogeneous parallel programming with CUDA C/C++ at the beginner level.
Language: Python - Size: 5.25 MB - Last synced at: 3 days ago - Pushed at: over 2 years ago - Stars: 13 - Forks: 2

Siddhant-K-code/LLMTraceFX
GPU-level LLM inference profiler that analyzes token-level performance and provides AI-powered explanations.
Language: Python - Size: 224 KB - Last synced at: 1 day ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

dehydratedpotato/socpowerbud
Sudoless alternative to powermetrics for Apple Silicon; realtime CPU & GPU frequency, volts, usage, etc.
Language: Objective-C - Size: 343 KB - Last synced at: 5 months ago - Pushed at: over 1 year ago - Stars: 78 - Forks: 11

vinjn/GpuProf
Realtime GPU Profiler for AMD / NVIDIA / Intel GPUs
Language: JavaScript - Size: 7.4 MB - Last synced at: 3 months ago - Pushed at: about 2 years ago - Stars: 32 - Forks: 4

Lin-Mao/DrGPUM
A memory profiler for NVIDIA GPUs to explore memory inefficiencies in GPU-accelerated applications.
Language: Python - Size: 248 KB - Last synced at: 3 days ago - Pushed at: 11 months ago - Stars: 25 - Forks: 3

TimvanScherpenzeel/profiling-research
Research on advanced profiling of high-performance web applications (primarily WebGL applications).
Language: Shell - Size: 3.72 MB - Last synced at: 3 months ago - Pushed at: over 6 years ago - Stars: 42 - Forks: 3

vilbeyli/VQEngine-Legacy
DirectX 11 Renderer written in C++11
Language: C++ - Size: 822 MB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 282 - Forks: 37

vimontgames/optick Fork of bombomby/optick
C++ Profiler For Games
Language: C# - Size: 77.1 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

romslf/system-companion
Multi platform electron app for getting system details and processes manager
Language: Vue - Size: 5.47 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 5 - Forks: 3

hinofafa/torch_accelerator
Experiments to accelerate GPU device for PyTorch training
Language: Jupyter Notebook - Size: 27.3 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0
