GitHub / Liu-xiandong / How_to_optimize_in_GPU
This is a series on GPU optimization topics that explains in detail how to optimize CUDA kernels. I introduce several basic kernel optimizations, including elementwise, reduce, sgemv, and sgemm. The performance of these kernels is at or near the theoretical limit.
JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Liu-xiandong%2FHow_to_optimize_in_GPU
Stars: 1,035
Forks: 154
Open issues: 7
License: apache-2.0
Language: Cuda
Size: 1.25 MB
Dependencies parsed at: Pending
Created at: over 3 years ago
Updated at: 4 days ago
Pushed at: almost 2 years ago
Last synced at: 4 days ago
Topics: elementwise, gpu-acceleration, high-performance-computing, hpc, reduce, sgemm, sgemv