An open API service providing repository metadata for many open source software ecosystems.

GitHub / Liu-xiandong / How_to_optimize_in_GPU

This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, sgemv, sgemm, etc. The performance of these kernels is basically at or near the theoretical limit.

JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Liu-xiandong%2FHow_to_optimize_in_GPU

Stars: 1,035
Forks: 154
Open issues: 7

License: apache-2.0
Language: Cuda
Size: 1.25 MB
Dependencies parsed at: Pending

Created at: over 3 years ago
Updated at: 4 days ago
Pushed at: almost 2 years ago
Last synced at: 4 days ago

Topics: elementwise, gpu-acceleration, high-performance-computing, hpc, reduce, sgemm, sgemv

    Loading...