GitHub topics: back2back-gemm
Bruce-Lee-LY/cuda_back2back_hgemm
Use tensor core to calculate back-to-back HGEMM (half-precision general matrix multiplication) with MMA PTX instruction.
Language: Cuda - Size: 854 KB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 11 - Forks: 2
