GitHub topics: back2back-gemm

Repositories

Bruce-Lee-LY/cuda_back2back_hgemm

Use tensor core to calculate back-to-back HGEMM (half-precision general matrix multiplication) with MMA PTX instruction.

Language: Cuda - Size: 854 KB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 11 - Forks: 2

Related Keywords

back2back-gemm 1 back2back-hgemm 1 cublas 1 cuda 1 fused-gemm 1 fused-hgemm 1 gemm 1 gpu 1 hgemm 1 matrix-multiply 1 nvidia 1 tensor-core 1