An open API service providing repository metadata for many open source software ecosystems.

GitHub / Awrsha / Advanced-CUDA-Programming-GPU-Architecture

This repository provides a comprehensive guide to optimizing GPU kernels for performance, with a focus on NVIDIA GPUs. It covers key tools and techniques such as CUDA, PyTorch, and Triton, aimed at improving computational efficiency for deep learning and scientific computing tasks.

JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Awrsha%2FAdvanced-CUDA-Programming-GPU-Architecture
PURL: pkg:github/Awrsha/Advanced-CUDA-Programming-GPU-Architecture

Stars: 1
Forks: 0
Open issues: 0

License: None
Language: Cuda
Size: 25.2 MB
Dependencies parsed at: Pending

Created at: 8 months ago
Updated at: 4 months ago
Pushed at: 8 months ago
Last synced at: 3 months ago

Topics: cuda-programming, gpu-programming, jit, kernels, matmul, mojo-language, multiprocessing, multithreading, torchquantum, triton

    Loading...