GitHub / Kaminyou / Flash-Attention-Practice
A minimal CUDA implementation of FlashAttention v1 and v2
JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Kaminyou%2FFlash-Attention-Practice
PURL: pkg:github/Kaminyou/Flash-Attention-Practice
Stars: 0
Forks: 0
Open issues: 0
License: None
Language: Python
Size: 19.5 KB
Dependencies parsed at: Pending
Created at: 2 months ago
Updated at: about 2 months ago
Pushed at: about 2 months ago
Last synced at: about 2 months ago
Topics: cuda-programming, deep-learning, flashattention