GitHub / lucidrains / native-sparse-attention-pytorch
Implementation of the sparse attention pattern proposed by the Deepseek team in their "Native Sparse Attention" paper
Stars: 601
Forks: 30
Open issues: 7
License: mit
Language: Python
Size: 34.6 MB
Dependencies parsed at: Pending
Created at: 2 months ago
Updated at: 9 days ago
Pushed at: about 1 month ago
Last synced at: 8 days ago
Topics: artificial-intelligence, attention, deep-learning, sparse-attention
Loading...