GitHub / harleyszhang / llm_note
LLM notes, including model inference, transformer model structure, and llm framework code analysis notes.
JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/harleyszhang%2Fllm_note
Stars: 722
Forks: 73
Open issues: 0
License: None
Language: Python
Size: 176 MB
Dependencies parsed at: Pending
Created at: 7 months ago
Updated at: 5 days ago
Pushed at: 5 days ago
Last synced at: 5 days ago
Topics: cuda-programming, kv-cache, llm, llm-inference, transformer-models, triton-kernels, vllm