GitHub / mikeendarson / Deepdive-llama3-from-scratch
Achieve the llama3 inference step-by-step, grasp the core concepts, master the process derivation, implement the code.
Stars: 0
Forks: 0
Open issues: 0
License: mit
Language: Jupyter Notebook
Size: 16.6 MB
Dependencies parsed at: Pending
Created at: about 2 months ago
Updated at: about 2 months ago
Pushed at: about 2 months ago
Last synced at: about 2 months ago
Topics: attention, attention-mechanism, gpt, inference, kv-cache, language-model, llama, llm-configuration, llms, mask, positional-encoding, rms, rope, rotary-position-encoding