GitHub / Gawdanzo / Build-a-LLM-model-from-scratch
🚀 Build a complete LLM model from scratch with an easy-to-follow, end-to-end pipeline for data processing, training, and fine-tuning.
JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Gawdanzo%2FBuild-a-LLM-model-from-scratch
PURL: pkg:github/Gawdanzo/Build-a-LLM-model-from-scratch
Stars: 0
Forks: 0
Open issues: 0
License: other
Language: Jupyter Notebook
Size: 420 KB
Dependencies parsed at: Pending
Created at: 16 days ago
Updated at: 16 days ago
Pushed at: 16 days ago
Last synced at: 16 days ago
Topics: attention-mechanism, bytes, causal-attention, ddp, gpt, instruction-tuning, llm, lora, next-token-prediction, peft, perplexity, pytorch, transformer