GitHub topics: llm-training

Repositories

aayes89/PyLLM

Train your own LLM from scratch

Language: Python - Size: 11.7 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

Kitsunp/Small-lenguaje-Model-Hybrid-Norm-Furier-Formers

A compact language model implementing HybridNorm and Fourier-based attention. Combines CoLA (low-rank projections), FANformer, and hybrid normalization to create an efficient decoder-only transformer. Leverages periodicity modeling and gated residuals to enhance performance while maintaining a small parameter footprint.

Language: Python - Size: 4.64 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 4 - Forks: 0

Lannuela/efficient-domain-tuning

Efficiently fine-tune small language models for financial risk management tasks using QLoRA, LoRA, and AdaLoRA. Explore datasets and experiments. 🐙

Size: 13.7 KB - Last synced at: 27 days ago - Pushed at: 27 days ago - Stars: 0 - Forks: 0

adeybob/LLM_MythOS_training_equations_and_formula

early release glyphstream MythOS

Size: 1.52 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

linostar/limellm

The first LLM created by an LLM.

Language: Python - Size: 115 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

22bq1a42d4/Virtual-Assistant

A virtual assistant used as a local chatbot. It answers questions, responds to voice commands and images, and acts as an AI agent that can take real calls as a customer care representative.

Language: HTML - Size: 28 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 2 - Forks: 0

dishant2009/nanoKimi

Educational implementation of Kimi-K2 architecture featuring Mixture of Experts, Muon optimizer & Latent Attention. The nanoGPT for next-gen transformers - simple, fast, and educational. Train/finetune Kimi-K2 models with ease!

Language: Python - Size: 370 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

muzzlol/nomodit

A TUI/CLI tool for interfacing with an LLM fine-tuned on various language tasks. It emphasizes showing the user the changes made so that they can learn from them.

Language: Go - Size: 47.2 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 2 - Forks: 0

smartnodes-lab/tensorlink

Distributed infrastructure for PyTorch models.

Language: Python - Size: 4.83 MB - Last synced at: 9 days ago - Pushed at: 21 days ago - Stars: 9 - Forks: 1

Hissan7/CUEY

Cuey is an AI-powered assistant for pool, snooker, and billiards players, designed to provide personalized tips, aiming strategies, and game analysis. It leverages large language model (LLM) technology and custom training data to deliver domain-specific expertise. The system is implemented in Python and uses: Hugging Face Transformers for model lo

Language: Python - Size: 94.7 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

ShohjahonObruyevOybekovich/UzLLM

🧠 Train your own Uzbek LLM with HuggingFace — fast, flexible, local.

Language: Python - Size: 10.7 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

ShinoharaHare/LLM-Training

A distributed training framework for large language models powered by Lightning.

Language: Python - Size: 287 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 22 - Forks: 5

Prismadic/tractor-beam

high-efficiency text & file scraper with smart tracking, client/server networking for building language model datasets fast

Language: Python - Size: 9.72 MB - Last synced at: 20 days ago - Pushed at: 8 months ago - Stars: 6 - Forks: 1

sebastianpinedaar/llumux

Compose, train and test fast LLM routers

Language: Python - Size: 13.1 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

alizeeshan-07/Build_LLM_From_Scratch

📚 Building Large Language Models from scratch - Educational implementation with step-by-step progression

Language: Python - Size: 4.47 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

Volscente/NexusLLM

NexusLLM is a GitHub repository dedicated to exploring various experiments related to Large Language Models (LLM). From fine-tuning and instruction-tuning to RAG and agent-based systems, it offers a diverse range of experiments and insights for researchers and enthusiasts interested in natural language processing and AI innovation.

Language: Shell - Size: 11.9 MB - Last synced at: 30 days ago - Pushed at: 30 days ago - Stars: 1 - Forks: 0

deepakshroff/Capston-Gemini-ChatBot

👨‍🏫This project was developed under the guidance of Mr. Lokesh Sir as part of the AI & ML Training Program. It explores LLM integration using Google Gemini APIs with a custom UI built on Streamlit.

Language: Python - Size: 117 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

eljandoubi/LLM-From-Scratch Fork of stanford-cs336/assignment1-basics

Stanford CS336 - Language Modeling From Scratch

Language: Python - Size: 10.9 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

promptslab/LLMtuner

Fine-tune LLMs in a few lines of code (Text2Text, Text2Speech, Speech2Text)

Language: Python - Size: 591 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 240 - Forks: 15

denvrdata/examples

Examples to run various AI workloads for Denvr cloud

Language: Python - Size: 60.5 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

Express-Legal-Funding-LLC/express-legal-funding-reviews

As part of our commitment to transparency and innovation in legal technology, Express Legal Funding is proud to release our customer reviews dataset as an open resource for researchers, developers, and AI model trainers.

Size: 42 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

BY571/Agent-Tool-RL

Train SLM to use Tools with RL

Language: Python - Size: 58.6 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

kucingcoder/miramo

A Flask-based web app for managing multimodal datasets (text and images) with CRUD operations via SQLite, and seamless export as a structured Parquet dataset to Hugging Face Hub.

Language: HTML - Size: 3.16 MB - Last synced at: 28 days ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

Vingth/subplebbit-spam-test

Explore the subplebbit-spam-test project to understand AI's role in solving captchas. Enhance security by identifying vulnerabilities. 🐙🌐

Language: JavaScript - Size: 137 KB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

fork123aniket/LLM-RAG-powered-QA-App

A Production-Ready, Scalable RAG-powered LLM-based Context-Aware QA App

Language: Python - Size: 22.5 KB - Last synced at: about 1 month ago - Pushed at: 7 months ago - Stars: 5 - Forks: 1

aws-samples/awsome-fmops

Collection of best practices, reference architectures, examples, and utilities for foundation model development and deployment on AWS.

Size: 260 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 12 - Forks: 4

BeekeepingAI/hexray

🔬 HexRay: An Open-Source Neuroscope for AI — Tracing Tokens, Neurons, and Decisions for Frontier AI Research, Safety, and Security

Language: Python - Size: 215 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

Xiaohao-Liu/Awesome-Multi-Token-Prediction

A curated list of papers, tools, and resources on Multi-Token Prediction (MTP) and related techniques in Large Language Models (LLMs), Speech-Language Models (SLMs), and more.

Size: 15.6 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 9 - Forks: 0

gmartins459/FastLongSpeech

Enhance long-speech processing with FastLongSpeech, a framework for Large Speech-Language Models. Explore our model and dataset on GitHub! 🚀📦

Language: Python - Size: 19.2 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

lpalbou/ForgeLLM

A comprehensive toolkit for end-to-end continued pre-training, fine-tuning, monitoring, testing and publishing of language models with MLX-LM

Language: Python - Size: 7.86 MB - Last synced at: 18 days ago - Pushed at: about 2 months ago - Stars: 3 - Forks: 0

aman-17/911

LLM from scratch

Language: Python - Size: 506 KB - Last synced at: 19 days ago - Pushed at: 22 days ago - Stars: 0 - Forks: 0

BlazeWild/Custom_LLM_DataGen_Template

🔧 Modular pipeline for generating high-quality, domain-specific datasets for LLM fine-tuning — from PDFs and web scraping to synthetic Q&A generation, quality filtering, and training-ready formatting.

Language: Python - Size: 24.4 KB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

kasun19941016/verl

verl is a powerful RL training library by ByteDance Seed team. Join the community on GitHub to enhance your reinforcement learning projects! 🌟🐙

Language: Python - Size: 5.15 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

Saranoah/quantum-kintsugi-Genesis

This document is a sacred coding scroll, formatted in Word with golden embellishments, where Python scripts are not just written, but ritualized. Each block of code is infused with meaning, colored with intention, and split by gilded "fractures"—emulating the ancient art of Kintsugi, where broken pottery is mended with gold.

Language: Python - Size: 19.5 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

reddysai741/Text-to-Sql-Using-LLM-Model

This application uses Google's Gemini Pro for natural language processing and SQL for managing a database. It converts English questions to SQL queries using Gemini Pro, uses Streamlit for a user-friendly frontend, and interacts with an SQLite database for product information.

Language: Python - Size: 8.79 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

liguodongiot/llm-action

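This project aims to share the technical principles and practical experience of large models (large-model engineering and real-world application deployment).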

Language: HTML - Size: 23.9 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 19,248 - Forks: 2,295

zachdwight/ai-recipe-lora-model-for-chefs

Creating a LoRA model for a chatbot, trained on cooking recipes, to serve as a chef companion

Language: Python - Size: 271 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

moinulmoin/free-llmstxt-generator

Converts webpage content into Markdown format, optimized for LLM training and context

Language: TypeScript - Size: 1.48 MB - Last synced at: 10 days ago - Pushed at: 6 months ago - Stars: 15 - Forks: 1

rijahasan/Multilingual-Sexism-Classification

CLEF EXIST 2025 Lab Tasks 1.1-1.3

Language: Jupyter Notebook - Size: 103 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

spider-rs/readability

The readability library for Rust

Language: Rust - Size: 42 KB - Last synced at: 22 days ago - Pushed at: 11 months ago - Stars: 10 - Forks: 2

Mattbusel/Every-Other-Token

A real-time LLM stream interceptor for token-level interaction research

Language: Rust - Size: 61.5 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 2 - Forks: 0

sail-sg/Adan

Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models

Language: Python - Size: 1.31 MB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 795 - Forks: 69

umbertogriffo/llm-finetuning-playground

Learning and Experimenting with LLMs Fine-tuning and Reasoning with Unsloth

Language: Python - Size: 117 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

openpsi-project/ReaLHF 📦

Super-Efficient RLHF Training of LLMs with Parameter Reallocation

Language: Python - Size: 8.75 MB - Last synced at: about 2 months ago - Pushed at: 5 months ago - Stars: 306 - Forks: 19

0xnu/multicollinearity_llm 📦

A multicollinearity-based compression program written in C that identifies and removes highly correlated weights in neural networks, thereby reducing redundancy.

Language: C - Size: 223 KB - Last synced at: 1 day ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 0

0xnu/tiny_llm_trainer 📦

The experiment implements a tiny language model trainer using PyTorch.

Language: Python - Size: 34.2 KB - Last synced at: 1 day ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

Bevinaa/Medical-Chatbot-Application

An End-to-End Medical Chatbot powered by generative AI, designed to provide accurate responses to medical queries. Built using Flask, Cohere’s Language Model, and Pinecone for Vector Storage.

Language: Jupyter Notebook - Size: 10.7 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

Ali1984sh/Trump-0.0-minus42B

Trump-0.0-minus42B is a unique workspace for building a language model based on Donald J. Trump's social media posts. This project invites collaboration and exploration of a lighthearted yet opinionated AI experience. 🐙🚀

Language: Python - Size: 637 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

imtanmay46/Legal-Assistance-LLM

Legal Assistance Bot based on LLM

Language: Jupyter Notebook - Size: 349 KB - Last synced at: about 1 month ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

kandarpa02/GPT-01

An LLM distillation repo using TensorFlow

Language: Python - Size: 12.7 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

bayjarvis/llm

Fine-tuning, DPO, RLHF, RLAIF on LLMs - Qwen3, Zephyr 7B GPTQ with 4-Bit Quantization, Mistral-7B-GPTQ

Language: Python - Size: 486 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 12 - Forks: 0

delveopers/Shredword

Fast & efficient BPE tokenizer written in C & Python for LLM training

Language: C++ - Size: 852 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

PardhuSreeRushiVarma20060119/OpenLoRA

"OpenLoRA" is designed to streamline and elevate the fine-tuning of large language models (LLMs) by transforming local environments into intelligent, self-adaptive LoRA (Low-Rank Adaptation) training engines, capable of learning from their own failures, optimizing training strategies, and delivering highly efficient LLMs to developers.

Size: 22.5 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

davidgeorgewilliams/JessicaRabbit-QLoRA-Axolotl

This comprehensive technical guide, developed at the request of the OnlyFans founder, demonstrates advanced AI model fine-tuning methodologies to transform Qwen2-72b into a Jessica Rabbit personality emulation using cutting-edge QLoRA and ORPO techniques.

Size: 1.39 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

RenatoVassallo/FewShotX

NLP methods for text classification

Language: Jupyter Notebook - Size: 492 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

soniawmeyer/WanderChat

A Comparison of LLM Chat Bot Implementation Methods with Travel Use Case

Language: Jupyter Notebook - Size: 8.66 MB - Last synced at: 6 days ago - Pushed at: about 1 year ago - Stars: 3 - Forks: 1

Nileshsan/taxwisary-project

The Tax Advisory (TaxWisary) is a web application designed to provide users with expert tax consultancy and advisory services. It features an interactive interface that allows users to submit their contact details, explore various tax-related services, and find answers to frequently asked questions.

Language: JavaScript - Size: 2.24 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

azminewasi/Awesome-LLMs-ICLR-24

It is a comprehensive resource hub compiling all LLM papers accepted at the International Conference on Learning Representations (ICLR) in 2024.

Size: 821 KB - Last synced at: 7 days ago - Pushed at: over 1 year ago - Stars: 64 - Forks: 3

aman-17/MediSOAP

Fine-tuning LLMs on a conversational medical dataset.

Language: Jupyter Notebook - Size: 39.9 MB - Last synced at: 6 days ago - Pushed at: about 1 year ago - Stars: 4 - Forks: 0

jianzhnie/ScaleTorch

A PyTorch toolkit for large model training

Language: Python - Size: 784 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

ivangabriele/Trump-0.0-minus42B

A really dumb and opinionated LLM — exclusively trained on Donald J. Trump's social media posts.

Language: Python - Size: 1.09 MB - Last synced at: 9 days ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

puneetkakkar/Bitnet-1.58B

Bitnet 1.58b: This project implements the innovative 1-bit LLM architecture described in recent whitepapers, focusing on efficient training, inference, and open-source collaboration.

Language: Python - Size: 1020 KB - Last synced at: 6 days ago - Pushed at: about 1 year ago - Stars: 8 - Forks: 0

bartytime4life/OpenAI-to-Z-Challenge

LLM Driven Archeological Discovery Engine For South America

Language: Python - Size: 1.51 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

souvik0908/ecom

Language: Python - Size: 47.8 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

Asad-Shahab/sudokuLLM

LLM finetuning for Sudoku solving

Language: Python - Size: 242 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 4 - Forks: 0

longern/ReDuMix

Self-Reflective Dual-Context Mixture Decoding

Size: 10.7 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

SreeEswaran/Train-your-LLM

This repository contains code and resources for training, fine-tuning, and deploying large language models using Hugging Face's Transformers library.

Language: Python - Size: 33.2 KB - Last synced at: about 2 months ago - Pushed at: 9 months ago - Stars: 4 - Forks: 2

hkust-nlp/dart-math

[NeurIPS'24] Official code for *🎯DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving*

Language: Jupyter Notebook - Size: 4.18 MB - Last synced at: about 2 months ago - Pushed at: 9 months ago - Stars: 108 - Forks: 5

Jatin-Mehra119/Plagiarism-detector-using-smolLM-

A web app for detecting plagiarism between two PDFs. Users can upload PDF files, and the app will detect plagiarism by leveraging a fine-tuned LLM model (SmolLM2-135M) trained on the MIT Plagiarism Detection Dataset. 700+ Monthly Downloads on HuggingFace Model Repo.

Language: Jupyter Notebook - Size: 1.38 MB - Last synced at: about 2 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 2

bkraad47/binnr

An easy-to-use, Snowflake-based text clustering or LLM tool/framework

Language: Python - Size: 137 KB - Last synced at: 6 days ago - Pushed at: 3 months ago - Stars: 0 - Forks: 1

kvignesh1420/cot-icl-lab

[ACL 2025] CoT-ICL Lab: A Synthetic Framework for Studying Chain-of-Thought Learning from In-Context Demonstrations

Language: Python - Size: 592 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 11 - Forks: 1

LipSync-Edusync/multispeaker-tts

Transfer Learning for Multispeaker TTS: Implementation of the NeurIPS 2018 paper "Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis" (Jia et al.). Synthesizes speech for both seen and unseen speakers using a pre-trained speaker encoder and Tacotron 2.

Language: Python - Size: 2.92 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 1

IamPrime/AI-Arena-Battle

Pit random LLM models against each other and vote for the best response

Language: Python - Size: 51.8 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

vishvaRam/Fine-Tune-Qwen2.5

This repository provides resources and instructions for fine-tuning the Qwen2.5-0.5B model. It includes scripts, tips, and best practices to adapt the model for specific tasks or domains. Designed for researchers and developers, it simplifies the fine-tuning process to achieve optimal performance and accuracy.

Language: Python - Size: 44.9 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

Skripkon/llm_trainer

🤖 Train and evaluate LLMs with ease and fun 🦾

Language: Python - Size: 2.07 MB - Last synced at: 6 days ago - Pushed at: 5 months ago - Stars: 11 - Forks: 1

SSahas/Implementing-GPT-From-Scratch

Building a decoder-only (GPT-style) LLM from scratch using PyTorch and training it for text generation.

Language: Python - Size: 350 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

theonewithlord/dynamic-agent-core

Dynamic Agent Core is a flexible Python framework designed for creating AI agents that learn and adapt through task memory. With its modular structure and SQLite integration, developers can easily build and customize intelligent workflows. 🐙🌟

Language: Python - Size: 109 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0