GitHub topics: llm-training
aayes89/PyLLM
Train your own LLM from scratch
Language: Python - Size: 11.7 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

Kitsunp/Small-lenguaje-Model-Hybrid-Norm-Furier-Formers
A compact language model implementing HybridNorm and Fourier-based attention. Combines CoLA (low-rank projections), FANformer, and hybrid normalization to create an efficient decoder-only transformer. Leverages periodicity modeling and gated residuals to enhance performance while maintaining a small parameter footprint.
Language: Python - Size: 4.64 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 4 - Forks: 0

Lannuela/efficient-domain-tuning
Efficiently fine-tune small language models for financial risk management tasks using QLoRA, LoRA, and AdaLoRA. Explore datasets and experiments. 🐙
Size: 13.7 KB - Last synced at: 27 days ago - Pushed at: 27 days ago - Stars: 0 - Forks: 0

adeybob/LLM_MythOS_training_equations_and_formula
early release glyphstream MythOS
Size: 1.52 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

linostar/limellm
The first LLM created by an LLM.
Language: Python - Size: 115 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

22bq1a42d4/Virtual-Assistant
Virtual assistant used as a local chatbot. It answers questions, responds to voice commands and images, and acts as an AI agent that takes real calls in a customer-care role.
Language: HTML - Size: 28 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 2 - Forks: 0

dishant2009/nanoKimi
Educational implementation of Kimi-K2 architecture featuring Mixture of Experts, Muon optimizer & Latent Attention. The nanoGPT for next-gen transformers - simple, fast, and educational. Train/finetune Kimi-K2 models with ease!
Language: Python - Size: 370 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

muzzlol/nomodit
A TUI/CLI tool for interfacing with an LLM fine-tuned on various language tasks. It emphasizes showing the user the changes made so they can learn from them.
Language: Go - Size: 47.2 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 2 - Forks: 0

smartnodes-lab/tensorlink
Distributed infrastructure for PyTorch models.
Language: Python - Size: 4.83 MB - Last synced at: 9 days ago - Pushed at: 21 days ago - Stars: 9 - Forks: 1

Hissan7/CUEY
Cuey is an AI-powered assistant for pool, snooker, and billiards players, designed to provide personalized tips, aiming strategies, and game analysis. It leverages large language model (LLM) technology and custom training data to deliver domain-specific expertise. The system is implemented in Python and uses: Hugging Face Transformers for model lo
Language: Python - Size: 94.7 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

ShohjahonObruyevOybekovich/UzLLM
🧠 Train your own Uzbek LLM with HuggingFace — fast, flexible, local.
Language: Python - Size: 10.7 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

ShinoharaHare/LLM-Training
A distributed training framework for large language models powered by Lightning.
Language: Python - Size: 287 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 22 - Forks: 5

Prismadic/tractor-beam
high-efficiency text & file scraper with smart tracking, client/server networking for building language model datasets fast
Language: Python - Size: 9.72 MB - Last synced at: 20 days ago - Pushed at: 8 months ago - Stars: 6 - Forks: 1

sebastianpinedaar/llumux
Compose, train and test fast LLM routers
Language: Python - Size: 13.1 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

alizeeshan-07/Build_LLM_From_Scratch
📚 Building Large Language Models from scratch - Educational implementation with step-by-step progression
Language: Python - Size: 4.47 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

Volscente/NexusLLM
NexusLLM is a GitHub repository dedicated to exploring various experiments related to Large Language Models (LLM). From fine-tuning and instruction-tuning to RAG and agent-based systems, it offers a diverse range of experiments and insights for researchers and enthusiasts interested in natural language processing and AI innovation.
Language: Shell - Size: 11.9 MB - Last synced at: 30 days ago - Pushed at: 30 days ago - Stars: 1 - Forks: 0

deepakshroff/Capston-Gemini-ChatBot
👨‍🏫 This project was developed under the guidance of Mr. Lokesh Sir as part of the AI & ML Training Program. It explores LLM integration using Google Gemini APIs with a custom UI built on Streamlit.
Language: Python - Size: 117 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

eljandoubi/LLM-From-Scratch (fork of stanford-cs336/assignment1-basics)
Stanford CS336 - Language Modeling From Scratch
Language: Python - Size: 10.9 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

promptslab/LLMtuner
Fine-tune LLMs in a few lines of code (Text2Text, Text2Speech, Speech2Text)
Language: Python - Size: 591 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 240 - Forks: 15

denvrdata/examples
Examples to run various AI workloads for Denvr cloud
Language: Python - Size: 60.5 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

Express-Legal-Funding-LLC/express-legal-funding-reviews
As part of our commitment to transparency and innovation in legal technology, Express Legal Funding is proud to release our customer reviews dataset as an open resource for researchers, developers, and AI model trainers.
Size: 42 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

BY571/Agent-Tool-RL
Train SLM to use Tools with RL
Language: Python - Size: 58.6 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

kucingcoder/miramo
A Flask-based web app for managing multimodal datasets (text and images) with CRUD operations via SQLite, and seamless export as a structured Parquet dataset to Hugging Face Hub.
Language: HTML - Size: 3.16 MB - Last synced at: 28 days ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

Vingth/subplebbit-spam-test
Explore the subplebbit-spam-test project to understand AI's role in solving captchas. Enhance security by identifying vulnerabilities. 🐙🌐
Language: JavaScript - Size: 137 KB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

fork123aniket/LLM-RAG-powered-QA-App
A Production-Ready, Scalable RAG-powered LLM-based Context-Aware QA App
Language: Python - Size: 22.5 KB - Last synced at: about 1 month ago - Pushed at: 7 months ago - Stars: 5 - Forks: 1

aws-samples/awsome-fmops
Collection of best practices, reference architectures, examples, and utilities for foundation model development and deployment on AWS.
Size: 260 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 12 - Forks: 4

BeekeepingAI/hexray
🔬 HexRay: An Open-Source Neuroscope for AI — Tracing Tokens, Neurons, and Decisions for Frontier AI Research, Safety, and Security
Language: Python - Size: 215 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

Xiaohao-Liu/Awesome-Multi-Token-Prediction
A curated list of papers, tools, and resources on Multi-Token Prediction (MTP) and related techniques in Large Language Models (LLMs), Speech-Language Models (SLMs), and more.
Size: 15.6 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 9 - Forks: 0

gmartins459/FastLongSpeech
Enhance long-speech processing with FastLongSpeech, a framework for Large Speech-Language Models. Explore our model and dataset on GitHub! 🚀📦
Language: Python - Size: 19.2 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

lpalbou/ForgeLLM
A comprehensive toolkit for end-to-end continued pre-training, fine-tuning, monitoring, testing and publishing of language models with MLX-LM
Language: Python - Size: 7.86 MB - Last synced at: 18 days ago - Pushed at: about 2 months ago - Stars: 3 - Forks: 0

aman-17/911
LLM from scratch
Language: Python - Size: 506 KB - Last synced at: 19 days ago - Pushed at: 22 days ago - Stars: 0 - Forks: 0

BlazeWild/Custom_LLM_DataGen_Template
🔧 Modular pipeline for generating high-quality, domain-specific datasets for LLM fine-tuning — from PDFs and web scraping to synthetic Q&A generation, quality filtering, and training-ready formatting.
Language: Python - Size: 24.4 KB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

kasun19941016/verl
verl is a powerful RL training library by ByteDance Seed team. Join the community on GitHub to enhance your reinforcement learning projects! 🌟🐙
Language: Python - Size: 5.15 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

Saranoah/quantum-kintsugi-Genesis
This document is a sacred coding scroll, formatted in Word with golden embellishments, where Python scripts are not just written, but ritualized. Each block of code is infused with meaning, colored with intention, and split by gilded "fractures"—emulating the ancient art of Kintsugi, where broken pottery is mended with gold.
Language: Python - Size: 19.5 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

reddysai741/Text-to-Sql-Using-LLM-Model
This application uses Google's Gemini Pro for natural language processing and SQL for managing a database. It converts English questions to SQL queries using Gemini Pro, uses Streamlit for a user-friendly frontend, and interacts with an SQLite database for product information.
Language: Python - Size: 8.79 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

liguodongiot/llm-action
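This project aims to share the technical principles and practical experience behind large models (large-model engineering and real-world deployment of large-model applications).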
Language: HTML - Size: 23.9 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 19,248 - Forks: 2,295

zachdwight/ai-recipe-lora-model-for-chefs
Creating a LoRA model for a chatbot, using cooking recipes, to build a chef companion
Language: Python - Size: 271 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

moinulmoin/free-llmstxt-generator
converts webpage content into Markdown format, optimized for LLM training and context
Language: TypeScript - Size: 1.48 MB - Last synced at: 10 days ago - Pushed at: 6 months ago - Stars: 15 - Forks: 1

rijahasan/Multilingual-Sexism-Classification
CLEF EXIST 2025 Lab Tasks 1.1-1.3
Language: Jupyter Notebook - Size: 103 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

spider-rs/readability
The readability library for Rust
Language: Rust - Size: 42 KB - Last synced at: 22 days ago - Pushed at: 11 months ago - Stars: 10 - Forks: 2

Mattbusel/Every-Other-Token
A real-time LLM stream interceptor for token-level interaction research
Language: Rust - Size: 61.5 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 2 - Forks: 0

sail-sg/Adan
Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models
Language: Python - Size: 1.31 MB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 795 - Forks: 69

umbertogriffo/llm-finetuning-playground
Learning and Experimenting with LLMs Fine-tuning and Reasoning with Unsloth
Language: Python - Size: 117 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

openpsi-project/ReaLHF 📦
Super-Efficient RLHF Training of LLMs with Parameter Reallocation
Language: Python - Size: 8.75 MB - Last synced at: about 2 months ago - Pushed at: 5 months ago - Stars: 306 - Forks: 19

0xnu/multicollinearity_llm 📦
A multicollinearity-based compression program written in C that identifies and removes highly correlated weights in neural networks, thereby reducing redundancy.
Language: C - Size: 223 KB - Last synced at: 1 day ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 0

0xnu/tiny_llm_trainer 📦
The experiment implements a tiny language model trainer using PyTorch.
Language: Python - Size: 34.2 KB - Last synced at: 1 day ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

Bevinaa/Medical-Chatbot-Application
An End-to-End Medical Chatbot powered by generative AI, designed to provide accurate responses to medical queries. Built using Flask, Cohere’s Language Model, and Pinecone for Vector Storage.
Language: Jupyter Notebook - Size: 10.7 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

Ali1984sh/Trump-0.0-minus42B
Trump-0.0-minus42B is a unique workspace for building a language model based on Donald J. Trump's social media posts. This project invites collaboration and exploration of a lighthearted yet opinionated AI experience. 🐙🚀
Language: Python - Size: 637 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

imtanmay46/Legal-Assistance-LLM
Legal Assistance Bot based on LLM
Language: Jupyter Notebook - Size: 349 KB - Last synced at: about 1 month ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

kandarpa02/GPT-01
An LLM distillation repo using TensorFlow
Language: Python - Size: 12.7 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

bayjarvis/llm
Fine-tuning, DPO, RLHF, RLAIF on LLMs - Qwen3, Zephyr 7B GPTQ with 4-Bit Quantization, Mistral-7B-GPTQ
Language: Python - Size: 486 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 12 - Forks: 0

delveopers/Shredword
Fast & efficient BPE tokenizer written in C & Python for LLM training
Language: C++ - Size: 852 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

PardhuSreeRushiVarma20060119/OpenLoRA
"OpenLoRa" is designed to streamline and elevate the fine-tuning of large language models (LLMs) by transforming local environments into intelligent, self-adaptive LoRA (Low-Rank Adaptation) training engines — capable of learning from their own failures, optimizing training strategies, and delivering highly efficient LLMs to developers.
Size: 22.5 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

davidgeorgewilliams/JessicaRabbit-QLoRA-Axolotl
This comprehensive technical guide, developed at the request of OnlyFans founder, demonstrates advanced AI model fine-tuning methodologies to transform Qwen2-72b into a Jessica Rabbit personality emulation using cutting-edge QLoRA and ORPO techniques.
Size: 1.39 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

RenatoVassallo/FewShotX
NLP methods for text classification
Language: Jupyter Notebook - Size: 492 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

soniawmeyer/WanderChat
A Comparison of LLM Chat Bot Implementation Methods with Travel Use Case
Language: Jupyter Notebook - Size: 8.66 MB - Last synced at: 6 days ago - Pushed at: about 1 year ago - Stars: 3 - Forks: 1

Nileshsan/taxwisary-project
The Tax Advisory (TaxWisary) is a web application designed to provide users with expert tax consultancy and advisory services. It features an interactive interface that allows users to submit their contact details, explore various tax-related services, and find answers to frequently asked questions.
Language: JavaScript - Size: 2.24 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

azminewasi/Awesome-LLMs-ICLR-24
It is a comprehensive resource hub compiling all LLM papers accepted at the International Conference on Learning Representations (ICLR) in 2024.
Size: 821 KB - Last synced at: 7 days ago - Pushed at: over 1 year ago - Stars: 64 - Forks: 3

aman-17/MediSOAP
Fine-tuning LLMs on a conversational medical dataset.
Language: Jupyter Notebook - Size: 39.9 MB - Last synced at: 6 days ago - Pushed at: about 1 year ago - Stars: 4 - Forks: 0

jianzhnie/ScaleTorch
A PyTorch toolkit for large model training
Language: Python - Size: 784 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

ivangabriele/Trump-0.0-minus42B
A really dumb and opinionated LLM — exclusively trained on Donald J. Trump's social media posts.
Language: Python - Size: 1.09 MB - Last synced at: 9 days ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

puneetkakkar/Bitnet-1.58B
Bitnet 1.58b: This project implements the innovative 1-bit LLM architecture described in recent whitepapers, focusing on efficient training, inference, and open-source collaboration.
Language: Python - Size: 1020 KB - Last synced at: 6 days ago - Pushed at: about 1 year ago - Stars: 8 - Forks: 0

bartytime4life/OpenAI-to-Z-Challenge
LLM Driven Archeological Discovery Engine For South America
Language: Python - Size: 1.51 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

souvik0908/ecom
Language: Python - Size: 47.8 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

Asad-Shahab/sudokuLLM
LLM finetuning for Sudoku solving
Language: Python - Size: 242 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 4 - Forks: 0

longern/ReDuMix
Self-Reflective Dual-Context Mixture Decoding
Size: 10.7 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

SreeEswaran/Train-your-LLM
This repository contains code and resources for training, fine-tuning, and deploying large language models using Hugging Face's Transformers library.
Language: Python - Size: 33.2 KB - Last synced at: about 2 months ago - Pushed at: 9 months ago - Stars: 4 - Forks: 2

hkust-nlp/dart-math
[NeurIPS'24] Official code for *🎯DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving*
Language: Jupyter Notebook - Size: 4.18 MB - Last synced at: about 2 months ago - Pushed at: 9 months ago - Stars: 108 - Forks: 5

Jatin-Mehra119/Plagiarism-detector-using-smolLM-
A web app for detecting plagiarism between two PDFs. Users can upload PDF files, and the app will detect plagiarism by leveraging a fine-tuned LLM model (SmolLM2-135M) trained on the MIT Plagiarism Detection Dataset. 700+ Monthly Downloads on HuggingFace Model Repo.
Language: Jupyter Notebook - Size: 1.38 MB - Last synced at: about 2 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 2

bkraad47/binnr
An easy-to-use Snowflake-based text clustering or LLM tool/framework
Language: Python - Size: 137 KB - Last synced at: 6 days ago - Pushed at: 3 months ago - Stars: 0 - Forks: 1

kvignesh1420/cot-icl-lab
[ACL 2025] CoT-ICL Lab: A Synthetic Framework for Studying Chain-of-Thought Learning from In-Context Demonstrations
Language: Python - Size: 592 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 11 - Forks: 1

LipSync-Edusync/multispeaker-tts
Transfer Learning for Multispeaker TTS: Implementation of the NeurIPS 2018 paper "Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis" (Jia et al.). Synthesizes speech for both seen and unseen speakers using a pre-trained speaker encoder and Tacotron 2.
Language: Python - Size: 2.92 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 1

IamPrime/AI-Arena-Battle
Pit random LLMs against each other and vote for the best response
Language: Python - Size: 51.8 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

vishvaRam/Fine-Tune-Qwen2.5
This repository provides resources and instructions for fine-tuning the Qwen2.5-0.5B model. It includes scripts, tips, and best practices to adapt the model for specific tasks or domains. Designed for researchers and developers, it simplifies the fine-tuning process to achieve optimal performance and accuracy.
Language: Python - Size: 44.9 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

Skripkon/llm_trainer
🤖 Train and evaluate LLMs with ease and fun 🦾
Language: Python - Size: 2.07 MB - Last synced at: 6 days ago - Pushed at: 5 months ago - Stars: 11 - Forks: 1

SSahas/Implementing-GPT-From-Scratch
Building a decoder-only (GPT-style) LLM from scratch using PyTorch and training it for text generation.
Language: Python - Size: 350 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

theonewithlord/dynamic-agent-core
Dynamic Agent Core is a flexible Python framework designed for creating AI agents that learn and adapt through task memory. With its modular structure and SQLite integration, developers can easily build and customize intelligent workflows. 🐙🌟
Language: Python - Size: 109 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

Agents4Good/MasterChef-AI 📦
Visit: https://agents4good.github.io/MasterChef-AI/
Language: Jupyter Notebook - Size: 438 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

Tech-Parivartan/r2r-protocol
A world-first open, standardized protocol that lets autonomous robots exchange data, coordinate tasks, and collaborate in real-time environments in the age of AI.
Language: Python - Size: 4.24 MB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 2 - Forks: 1

SkkJodhpur/Gen-ai
This repository contains various projects and resources related to AI and Machine Learning, including fine-tuned models, implementations of LangChain, and other notable ML papers.
Language: Jupyter Notebook - Size: 325 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 2 - Forks: 2

ernie-research/Tool-Augmented-Reward-Model
[ICLR'24 spotlight] Tool-Augmented Reward Modeling
Language: Python - Size: 19.6 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 50 - Forks: 1

Shoukaku07/transformer-llm
A minimal implementation of a Transformer-based Language Model (LLM) designed for learning and experimentation.
Language: Python - Size: 1.13 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 2 - Forks: 0

hash7861/Stockscribe
Creating an AI assistant that summarizes public opinion and news around trending stocks each week
Size: 6.21 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

Samya-S/Building-LLMs-from-scratch
A hands-on guide to implementing Large Language Models from scratch
Language: Jupyter Notebook - Size: 47.6 MB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

manufactai/finetuning-cookbook
A collection of practical examples and tutorials for fine-tuning large language models using Factory. Includes Docker images, Jupyter notebooks, and utility scripts for easy model training and deployment.
Language: Jupyter Notebook - Size: 1.55 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 2 - Forks: 0

gregyjames/stocksentllm
Fine-tuning an LLM to predict stock sentiment based on headlines.
Language: Python - Size: 46.9 KB - Last synced at: 2 days ago - Pushed at: 11 months ago - Stars: 2 - Forks: 0

eooo-io/semblance-ai
A research and experimentation project focused on creating a 'semblance' of oneself.
Size: 2.02 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

neo4j-labs/text2cypher
collection of text2cypher datasets, evaluations, and finetuning instructions
Language: Jupyter Notebook - Size: 4.8 MB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 177 - Forks: 22

michaelbabsek/LLM
Language: Python - Size: 58.6 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

corl-team/lime
Official implementation of the paper "You Do Not Fully Utilize Transformer's Representation Capacity"
Language: Python - Size: 273 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 28 - Forks: 1

leo848/DEversAI
Source code for the Jugend forscht project "DEversAI: Training and Visualization of German-Localized, Directionally Complementary LLMs"
Language: Python - Size: 21.8 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 3 - Forks: 0

jpkeisala/HTML-to-Markdown-Converter
A Node.js application that crawls and converts HTML web pages to Markdown format that's optimized for context ingestion by LLMs
Language: TypeScript - Size: 40 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

BlankCode0/YLMSR_implementation
Implementation of the research paper "DPO: Your Language Model is Secretly a Reward Model"
Language: Python - Size: 45.9 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

mantzaris/BenchmarkDataNLP.jl
Generate synthetic text using a variety of methods, e.g. Context Free Grammars (CFGs), with parameterized complexity to test your NLP methods (like LLMs)
Language: Julia - Size: 1.4 MB - Last synced at: 7 days ago - Pushed at: 3 months ago - Stars: 0 - Forks: 1

MoonshotAI/MoBA
MoBA: Mixture of Block Attention for Long-Context LLMs
Language: Python - Size: 2.4 MB - Last synced at: 3 months ago - Pushed at: 5 months ago - Stars: 1,779 - Forks: 106

ronniross/llm-confidence-scorer
A set of auxiliary systems designed to provide a measure of estimated confidence for the outputs generated by Large Language Models.
Language: Python - Size: 143 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 2 - Forks: 0

arcee-ai/arcee-python
The Arcee client for executing domain-adapted language model routines https://pypi.org/project/arcee-py/
Language: Python - Size: 257 KB - Last synced at: about 2 months ago - Pushed at: 11 months ago - Stars: 27 - Forks: 4

QuwsarOhi/SmolThink
Small AI that can think
Language: Python - Size: 36.5 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

discus-labs/discus
A data-centric AI package for ML/AI. Get the best high-quality data for the best results. Discord: https://discord.gg/t6ADqBKrdZ
Language: Python - Size: 2.38 MB - Last synced at: 7 days ago - Pushed at: almost 2 years ago - Stars: 63 - Forks: 7

SamKa1u/Virtual-Assistant
An ongoing virtual assistant project aimed at achieving agency in a variety of use cases
Language: Python - Size: 155 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0
