GitHub topics: small-language-models
bahree/GenAIBook
"Generative AI in Action" book's code repository
Language: Python - Size: 208 MB - Last synced at: about 5 hours ago - Pushed at: 8 months ago - Stars: 96 - Forks: 51

cmccomb/yo
AI on your machine, intelligence in your terminal
Language: Shell - Size: 318 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 1 - Forks: 0

kev-nat/VeriFlow
Autonomous agents for supply chain event managements
Language: Jupyter Notebook - Size: 450 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

webis-de/small-text
Active Learning for Text Classification in Python
Language: Python - Size: 3.08 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 619 - Forks: 70

TheQuantScientist/TextToSQL
SLMs for domain-related Text-to-SQL tasks
Language: Python - Size: 49.8 MB - Last synced at: 3 days ago - Pushed at: 7 days ago - Stars: 2 - Forks: 0

FairyFali/SLMs-Survey
Survey of Small Language Models from Penn State, ...
Size: 2.65 MB - Last synced at: 7 days ago - Pushed at: 8 days ago - Stars: 185 - Forks: 15

d1pankarmedhi/smallLM
🧱 A small, GPT like Language Model
Language: Python - Size: 166 KB - Last synced at: 8 days ago - Pushed at: 9 days ago - Stars: 0 - Forks: 0

SmallDoges/small-doge
Doge Family of Small Language Model
Language: Python - Size: 12.8 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 152 - Forks: 13

OpenVanguard/remma-o1
Remma-O1: An open-source Language Model with 1.17B Params, built on pytorch from scratch. Work in Progress!!! Open for collaboration.
Language: Python - Size: 8.29 MB - Last synced at: 20 days ago - Pushed at: 20 days ago - Stars: 34 - Forks: 1

MaxLSB/LeCarnet
LeCarnet is a 2 M+ corpus of simple French stories, featuring end‑to‑end data generation, evaluation and training pipelines for small language models
Language: Python - Size: 6.71 MB - Last synced at: 28 days ago - Pushed at: 28 days ago - Stars: 7 - Forks: 0

Mehrdadghassabi/Gaokerena-V
Leveraging Online Data to Enhance Medical Knowledge in a Small Persian Language Model
Language: Jupyter Notebook - Size: 187 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

shubham0204/SmolChat-Android
Running any GGUF SLMs/LLMs locally, on-device in Android
Language: Kotlin - Size: 24.6 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 385 - Forks: 49

d0tTino/DeepThought-ReThought
A refactored version of the DeepThought Discord bot, focusing on improved architecture, performance, and AI agent capabilities.
Language: Python - Size: 19.8 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 1 - Forks: 1

virtualramblas/gromacs_smolagent
An HF's Smolagent to automate molecular dynamics simulations using GROMACS.
Language: Python - Size: 148 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 3 - Forks: 0

ChaitanyaK77/Building-a-Small-Language-Model-SLM-
This Repository provides a Jupyter Notebook for building a small language model from scratch using 'TinyStories' dataset. Covers data preprocessing, BPE tokenization, binary storage, GPU memory management, and training a Transformer in PyTorch. Generate sample stories to test your model. Ideal for learning NLP and PyTorch.
Language: Jupyter Notebook - Size: 5.13 MB - Last synced at: 15 days ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 1

adamelliotfields/gradio-2b-chat
Chat with small language models under 2b
Language: Python - Size: 21.5 KB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

nicolay-r/distill-d2n-long Fork of Xiaoxiao-Liu/distill-d2n
Rationale-based Distillation fine-tuning framwork for AutoModelCasualLM for TextSummarization fine-tuning
Language: Python - Size: 3.23 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

db-agent/db-agent
SQL AI Agent - Talk to your DB in Natural Language
Language: Python - Size: 6.38 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 9 - Forks: 14

nuni-neomu-areumdawo/Diffusion-Language-Model
Implementation of a LLaDA-inspired Masked Diffusion Model for Text using PURE BYTE-LEVEL TOKENIZATION (cuz why not) and Mixed Precision Training for speed.
Language: Python - Size: 85.9 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

ethicalabs-ai/kurtis
Kurtis is a fine-tuning, inference and evaluation tool built for SLMs (Small Language Models), such as Huggingface's SmolLM2.
Language: Python - Size: 11.8 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 4 - Forks: 1

bicycleman15/skim
[KDD 2025] Code for the paper "On the Necessity of World Knowledge for Mitigating Missing Labels in Extreme Classification"
Language: Python - Size: 43.9 KB - Last synced at: 25 days ago - Pushed at: 10 months ago - Stars: 2 - Forks: 0

ethicalabs-ai/Kurtis-E1-MLX-Voice-Agent
A lightweight voice companion, optimized for macOS.
Language: Python - Size: 197 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 7 - Forks: 1

lennart-finke/simple_stories_generate Fork of doomdagadiggiedahdah/SimpleStories
Dataset Generation Code for SimpleStories
Language: Jupyter Notebook - Size: 1.17 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 6 - Forks: 1

bunyaminergen/TAS
The TAS (Teacher-Assistant-Student) repository contains code demonstrating how "knowledge" acquired by a large-scale, open or closed-source language model (LLM) can be transferred to a relatively smaller student model through an intermediate assistant model.
Language: Python - Size: 123 KB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 0 - Forks: 1

nitya/model-mondays Fork of microsoft/model-mondays
Model Mondays is a weekly livestreamed series on Microsoft Reactor that helps you make informed model choice decisions with timely updates and model deep-dives. Watch live for the content. Join Discord for the discussions.
Language: Jupyter Notebook - Size: 6.73 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

Nitin-Sagar-B/RaTiO-CoRE
A modular multi-model AI framework demonstrating advanced techniques in semantic knowledge transfer, context management, and collaborative intelligence across diverse language models.
Language: Python - Size: 136 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

eyewheel/imgrep
full text search your meme/screenshot folder with small LMs x traditional OCR
Language: Python - Size: 27.3 KB - Last synced at: about 1 month ago - Pushed at: 8 months ago - Stars: 3 - Forks: 0

UEFI-code/miniGPT
An open-source project to show how to build a mini language model using PyTorch
Language: Python - Size: 15.3 MB - Last synced at: 2 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

imane0x/PerfectFit
PerfectFit is an AI-powered shopping assistant that uses multimodal search to quickly find ideal product matches based on text or image inputs, streamlining the online shopping experience.
Language: JavaScript - Size: 12.6 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

Beekee-learn/ai-for-education
Learning By Doing projects as part of AI-for-Education.org
Language: Jupyter Notebook - Size: 38.5 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

Phihao2k3/NanoSage
Local LLM Powered Recursive Search & Smart Knowledge Explorer
Size: 1000 Bytes - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

Ne0bliviscaris/Ollama-SQLite-RAG
Ollama RAG using SQL Database
Language: Python - Size: 1.86 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 9 - Forks: 0

LoserCheems/WonderfulMatrices
Wonderful Matrices to Build Small Language Models
Language: Python - Size: 8.78 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 43 - Forks: 0

ProtoFaze/chatbot
Language: Python - Size: 230 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

lgalke/easy2deeplearn
Code for the paper "Deep neural networks and humans both benefit from compositional structure"
Language: Python - Size: 277 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 3 - Forks: 0

sitamgithub-MSIT/readerlm-litserve
Leverage Reader-LM's capabilities using LitServe.
Language: Python - Size: 213 KB - Last synced at: about 2 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

SkywardAI/shibuya
A project built Electron + React.js, to dig out the potential of cross platform AI completion.
Language: JavaScript - Size: 1.67 MB - Last synced at: 3 months ago - Pushed at: 9 months ago - Stars: 2 - Forks: 1

rb81/convollama
A simple Python application that facilitates AI-driven conversations using Ollama.
Language: Python - Size: 4.09 MB - Last synced at: 1 day ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

Vibhuarvind/Content-Engine-RAG-for-PDF
Content Engine is RAG system that analyzes and compares multiple PDF documents, specifically identifying and highlighting their differences. The system will utilize Retrieval Augmented Generation (RAG) techniques to effectively retrieve, assess, and generate insights from the documents.
Language: Jupyter Notebook - Size: 3.92 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

quamernasim/Conversational-AI-System-using-Phi-2-PGVector-and-Llama-Index
Build a Conversational AI System that can answer questions by retrieving the answers from a document.
Language: Jupyter Notebook - Size: 1.48 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 0
