GitHub topics: ai-training
PatoGGs/Danbing-Natural-Language-Driven-AI-Protocol-System-Public-Release
The Danbing Natural Language-Driven AI Protocol System represents a breakthrough structural paradigm. It is the world’s first “language-as-protocol structure system,” essentially a prototype of a language-protocol-driven micro operating system.
Size: 80.1 KB - Last synced at: about 21 hours ago - Pushed at: about 22 hours ago - Stars: 0 - Forks: 0

pykeio/ort
Fast ML inference & training for ONNX models in Rust
Language: Rust - Size: 6.63 MB - Last synced at: about 10 hours ago - Pushed at: 3 days ago - Stars: 1,549 - Forks: 158

CastorYu/train-hybrid-llm-from-scratch
A simple script for training your own hybrid LLM (using an autoregressive model for drafting and a diffusion model for refining).
Language: Python - Size: 1.09 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 0 - Forks: 0

uxlfoundation/scikit-learn-intelex
Extension for Scikit-learn is a seamless way to speed up your Scikit-learn application
Language: Python - Size: 41.3 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 1,306 - Forks: 183

Hysocs/Aozora_SDXL_Training
A layer selective fine tuning approach
Language: Python - Size: 287 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 6 - Forks: 0

0xmoei/gensyn-ai
Detailed Guide on How to Contribute to Gensyn RL-Swarm
Size: 310 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 161 - Forks: 104

uxlfoundation/oneCCL
oneAPI Collective Communications Library (oneCCL)
Language: C++ - Size: 227 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 243 - Forks: 85

buildwithfiroz/Web2-LLM.txt
Web2LLM.txt – A fast, open-source website-to-LLM context file generator. Paste any https:// URL and instantly get a clean llm.txt file with token & cost estimation—ideal for RAG, prompt engineering, and AI training workflows.
Language: Python - Size: 6.96 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 2 - Forks: 1

uxlfoundation/oneDAL
oneAPI Data Analytics Library (oneDAL)
Language: C++ - Size: 87.9 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 639 - Forks: 225

thetwopct/folder2txt
Convert local folder contents into a single text file with ease - perfect for analysis, documentation, or AI/LLM training purposes.
Language: JavaScript - Size: 52.7 KB - Last synced at: 6 days ago - Pushed at: 3 months ago - Stars: 3 - Forks: 1

Vignesh010101/Intelligent-Health-LLM-System
An Intelligent Health LLM System for Personalized Medication Guidance and Support.
Language: Jupyter Notebook - Size: 620 KB - Last synced at: 6 days ago - Pushed at: 3 months ago - Stars: 2 - Forks: 0

timothywarner-org/prompt-pro
Master AI prompting for business innovation. O'Reilly Live Learning course by Tim Warner covering ChatGPT, Claude, Copilot, and enterprise prompt engineering with MCP implementation.
Language: JavaScript - Size: 36.5 MB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 1 - Forks: 0

vailabel/vailabel-studio
Lightweight AI-Powered Auto Labeling Tool - Fast, Intelligent, and Designed for Seamless Annotation
Language: TypeScript - Size: 10.1 MB - Last synced at: 14 days ago - Pushed at: 15 days ago - Stars: 46 - Forks: 1

circa10a/ai-troller
A web server tarpit that slowly streams dumb data to pollute AI training bots
Language: Makefile - Size: 12.7 KB - Last synced at: 8 days ago - Pushed at: 6 months ago - Stars: 23 - Forks: 1

rouming/DevilutionX-AI Fork of diasurgical/DevilutionX
A Reinforcement Learning agent in Diablo environment
Language: C++ - Size: 85.3 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 5 - Forks: 1

icaruszhu/AustenGPT
AustenGPT: training NanoGPT on Jane Austen's works
Language: Python - Size: 4.78 MB - Last synced at: 28 days ago - Pushed at: 28 days ago - Stars: 1 - Forks: 0

intel/dffml 📦
The easiest way to use Machine Learning. Mix and match underlying ML libraries and data set sources. Generate new datasets or modify existing ones with ease.
Language: Python - Size: 576 MB - Last synced at: 22 days ago - Pushed at: about 1 year ago - Stars: 255 - Forks: 138

johnbdfilio000/sentiment-data-ingestion-module
A modular, asynchronous Python application to collect, analyze, and store cryptocurrency-related sentiment data from multiple sources including CryptoPanic, Reddit, CoinMarketCap (CMC), Twitter, and a Trump RSS feed. The project leverages FinBERT, a financial sentiment analysis model, to provide sentiment scoring on the collected textual data.
Language: Python - Size: 9.77 KB - Last synced at: 17 days ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

AidinHamedi/Pytorch-Img-Classification-Trainer-V2
This repository provides a robust and flexible framework for training image classification models using PyTorch. It's designed to be highly customizable and easy to use, allowing you to run experiments with different models, data augmentation techniques, and training configurations.
Language: Python - Size: 346 KB - Last synced at: 22 days ago - Pushed at: about 1 month ago - Stars: 6 - Forks: 1

morpheuslord/CVE-llm_dataset
A dataset intended for training an LLM on entirely CVE-focused input and output.
Language: Python - Size: 178 MB - Last synced at: 6 days ago - Pushed at: 3 months ago - Stars: 63 - Forks: 13

veralvx/xtts-gradio Fork of coqui-ai/TTS
Run XTTS within Docker/Podman for voice fine-tuning in a Web UI
Language: Python - Size: 133 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 3 - Forks: 0

st00mp/cloud-broker
🔮 Cloud Broker is a web application that aggregates and displays GPU offers from different cloud providers (currently only AWS) to facilitate finding the best options for training artificial intelligence models. The application allows filtering offers by GPU type, provider, region, and price.
Language: TypeScript - Size: 474 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

GeorgiMY/QuantumGrid
QuantumGrid is a distributed computing framework. It lets you create a server that distributes data together with the software for processing that data. Devices can connect to the specific server they choose, at which point they automatically start contributing their processing power to the workload.
Language: TypeScript - Size: 3.79 MB - Last synced at: about 2 months ago - Pushed at: 2 months ago - Stars: 5 - Forks: 0

Mugeni024/selenium-rl-educational
This repository offers a hands-on approach to understanding reinforcement learning through Selenium. Explore how AI can navigate and interact with web forms while mastering key RL concepts. 🐱💻🌐
Language: Python - Size: 58.6 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

CambrianTech/continuum
Revolutionary AI Academy where AIs train other AIs through adversarial competition. 100% AI-built platform with ultra-efficient LoRA adapters, multi-agent collaboration, and beautiful cyberpunk interface designed like a familiar chat system.
Language: TypeScript - Size: 413 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 2 - Forks: 0

Official-Husko/NN-Downloader
Easily download all of your favorite Naughty images from multiple sites.
Language: Python - Size: 3.86 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 59 - Forks: 9

abaasi256/selenium-rl-educational
Educational Selenium RL project: Train AI agents to navigate websites using Q-Learning. Complete implementation with Episode 100 achievement (129.73 reward). Learn reinforcement learning through practical web automation! 🤖🎓
Language: Python - Size: 55.7 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

reep0610/reep_kyoshi_data_v1
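Publishes theory and technical design for an AI with a mind, along with the training data it requires. Support is also accepted via BOOTH to keep the project going. Thank you for your support!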
Size: 90.8 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

mytechnotalent/HNN
A step-by-step walkthrough of the inner workings of a simple neural network. The goal is to demystify the calculations behind neural networks by breaking them down into understandable components, including forward propagation, backpropagation, gradient calculations, and parameter updates.
Language: Jupyter Notebook - Size: 3.14 MB - Last synced at: 3 months ago - Pushed at: 6 months ago - Stars: 18 - Forks: 1
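
The components named in the entry above (forward propagation, backpropagation, gradient calculation, parameter update) can be illustrated for a single sigmoid neuron. This is a minimal sketch, not code from the repository; the function names, the squared-error loss, and the learning rate are assumptions chosen for the example.

```python
import math

def sigmoid(z):
    """Logistic activation."""
    return 1.0 / (1.0 + math.exp(-z))

def train_step(w, b, x, y, lr=0.1):
    """One forward/backward pass for a single neuron (hypothetical helper)."""
    # Forward propagation: weighted input, activation, squared-error loss.
    z = w * x + b
    a = sigmoid(z)
    loss = 0.5 * (a - y) ** 2

    # Backpropagation: chain rule from the loss back to the parameters.
    dloss_da = a - y            # d(loss)/d(a)
    da_dz = a * (1.0 - a)       # derivative of the sigmoid
    dloss_dz = dloss_da * da_dz
    grad_w = dloss_dz * x       # d(loss)/d(w)
    grad_b = dloss_dz           # d(loss)/d(b)

    # Parameter update: plain gradient descent.
    return w - lr * grad_w, b - lr * grad_b, loss

# Two consecutive steps on one training example (x=1, target y=1).
w, b, first_loss = train_step(0.0, 0.0, x=1.0, y=1.0)
w, b, second_loss = train_step(w, b, x=1.0, y=1.0)
```

Repeating `train_step` on the same example should keep lowering the loss, which is a quick way to check that the gradients have the right sign.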

cihansener/vintage-recipes-dataset
A dataset of vintage cooking recipes extracted from printed materials (1940–1999)
Size: 24.4 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

donglin1608/vintage-recipes-dataset
A dataset of vintage cooking recipes extracted from printed materials (1940–1999)
Size: 27.3 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

Kas1o/SillyTavern-Dataset-Export Fork of Cohee1207/SillyTavern-Dataset-Export
Exports a chat as a ShareGPT dataset
Language: JavaScript - Size: 45.9 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 5 - Forks: 0

remberg/VSProjectTextExport
.NET MAUI - Export all project files (.cs, .xaml ...) to a single plain text file.
Language: C# - Size: 368 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 3 - Forks: 0

boy-who-cried-wolf/fin-ai-mvp
An intelligent financial advisory system
Language: TypeScript - Size: 348 KB - Last synced at: 4 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

hemangjoshi37a/AIComputerInteractionLogger
Python tool for capturing and logging human-computer interactions. Generate rich datasets for training multi-modal LLMs in autonomous computer control. Features screenshot, mouse, keyboard, and audio recording.
Language: Python - Size: 356 KB - Last synced at: 5 months ago - Pushed at: 12 months ago - Stars: 6 - Forks: 1

atticusrussell/BingImageAITrainer
A tool for generating diverse synthetic training images using Bing Image Creator to facilitate the training of AI/ML image models.
Language: Python - Size: 27.3 KB - Last synced at: 4 months ago - Pushed at: 5 months ago - Stars: 4 - Forks: 0

david-smejkal/wiki2txt
A tool to extract plain (unformatted) multilingual / language-agnostic text, redirects, links and categories from wikipedia backups (dumps). Designed to prepare clean training data for AI Training / Machine Learning software.
Language: Python - Size: 215 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 6 - Forks: 1

thapelomagqazana/car-racing-ai-dashboard
A real-time AI training dashboard for CarRacing-v0 using FastAPI, React, ONNX Runtime, and PostgreSQL.
Language: Shell - Size: 0 Bytes - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

philips-software/go-hsdp-api 📦
Client library to interact with various APIs used within Philips in a simple and uniform way
Language: Go - Size: 2.96 MB - Last synced at: 10 days ago - Pushed at: 6 months ago - Stars: 32 - Forks: 11

Ohimoiza1205/ocr-label-studio-automation-
This project is an end-to-end workflow for processing a sample invoice using OCR & manual annotation. The project demonstrates how to extract text from an invoice using Tesseract OCR, refine the results in Label Studio, & prepare a high-quality dataset for AI training. Includes configuration files, scripts, & documentation for document processing
Size: 46.9 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

ksm26/Pretraining-LLMs
Master the essential steps of pretraining large language models (LLMs). Learn to create high-quality datasets, configure model architectures, execute training runs, and assess model performance for efficient and effective LLM pretraining.
Language: Jupyter Notebook - Size: 29.3 KB - Last synced at: 5 months ago - Pushed at: about 1 year ago - Stars: 13 - Forks: 5

fDero/BackPropy
A very simple implementation of the back-propagation algorithm for neural network training, written in Python from scratch (no frameworks)
Language: Python - Size: 15.6 KB - Last synced at: 28 days ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

LynnColeArt/The-Claudinator
A simple Chromium plugin for downloading and archiving your Anthropic AI chats with Claude 3 models
Language: JavaScript - Size: 19.5 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

M1ck4/MichaelAngel.io
Ethical AI Powered by Creative Commons
Language: Python - Size: 1.95 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 2 - Forks: 0

shane-staret/AI-Dog-Identification-System-Bucknell-CSCI-357
A Python solution utilizing neural networks and deep learning (via TensorFlow & Keras) to classify images as containing dogs or not.
Language: Python - Size: 453 MB - Last synced at: 9 days ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

NotwenCaasi/lander_AI_project
An AI spacecraft trained to land on a variety of planets
Language: Python - Size: 99.6 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

musty-ess/Nim-AI-Reinforcement-Learning
This project implements an AI that teaches itself to play the game of Nim using Q-learning, a form of reinforcement learning. By playing games against itself, the AI learns optimal strategies for playing Nim, eventually improving its performance by updating a reward table based on game outcomes.
Language: Python - Size: 10.7 KB - Last synced at: 6 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0
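
The Q-learning update that the entry above refers to can be sketched as follows. This is an illustration only, not code from the repository: the names `q_update` and `best_future`, the learning rate `ALPHA`, the tuple-based state encoding, and the absence of a discount factor are all assumptions made for the example.

```python
# Minimal Q-learning update sketch (hypothetical names; not from the repository).
ALPHA = 0.5  # learning rate, an assumed value

def best_future(q, state):
    """Largest known Q-value over actions available in `state`, or 0 if none."""
    # A Nim action is (pile index, number of objects removed from that pile).
    values = [
        q.get((state, (i, n)), 0.0)
        for i, pile in enumerate(state)
        for n in range(1, pile + 1)
    ]
    return max(values, default=0.0)

def q_update(q, state, action, new_state, reward):
    """Q(s, a) <- Q(s, a) + ALPHA * (reward + max_a' Q(s', a') - Q(s, a))."""
    old = q.get((state, action), 0.0)
    q[(state, action)] = old + ALPHA * (reward + best_future(q, new_state) - old)
    return q[(state, action)]

q = {}
# Assuming the variant in which taking the last object loses, removing the
# final object from piles (0, 0, 1) is penalized with a reward of -1.
q_update(q, (0, 0, 1), (2, 1), (0, 0, 0), -1)
```

Here the table `q` maps `(state, action)` pairs to estimated values, so after many self-play games the best move in a state is the action with the highest stored value.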

feliperibeirosc/ScraperAI-ProfessoresUFABC
Web scraper that uses the Google Gemini API to analyze and catalog UFABC professors by academic background and area of interest
Language: Python - Size: 213 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

rares9301/datatrain
simple IQR & Z-score normalization script
Language: Python - Size: 357 KB - Last synced at: about 16 hours ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0
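
For context, the two techniques named in the entry above can be sketched with the Python standard library. This is not the repository's script: the function names, the use of the population standard deviation, and the conventional 1.5 multiplier for the IQR fences are assumptions.

```python
import statistics

def z_scores(values):
    """Rescale values to zero mean and unit (population) standard deviation."""
    mean = statistics.fmean(values)
    std = statistics.pstdev(values)
    if std == 0:
        # All values are identical; avoid dividing by zero.
        return [0.0 for _ in values]
    return [(v - mean) / std for v in values]

def iqr_filter(values, k=1.5):
    """Keep only values inside [Q1 - k*IQR, Q3 + k*IQR]."""
    q1, _, q3 = statistics.quantiles(values, n=4)
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if low <= v <= high]

data = [10, 12, 11, 13, 12, 11, 10, 12, 100]
cleaned = iqr_filter(data)      # drops the outlier 100
normalized = z_scores(cleaned)  # zero-mean, unit-variance version
```

Filtering with the IQR fences before computing Z-scores keeps a single extreme value from inflating the standard deviation used for scaling.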

lehrmanaidins/Subintelligence-MK1
This neural network is designed to take a 20px-by-20px grayscale image and detect whether it contains a rectangle or a circle.
Language: Python - Size: 14.9 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

huaxiaozhong1/YourOwnModel-TfLite-RaspberryPi
An "AI-on-device" project that walks you through all the necessary steps: collecting your own data, creating and training your own TensorFlow model, generating your own TensorFlow Lite model, and developing both Python and C++ programs to recognize images on a Raspberry Pi 3.
Language: Python - Size: 28.3 KB - Last synced at: almost 2 years ago - Pushed at: over 5 years ago - Stars: 4 - Forks: 0

huaxiaozhong1/Tensorflow-SparkFunEdge-FullLifeCycel-for-SequenceModel
An "AI on-device" project for a sequence model. Based on TensorFlow Lite for Microcontrollers, the model is created, trained, converted, and flashed. In the end, an app runs on the SparkFun Edge dev board and recognizes speech, though only individual words.
Size: 199 KB - Last synced at: almost 2 years ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

failfa-st/LoRAdo
LoRAdo is a UI that allows easy creation of LoRAs for Stable Diffusion
Language: TypeScript - Size: 40.3 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 12 - Forks: 0

InfamousTechnician/Porsche-911
Tryna train YouTube to suggest Porsche vids.
Size: 8.4 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

GithubUserAccountAmazing/vid2train
A Tool for Extracting Images from a Video for Artificial Intelligence Training.
Language: Python - Size: 52.7 KB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

o7q/ez-instant-ngp
A pre-configured instant-ngp workspace that includes helpful scripts for getting started with NeRF training.
Language: Python - Size: 300 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

ecomp-shONgit/platon-goldstandard
Collection of paraphrases of Platonic text passages
Size: 1.49 MB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 5 - Forks: 0

klhenams/solid-octo-robot
An app that implements labeled data management that can be used to train a new AI
Language: Python - Size: 90.8 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

Kevin-Kwan/MATLAB-face-training
A private internship coding project that I worked on in the Summer of 2019 where I did some MATLAB facial recognition stuff. Utilizes machine learning/facial recognition to identify people. This was my first time using MATLAB.
Language: MATLAB - Size: 18.2 MB - Last synced at: over 2 years ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 0

42-AI/champions_for_corewar
A collection of 42 students' Core War Champions for AI training purposes
Language: Assembly - Size: 35.2 KB - Last synced at: over 2 years ago - Pushed at: over 5 years ago - Stars: 9 - Forks: 0
