An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: ai-training

PatoGGs/Danbing-Natural-Language-Driven-AI-Protocol-System-Public-Release

The Danbing Natural Language-Driven AI Protocol System represents a breakthrough structural paradigm. It is the world’s first “language-as-protocol structure system,” essentially a prototype of a language-protocol-driven micro operating system.

Size: 80.1 KB - Last synced at: about 21 hours ago - Pushed at: about 22 hours ago - Stars: 0 - Forks: 0

pykeio/ort

Fast ML inference & training for ONNX models in Rust

Language: Rust - Size: 6.63 MB - Last synced at: about 10 hours ago - Pushed at: 3 days ago - Stars: 1,549 - Forks: 158

CastorYu/train-hybrid-llm-from-scratch

A simplistic script for training your own hybrid llm (using autoregressive model for drafting and diffusion model for refining).

Language: Python - Size: 1.09 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 0 - Forks: 0

uxlfoundation/scikit-learn-intelex

Extension for Scikit-learn is a seamless way to speed up your Scikit-learn application

Language: Python - Size: 41.3 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 1,306 - Forks: 183

Hysocs/Aozora_SDXL_Training

A layer selective fine tuning approach

Language: Python - Size: 287 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 6 - Forks: 0

0xmoei/gensyn-ai

Detailed Guide on How to Contribute to Gensyn RL-Swarm

Size: 310 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 161 - Forks: 104

uxlfoundation/oneCCL

oneAPI Collective Communications Library (oneCCL)

Language: C++ - Size: 227 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 243 - Forks: 85

buildwithfiroz/Web2-LLM.txt

Web2LLM.txt – A fast, open-source website-to-LLM context file generator. Paste any https:// URL and instantly get a clean llm.txt file with token & cost estimation—ideal for RAG, prompt engineering, and AI training workflows.

Language: Python - Size: 6.96 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 2 - Forks: 1

uxlfoundation/oneDAL

oneAPI Data Analytics Library (oneDAL)

Language: C++ - Size: 87.9 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 639 - Forks: 225

thetwopct/folder2txt

Convert local folder contents into a single text file with ease - perfect for analysis, documentation, or AI/LLM training purposes.

Language: JavaScript - Size: 52.7 KB - Last synced at: 6 days ago - Pushed at: 3 months ago - Stars: 3 - Forks: 1

Vignesh010101/Intelligent-Health-LLM-System

An Intelligent Health LLM System for Personalized Medication Guidance and Support.

Language: Jupyter Notebook - Size: 620 KB - Last synced at: 6 days ago - Pushed at: 3 months ago - Stars: 2 - Forks: 0

timothywarner-org/prompt-pro

Master AI prompting for business innovation. O'Reilly Live Learning course by Tim Warner covering ChatGPT, Claude, Copilot, and enterprise prompt engineering with MCP implementation.

Language: JavaScript - Size: 36.5 MB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 1 - Forks: 0

vailabel/vailabel-studio

Lightweight AI-Powered Auto Labeling Tool - Fast, Intelligent, and Designed for Seamless Annotation

Language: TypeScript - Size: 10.1 MB - Last synced at: 14 days ago - Pushed at: 15 days ago - Stars: 46 - Forks: 1

circa10a/ai-troller

A web server tarpit that slowly streams dumb data to pollute AI training bots

Language: Makefile - Size: 12.7 KB - Last synced at: 8 days ago - Pushed at: 6 months ago - Stars: 23 - Forks: 1

rouming/DevilutionX-AI Fork of diasurgical/DevilutionX

A Reinforcement Learning agent in Diablo environment

Language: C++ - Size: 85.3 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 5 - Forks: 1

icaruszhu/AustenGPT

AustenGPT: training Jane Austen's works with NanoGPT

Language: Python - Size: 4.78 MB - Last synced at: 28 days ago - Pushed at: 28 days ago - Stars: 1 - Forks: 0

intel/dffml 📦

The easiest way to use Machine Learning. Mix and match underlying ML libraries and data set sources. Generate new datasets or modify existing ones with ease.

Language: Python - Size: 576 MB - Last synced at: 22 days ago - Pushed at: about 1 year ago - Stars: 255 - Forks: 138

johnbdfilio000/sentiment-data-ingestion-module

A modular, asynchronous Python application to collect, analyze, and store cryptocurrency-related sentiment data from multiple sources including CryptoPanic, Reddit, CoinMarketCap (CMC), Twitter, and a Trump RSS feed. The project leverages FinBERT, a financial sentiment analysis model, to provide sentiment scoring on the collected textual data.

Language: Python - Size: 9.77 KB - Last synced at: 17 days ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

AidinHamedi/Pytorch-Img-Classification-Trainer-V2

This repository provides a robust and flexible framework for training image classification models using PyTorch. It's designed to be highly customizable and easy to use, allowing you to run experiments with different models, data augmentation techniques, and training configurations.

Language: Python - Size: 346 KB - Last synced at: 22 days ago - Pushed at: about 1 month ago - Stars: 6 - Forks: 1

morpheuslord/CVE-llm_dataset

This is a dataset intended to train a LLM model for a completely CVE focused input and output.

Language: Python - Size: 178 MB - Last synced at: 6 days ago - Pushed at: 3 months ago - Stars: 63 - Forks: 13

veralvx/xtts-gradio Fork of coqui-ai/TTS

Run XTTS within Docker/Podman for voice fine-tuning in a Web UI

Language: Python - Size: 133 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 3 - Forks: 0

st00mp/cloud-broker

🔮 Cloud Broker is a web application that aggregates and displays GPU offers from different cloud providers (currently only AWS) to facilitate finding the best options for training artificial intelligence models. The application allows filtering offers by GPU type, provider, region, and price.

Language: TypeScript - Size: 474 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

GeorgiMY/QuantumGrid

QuantumGrid is a Distributed Computing Framework. QuantumGrid's software lets you create a server which distributes data and the software for processing that data. QuantumGrid's software also lets devices connect to the specific server they want to connect with which automatically starts using their processing power to contribute to processing data

Language: TypeScript - Size: 3.79 MB - Last synced at: about 2 months ago - Pushed at: 2 months ago - Stars: 5 - Forks: 0

Mugeni024/selenium-rl-educational

This repository offers a hands-on approach to understanding reinforcement learning through Selenium. Explore how AI can navigate and interact with web forms while mastering key RL concepts. 🐱💻🌐

Language: Python - Size: 58.6 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

CambrianTech/continuum

Revolutionary AI Academy where AIs train other AIs through adversarial competition. 100% AI-built platform with ultra-efficient LoRA adapters, multi-agent collaboration, and beautiful cyberpunk interface designed like a familiar chat system.

Language: TypeScript - Size: 413 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 2 - Forks: 0

Official-Husko/NN-Downloader

Easily download all of your favorite Naughty images from multiple sites.

Language: Python - Size: 3.86 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 59 - Forks: 9

abaasi256/selenium-rl-educational

Educational Selenium RL project: Train AI agents to navigate websites using Q-Learning. Complete implementation with Episode 100 achievement (129.73 reward). Learn reinforcement learning through practical web automation! 🤖🎓

Language: Python - Size: 55.7 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

reep0610/reep_kyoshi_data_v1

心を持つAIに関する理論と技術的な設計、そのために必要な教師データを公開しています。活動継続のためにBOOTHで支援も受付中です。応援よろしくお願いします!

Size: 90.8 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

mytechnotalent/HNN

A step-by-step walkthrough of the inner workings of a simple neural network. The goal is to demystify the calculations behind neural networks by breaking them down into understandable components, including forward propagation, backpropagation, gradient calculations, and parameter updates.

Language: Jupyter Notebook - Size: 3.14 MB - Last synced at: 3 months ago - Pushed at: 6 months ago - Stars: 18 - Forks: 1

cihansener/vintage-recipes-dataset

A dataset of vintage cooking recipes extracted from printed materials (1940–1999)

Size: 24.4 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

donglin1608/vintage-recipes-dataset

A dataset of vintage cooking recipes extracted from printed materials (1940–1999)

Size: 27.3 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

Kas1o/SillyTavern-Dataset-Export Fork of Cohee1207/SillyTavern-Dataset-Export

Exports a chat as a ShareGPT dataset

Language: JavaScript - Size: 45.9 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 5 - Forks: 0

remberg/VSProjectTextExport

.Net Maui - Export all project files (.cs, .xaml ...) an single plain text file.

Language: C# - Size: 368 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 3 - Forks: 0

boy-who-cried-wolf/fin-ai-mvp

An intelligent financial advisory system

Language: TypeScript - Size: 348 KB - Last synced at: 4 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

hemangjoshi37a/AIComputerInteractionLogger

Python tool for capturing and logging human-computer interactions. Generate rich datasets for training multi-modal LLMs in autonomous computer control. Features screenshot, mouse, keyboard, and audio recording.

Language: Python - Size: 356 KB - Last synced at: 5 months ago - Pushed at: 12 months ago - Stars: 6 - Forks: 1

atticusrussell/BingImageAITrainer

A tool for generating diverse synthetic training images using Bing Image Creator to facilitate the training of AI/ML image models.

Language: Python - Size: 27.3 KB - Last synced at: 4 months ago - Pushed at: 5 months ago - Stars: 4 - Forks: 0

david-smejkal/wiki2txt

A tool to extract plain (unformatted) multilingual / language-agnostic text, redirects, links and categories from wikipedia backups (dumps). Designed to prepare clean training data for AI Training / Machine Learning software.

Language: Python - Size: 215 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 6 - Forks: 1

thapelomagqazana/car-racing-ai-dashboard

A real-time AI training dashboard for CarRacing-v0 using FastAPI, React, ONNX Runtime, and PostgreSQL.

Language: Shell - Size: 0 Bytes - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

philips-software/go-hsdp-api 📦

Client library to interact with various APIs used within Philips in a simple and uniform way

Language: Go - Size: 2.96 MB - Last synced at: 10 days ago - Pushed at: 6 months ago - Stars: 32 - Forks: 11

Ohimoiza1205/ocr-label-studio-automation-

This project is an end-to-end workflow for processing a sample invoice using OCR & manual annotation. The project demonstrates how to extract text from an invoice using Tesseract OCR, refine the results in Label Studio, & prepare a high-quality dataset for AI training. Includes configuration files, scripts, & documentation for document processing

Size: 46.9 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

ksm26/Pretraining-LLMs

Master the essential steps of pretraining large language models (LLMs). Learn to create high-quality datasets, configure model architectures, execute training runs, and assess model performance for efficient and effective LLM pretraining.

Language: Jupyter Notebook - Size: 29.3 KB - Last synced at: 5 months ago - Pushed at: about 1 year ago - Stars: 13 - Forks: 5

fDero/BackPropy

A very simple implementation of the back-propagation algorithm for neural networks training, written in python, from scratch (no-frameworks)

Language: Python - Size: 15.6 KB - Last synced at: 28 days ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

LynnColeArt/The-Claudinator

A simple Chromium plugin for downloading and archiving your Anthropic AI chats with Claude 3 models

Language: JavaScript - Size: 19.5 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

M1ck4/MichaelAngel.io

Ethical AI Powered by Creative Commons

Language: Python - Size: 1.95 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 2 - Forks: 0

shane-staret/AI-Dog-Identification-System-Bucknell-CSCI-357

A Python solution utilizing neural networks and deep learning (via TensorFlow & Keras) to classify images as containing dogs or not.

Language: Python - Size: 453 MB - Last synced at: 9 days ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

NotwenCaasi/lander_AI_project

an ai spacecraft trained to land on a diversity of planets

Language: Python - Size: 99.6 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

musty-ess/Nim-AI-Reinforcement-Learning

This project implements an AI that teaches itself to play the game of Nim using Q-learning, a form of reinforcement learning. By playing games against itself, the AI learns optimal strategies for playing Nim, eventually improving its performance by updating a reward table based on game outcomes.

Language: Python - Size: 10.7 KB - Last synced at: 6 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

feliperibeirosc/ScraperAI-ProfessoresUFABC

WebScraper que usa a API do Google Gemini para analisar e catalogar professores da UFABC conforme formação e área de interesse

Language: Python - Size: 213 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

rares9301/datatrain

simple IQR & Z-score normalization script

Language: Python - Size: 357 KB - Last synced at: about 16 hours ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

lehrmanaidins/Subintelligence-MK1

This neural network is designed to be able to take an 20px-by-20px gray-scale image and detect whether the input image contains either a rectangle or a circle.

Language: Python - Size: 14.9 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

huaxiaozhong1/YourOwnModel-TfLite-RaspberryPi

An "AI-on-device" project walks with you through all necessary steps, from collecting your own data, creating and training your own Tensorflow model, generating your own Tensorflow-lite model, developing both Python and C++ programs to recognize images on Raspberry Pi 3.

Language: Python - Size: 28.3 KB - Last synced at: almost 2 years ago - Pushed at: over 5 years ago - Stars: 4 - Forks: 0

huaxiaozhong1/Tensorflow-SparkFunEdge-FullLifeCycel-for-SequenceModel

An "AI on-device" project for sequence model. Based at Tensorflow Lite for micro-controller, the model is created/trained/converted/flashed. At the end, an app is able to run, at SparkFun Edge Dev board, to recongnize speech although just words.

Size: 199 KB - Last synced at: almost 2 years ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

failfa-st/LoRAdo

LoRAdo is a UI that allows easy creation of LoRAs for stable diffussion

Language: TypeScript - Size: 40.3 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 12 - Forks: 0

InfamousTechnician/Porsche-911

Tryna train YouTube to suggest Porsche vids.

Size: 8.4 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

GithubUserAccountAmazing/vid2train

A Tool for Extracting Images from a Video for Artificial Intelligence Training.

Language: Python - Size: 52.7 KB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

o7q/ez-instant-ngp

A pre-configured instant-ngp workspace that includes helpful scripts for getting started with NeRF training.

Language: Python - Size: 300 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

ecomp-shONgit/platon-goldstandard

Sammlung von Paraphrasen zu platonischen Textstellen

Size: 1.49 MB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 5 - Forks: 0

klhenams/solid-octo-robot

An app that implements labeled data management that can be used to train a new AI

Language: Python - Size: 90.8 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

Kevin-Kwan/MATLAB-face-training

A private internship coding project that I worked on in the Summer of 2019 where I did some MATLAB facial recognition stuff. Utilizes machine learning/facial recognition to identify people. This was my first time using MATLAB.

Language: MATLAB - Size: 18.2 MB - Last synced at: over 2 years ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 0

42-AI/champions_for_corewar

A collection of 42 students' Core War Champions for AI training purposes

Language: Assembly - Size: 35.2 KB - Last synced at: over 2 years ago - Pushed at: over 5 years ago - Stars: 9 - Forks: 0