GitHub topics: llm-training-data

Repositories

deepakshroff/Capston-Gemini-ChatBot

👨‍🏫This project was developed under the guidance of Mr. Lokesh Sir as part of the AI & ML Training Program. It explores LLM integration using Google Gemini APIs with a custom UI built on Streamlit.

Language: Python - Size: 117 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 0

Cre4T3Tiv3/Cre4T3Tiv3

I’m a senior software engineer crafting scalable systems end to end. With 10+ years across fintech, ad-tech, and enterprise SaaS, I deliver production-grade software that fuses robust backend architecture, seamless frontend UX, and cutting-edge AI & ML native tools.

Size: 714 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 4 - Forks: 0

vinsblack/The-Stach-Processed-v2

Sample edition of The Stack Enriched: annotated, secure, and optimized code dataset, this is a sample version

Language: Python - Size: 199 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 0 - Forks: 0

BlazeWild/Custom_LLM_DataGen_Template

🔧 Modular pipeline for generating high-quality, domain-specific datasets for LLM fine-tuning — from PDFs and web scraping to synthetic Q&A generation, quality filtering, and training-ready formatting.

Language: Python - Size: 24.4 KB - Last synced at: about 2 hours ago - Pushed at: 15 days ago - Stars: 0 - Forks: 0

emailmarketingdataset/Open-Email-Marketing-Dataset

Following is the Open Email Marketing Dataset; you can use it without any restrictions.

Size: 172 KB - Last synced at: 19 days ago - Pushed at: 20 days ago - Stars: 0 - Forks: 0

Related Keywords

llm-training-data 5 llm-training 3 code-quality 1 machine-learning-dataset 1 premium-dataset 1 programming-languages 1 finetuning-large-language-models 1 finetuning-llms 1 llama3 1 lora-fine-tuning 1 synthetic-dataset-generation 1 template-generic-repo 1 b2b-dataset 1 cold-email 1 email-marketing 1 gdpr-compliant 1 jsonl 1 lead-generation 1 marketing-dataset 1 open-dataset 1 seo-dataset 1 verified-emails 1 api-client 1 artifical-intelligence-engineer 1 artificial-intelligence 1 artificial-intelligence-algorithms 1 artificial-neural-networks 1 data-structures-and-algorithms 1 llm-inference 1 llms 1 machine-learning 1 machine-learning-algorithms 1 machine-learning-engineer 1 machine-learning-models 1 machine-learning-projects 1 software-architecture 1 software-development 1 software-engineer 1 software-engineering 1 software-testing 1 ai-code-generation 1 code-generation 1