GitHub topics: gemma
LearningCircuit/local-deep-research
Local Deep Research is an AI-powered assistant that transforms complex questions into comprehensive, cited reports by conducting iterative analysis using any LLM across diverse knowledge sources including academic databases, scientific repositories, web content, and private document collections.
Language: Python - Size: 2.64 MB - Last synced at: about 6 hours ago - Pushed at: about 21 hours ago - Stars: 2,637 - Forks: 268

xorbitsai/inference
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.
Language: Python - Size: 44.9 MB - Last synced at: about 5 hours ago - Pushed at: about 9 hours ago - Stars: 7,804 - Forks: 665

google-gemini/gemma-cookbook
A collection of guides and examples for the Gemma open models from Google.
Language: Jupyter Notebook - Size: 116 MB - Last synced at: about 9 hours ago - Pushed at: about 19 hours ago - Stars: 1,427 - Forks: 244

Relaxolotl17/gemma-3-tutorial
Detailed guide on Google's Gemma 3 AI
Language: Python - Size: 50.8 KB - Last synced at: about 22 hours ago - Pushed at: about 23 hours ago - Stars: 0 - Forks: 0

unslothai/unsloth
Finetune Qwen3, Llama 4, TTS, DeepSeek-R1 & Gemma 3 LLMs 2x faster with 70% less memory! 🦥
Language: Python - Size: 6.48 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 38,528 - Forks: 3,017

ollama/ollama
Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1 and other large language models.
Language: Go - Size: 41.9 MB - Last synced at: 1 day ago - Pushed at: 2 days ago - Stars: 140,249 - Forks: 11,716

mudler/LocalAI
🤖 The free, open-source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more model architectures. Features: text, audio, video, and image generation, voice cloning, and distributed P2P inference
Language: Go - Size: 18.6 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 32,524 - Forks: 2,475

LostRuins/koboldcpp (fork of ggml-org/llama.cpp)
Run GGUF models easily with a KoboldAI UI. One File. Zero Install.
Language: C++ - Size: 250 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 7,289 - Forks: 466

seph1709/Wingman
Run AI language models locally on Android.
Language: Dart - Size: 381 KB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 0 - Forks: 0

google/gemma_pytorch
The official PyTorch implementation of Google's Gemma models
Language: Python - Size: 5.39 MB - Last synced at: about 2 hours ago - Pushed at: about 2 months ago - Stars: 5,441 - Forks: 538

KudoAI/googlegpt
🤖 AI chat & search summaries in Google Search, powered by the latest LLMs
Language: JavaScript - Size: 56.9 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 164 - Forks: 15

heilcheng/gemma-benchmark
A comprehensive benchmarking suite for Gemma language models (work-in-progress implementation)
Language: Python - Size: 28.3 KB - Last synced at: 3 days ago - Pushed at: about 1 month ago - Stars: 6 - Forks: 1

darrenburns/elia
A snappy, keyboard-centric terminal user interface for interacting with large language models. Chat with ChatGPT, Claude, Llama 3, Phi 3, Mistral, Gemma and more.
Language: Python - Size: 567 KB - Last synced at: 3 days ago - Pushed at: 7 months ago - Stars: 2,145 - Forks: 131

yangjianxin1/Firefly
Firefly: a large-model training tool that supports training Qwen2.5, Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, Llama, Qwen, Baichuan, ChatGLM2, InternLM, Ziya2, Vicuna, Bloom, and other large models
Language: Python - Size: 6.24 MB - Last synced at: 3 days ago - Pushed at: 7 months ago - Stars: 6,382 - Forks: 576

magpie-align/magpie
[ICLR 2025] Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing. Your efficient and high-quality synthetic data generation pipeline!
Language: Python - Size: 1.08 MB - Last synced at: 3 days ago - Pushed at: about 2 months ago - Stars: 695 - Forks: 61

beloveddie/AI-Craft
A collection of Jupyter notebook experiments and applications centered around Generative AI with LLMs.
Language: Jupyter Notebook - Size: 1.66 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 4 - Forks: 0

cgjosephlee/ollama-save-load
Save and load ollama models just like operating docker images.
Language: Python - Size: 1.95 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 16 - Forks: 4

Mobile-Artificial-Intelligence/llama_sdk
lcpp is a Dart implementation of llama.cpp used by the Mobile Artificial Intelligence Distribution (maid)
Language: Dart - Size: 1.64 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 82 - Forks: 20

clusterzx/paperless-ai
An automated document analyzer for Paperless-ngx using OpenAI API, Ollama, Deepseek-r1, Azure and all OpenAI API compatible Services to automatically analyze and tag your documents.
Language: JavaScript - Size: 14 MB - Last synced at: 5 days ago - Pushed at: 14 days ago - Stars: 3,081 - Forks: 112

tattn/LocalLLMClient
A local LLM client for iOS, macOS
Language: Swift - Size: 215 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 2 - Forks: 0

mlc-ai/web-llm-chat
Chat with AI large language models running natively in your browser. Enjoy private, server-free, seamless AI conversations.
Language: TypeScript - Size: 23.3 MB - Last synced at: 3 days ago - Pushed at: 9 days ago - Stars: 734 - Forks: 124

mohamedsaid-sd/3
A repository dedicated to exploring the significance of the number 3 in various cultures, mathematics, and symbolism. Delve into the mystical and mathematical properties of this enigmatic digit through code, analysis, and creative interpretations.
Size: 0 Bytes - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

NLPForUA/ZNO
Structured test tasks and model tuning scripts for multiple subjects from ZNO - the Ukrainian External Independent Evaluation (ЗНО)
Language: Python - Size: 2.27 MB - Last synced at: about 19 hours ago - Pushed at: about 20 hours ago - Stars: 4 - Forks: 0

SchBenedikt/ai-agent
Testing macOS AI Agent with Google Gemini Live Web API
Language: Python - Size: 38.1 KB - Last synced at: 5 days ago - Pushed at: about 1 month ago - Stars: 2 - Forks: 0

Picovoice/picollm
On-device LLM Inference Powered by X-Bit Quantization
Language: Python - Size: 94.2 MB - Last synced at: 3 days ago - Pushed at: 8 days ago - Stars: 237 - Forks: 13

AI-Hypercomputer/JetStream
JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs welcome).
Language: Python - Size: 6.35 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 322 - Forks: 39

saurabhnative/GithubProfileAnalysisLLM
GitHub profile analyzer using LLM and data science tools
Language: Jupyter Notebook - Size: 11.7 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

hihumanzone/AI-Discord-Bot-GEMM-X
AI Discord Bot (GEMM-X) is an intelligent assistant for Discord, leveraging AI technologies from multiple providers to generate images, create music, produce speech, and more. It supports custom personality settings and advanced user/server configurations.
Language: JavaScript - Size: 137 KB - Last synced at: about 1 hour ago - Pushed at: 9 months ago - Stars: 3 - Forks: 1

GaiZhenbiao/ChuanhuChatGPT
GUI for ChatGPT API and many LLMs. Supports agents, file-based QA, GPT finetuning and query with web search. All with a neat UI.
Language: Python - Size: 3.11 MB - Last synced at: 7 days ago - Pushed at: 2 months ago - Stars: 15,422 - Forks: 2,284

fly-apps/ollama-open-webui
Self-host a ChatGPT-style web interface for Ollama 🦙
Language: Shell - Size: 28.3 KB - Last synced at: 3 days ago - Pushed at: 9 months ago - Stars: 86 - Forks: 29

Tsai1030/rag-air-pollution
A RAG-based retrieval system for air pollution topics using LangChain and ChromaDB.
Language: Python - Size: 5.46 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 1 - Forks: 0

johnsonhk88/AI-Bank-Statement-Document-Automation-By-LLM-And-Personal-Finanical-Analysis-Prediction
AI bank statement document automation with an LLM, plus personal financial analysis
Language: Jupyter Notebook - Size: 42.5 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 23 - Forks: 5

InternLM/InternEvo
InternEvo is an open-source, lightweight training framework that aims to support model pre-training without the need for extensive dependencies.
Language: Python - Size: 6.78 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 386 - Forks: 64

muhammadawaisshaikh/ai-python-Javascript
Open-source, full-fledged AI recipes backed by Python, Angular, and React
Language: TypeScript - Size: 2.89 MB - Last synced at: 6 days ago - Pushed at: 10 days ago - Stars: 3 - Forks: 0

google/generative-ai-docs
Documentation for Google's Gen AI site - including the Gemini API and Gemma
Language: Jupyter Notebook - Size: 52.5 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 1,987 - Forks: 693

alphasecio/replicate
A Streamlit app for running text, image, code, audio, and music generation models on Replicate.
Language: Python - Size: 3.81 MB - Last synced at: 3 days ago - Pushed at: 9 days ago - Stars: 4 - Forks: 0

alphasecio/fireworks
A Streamlit app for running open-source text and image models on Fireworks AI.
Language: Python - Size: 2.53 MB - Last synced at: 3 days ago - Pushed at: 9 days ago - Stars: 0 - Forks: 0

RATHOD-SHUBHAM/Finetuning-LLMs
This repository contains experiments on fine-tuning LLMs (Llama, Llama3.1, Gemma). It includes notebooks for model tuning, data preprocessing, and hyperparameter optimization to enhance model performance.
Language: Jupyter Notebook - Size: 5.12 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 1 - Forks: 0

sozercan/aikit
🏗️ Fine-tune, build, and deploy open-source LLMs easily!
Language: Go - Size: 4.26 MB - Last synced at: 3 days ago - Pushed at: 9 days ago - Stars: 448 - Forks: 38

EmbeddedLLM/embeddedllm
EmbeddedLLM: API server for embedded device deployment. Currently supports CUDA/OpenVINO/IpexLLM/DirectML/CPU
Language: Python - Size: 12.6 MB - Last synced at: 3 days ago - Pushed at: 7 months ago - Stars: 38 - Forks: 1

zake7749/Kyara
Lightweight yet Effective Chinese LLM.
Language: Jupyter Notebook - Size: 255 KB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 27 - Forks: 2

alphasecio/groq
A Streamlit chatbot for running open-source text models on Groq.
Language: Python - Size: 616 KB - Last synced at: 3 days ago - Pushed at: 11 days ago - Stars: 1 - Forks: 1

inferflow/inferflow
Inferflow is an efficient and highly configurable inference engine for large language models (LLMs).
Language: C++ - Size: 1.89 MB - Last synced at: 3 days ago - Pushed at: about 1 year ago - Stars: 243 - Forks: 25

Gholamrezadar/ollama-image-captioning
Captions images using Ollama and a multimodal model like Gemma3:4b.
Language: Python - Size: 1000 Bytes - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 0 - Forks: 0

VNG-Realisatie/GEMMA-TA-Archi-repository
Setup of the GEMMA technical architecture. The TA is part of GEMMA online. The most recent version is available via the link below
Size: 6.55 MB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 0 - Forks: 0

Genta-Technology/Kolosal
Kolosal AI is an OpenSource and Lightweight alternative to LM Studio to run LLMs 100% offline on your device.
Language: C++ - Size: 67.2 MB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 231 - Forks: 18

QuantiusBenignus/BlahST
Input text from speech in any Linux window, the lean, fast and accurate way, using whisper.cpp OFFLINE. Speak with local LLMs.
Language: Shell - Size: 1.05 MB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 75 - Forks: 6

RivuChk/GolpoAI
Demo GenAI project with KMP, Gemini and Gemma
Language: Kotlin - Size: 1.25 MB - Last synced at: 2 days ago - Pushed at: 16 days ago - Stars: 5 - Forks: 0

konyshevgmbh/epub-bilingual-bakery
German EPUB to bilingual German-Russian converter using NLLB and Gemma (via Ollama). For fun and experiments with NLP and translation.
Language: Python - Size: 5.61 MB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 0 - Forks: 0

ambideXtrous9/Streamlit-App
Streamlit App
Language: Jupyter Notebook - Size: 210 MB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 1 - Forks: 5

curious-rookiee/RAG-based-Legal-Document-Chatbot
Submission of the RAG-based Legal Document Chatbot project for CSI-509 Deep Learning Lab, by Avinesh Pambally & Chaturthi Naik. Class: MSc. Artificial Intelligence – Part 1
Language: Jupyter Notebook - Size: 763 KB - Last synced at: 14 days ago - Pushed at: 16 days ago - Stars: 0 - Forks: 0

KalpataruLabs/youtube-stocks-analyser-crewai-local-ollama
YouTube stocks analysis with Ollama and CrewAI on local machine
Language: Python - Size: 239 KB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 4 - Forks: 0

Ki-Seki/Awesome-Transformer-Visualization
Explore visualization tools for understanding Transformer-based large language models (LLMs)
Size: 23.4 MB - Last synced at: 5 days ago - Pushed at: 5 months ago - Stars: 12 - Forks: 2

KazKozDev/KazKozDev
⚡AI solutions with language models.
Size: 67.4 KB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 0 - Forks: 0

arafkarsh/ms-springboot-ai
Java 23, SpringBoot 3.4.1 Examples using Deep Learning 4 Java & LangChain4J for Generative AI using ChatGPT LLM, RAG and other open source LLMs. Sentiment Analysis, Application Context based ChatBots. Custom Data Handling. LLMs - GPT 3.5 / 4o, Gemini Pro 1.5, Claude 3, Llama 3.1, Phi-3, Gemma 2, Falcon 3, Qwen 2.5, Mistral Nemo, Wizard Math
Language: Java - Size: 22.3 MB - Last synced at: 3 days ago - Pushed at: 4 months ago - Stars: 27 - Forks: 13

eliranwong/groqchat
A terminal chatbot, powered by Groq Cloud API (Windows / macOS / Linux / Android / iOS)
Language: Python - Size: 77.1 KB - Last synced at: 3 days ago - Pushed at: 2 months ago - Stars: 15 - Forks: 3

AI-Hypercomputer/jetstream-pytorch
PyTorch/XLA integration with JetStream (https://github.com/google/JetStream) for LLM inference
Language: Python - Size: 1.41 MB - Last synced at: 7 days ago - Pushed at: about 2 months ago - Stars: 60 - Forks: 17

tanyuqian/redco
NAACL '24 (Best Demo Paper Runner-Up) / MLSys @ NeurIPS '23 - RedCoast: A Lightweight Tool to Automate Distributed Training and Inference
Language: Python - Size: 11.5 MB - Last synced at: 3 days ago - Pushed at: 5 months ago - Stars: 65 - Forks: 7

papersgpt/papersgpt-for-zotero
Zotero chat PDF with AI, DeepSeek, GPT 4.5, ChatGPT, Claude, Gemini, Llama 4
Language: JavaScript - Size: 21.4 MB - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 1,526 - Forks: 48

mdda/getting-to-aha-with-tpus
Reasoning-from-Zero using gemma.JAX.nnx on TPUs
Language: Python - Size: 292 KB - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 9 - Forks: 0

ariya/gamal
Research tool leveraging LLM for answers
Language: JavaScript - Size: 201 KB - Last synced at: 3 days ago - Pushed at: 6 months ago - Stars: 52 - Forks: 2

arssite/GeminiAi-
Using FineTune Models for Document Q/A and Chatbots
Language: Jupyter Notebook - Size: 4.54 MB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 3 - Forks: 0

luo-anthony/DeveloperGPT
DeveloperGPT is an LLM-powered command-line tool that converts natural language to terminal commands and offers in-terminal chat.
Language: Python - Size: 6.98 MB - Last synced at: 5 days ago - Pushed at: 5 months ago - Stars: 42 - Forks: 5

jpmanson/llm_templates
Instruction/chat prompts creation library for text generation LLMs. It supports local and Hugging Face models.
Language: Python - Size: 302 KB - Last synced at: 3 days ago - Pushed at: 9 months ago - Stars: 29 - Forks: 1

SeoyeonPark1223/Gemma-FineTuning
Gemma FineTuning Project
Language: Jupyter Notebook - Size: 167 KB - Last synced at: 29 days ago - Pushed at: 29 days ago - Stars: 1 - Forks: 0

ganeshnikhil/J.A.R.V.I.S.2.0
Open-source assistant using small models (2B-5B), with agentic and tool-calling capabilities and RAG integration with efficient memory. Android support using adb.
Language: Python - Size: 569 KB - Last synced at: 29 days ago - Pushed at: 29 days ago - Stars: 125 - Forks: 21

ibrahimhabibeg/Dahih-Al-Dofaa
Personalized AI assistant for university students that answers from the students' slides, textbooks, and notes.
Language: TypeScript - Size: 768 KB - Last synced at: 29 days ago - Pushed at: 29 days ago - Stars: 7 - Forks: 0

jakobhoeg/nextjs-ollama-llm-ui
Fully-featured web interface for Ollama LLMs
Language: TypeScript - Size: 5.87 MB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 1,184 - Forks: 284

albertstarfield/project-zephyrine
Introducing Project Zephyrine: Elevating Your Interaction Plug and Play, and Employing GPU Acceleration within a Modernized Automata Local Graphical User Interface.
Language: JavaScript - Size: 647 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 20 - Forks: 1

NotShrirang/LLM-Garden
Implementing different LLM architectures in a single repo
Language: Python - Size: 30.3 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

marklysze/LlamaIndex-RAG-WSL-CUDA
Examples of RAG using Llamaindex with local LLMs - Gemma, Mixtral 8x7B, Llama 2, Mistral 7B, Orca 2, Phi-2, Neural 7B
Language: Jupyter Notebook - Size: 316 KB - Last synced at: 30 days ago - Pushed at: about 1 year ago - Stars: 125 - Forks: 13

AstraBert/qdurllm
Search your favorite websites and chat with them, on your desktop🌐
Language: Python - Size: 1.06 MB - Last synced at: 5 days ago - Pushed at: 4 months ago - Stars: 30 - Forks: 2

jhordyess/chat-bot-ollama
A simple chat bot application built using Next.js and Ollama with the chosen model.
Language: TypeScript - Size: 54.7 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

PavlidisLab/gemma.R
An R wrapper for the Gemma RESTful API
Language: R - Size: 84.1 MB - Last synced at: 15 days ago - Pushed at: about 1 month ago - Stars: 10 - Forks: 1

MaxMLang/RAG-nificent
Production-ready Chainlit RAG application with Pinecone pipeline offering all Groq and OpenAI Models, to chat with your documents.
Language: Python - Size: 35.4 MB - Last synced at: 3 days ago - Pushed at: 2 months ago - Stars: 11 - Forks: 0

QuantiusBenignus/Zshelf
Zsh-centric command-line interface for interacting with local Large Language Models (LLMs). Chat directly on the command line with non-contiguous command line calls.
Language: Shell - Size: 80.1 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 2 - Forks: 1

loong64/ollama
Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 2, and other large language models.
Language: Dockerfile - Size: 18.6 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 8 - Forks: 1

atomostechhq/google-gemma-for-web-exp
A web-based implementation of Google's Gemma language model using MediaPipe Tasks for GenAI.
Language: JavaScript - Size: 4.88 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

proflead/gemma-3-tutorial
Detailed guide on Google's Gemma 3 AI
Language: Python - Size: 49.8 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

johnsonhk88/Deep-Research-With-Web-Scraping-by-LLM-And-AI-Agent
Use an LLM/AI agent for web scraping (data collection) and data analysis with deep research
Language: Jupyter Notebook - Size: 217 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 3 - Forks: 1

snowkylin/refsheet_chat
Chat with a character via reference images!
Language: Python - Size: 24.4 KB - Last synced at: 20 days ago - Pushed at: about 1 month ago - Stars: 2 - Forks: 0

Ananyaiitbhilai/KGViz
[KGC '24] This application is for visualisation of Knowledge Graphs. We employ a novel technique that uses an LLM-based agent for triple extraction from unstructured text. It was also accepted at Text2KG 2024 (ESWC). However, it has a better prompting strategy to carry. This tool's backend can be considered an extension.
Language: JavaScript - Size: 1.48 MB - Last synced at: 29 days ago - Pushed at: about 1 year ago - Stars: 10 - Forks: 2

kkdai/linebot-gemma
A LINE Bot demo showcasing how to use a local LLM (Gemma) via Groq to modify personal information and detect the need for LLM assistance.
Language: Python - Size: 1.72 MB - Last synced at: 7 days ago - Pushed at: 10 months ago - Stars: 17 - Forks: 4

iakashpaul/Ghudsavar
Ghudsavar (Horse rider) is a quick llama.cpp server for CPU-only runtimes
Language: Dockerfile - Size: 16.6 KB - Last synced at: 3 days ago - Pushed at: about 1 year ago - Stars: 4 - Forks: 0

jorge-armando-navarro-flores/chat_with_your_docs
Discover and converse with advanced AI models like Mistral, LLAMA2, and GPT-3.5 from leading sources like OLLAMA, Hugging Face, and OpenAI. Easily extract insights from PDFs, web pages, and YouTube videos with our intuitive interface. Unlock the power of knowledge with seamless chat interactions.
Language: Python - Size: 10.1 MB - Last synced at: 3 days ago - Pushed at: 8 months ago - Stars: 147 - Forks: 14

akshat2602/Omistral
Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, and other large language models.
Language: Rust - Size: 37.1 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

smartloop-ai/smartloop
Smartloop is an open-source SLM platform to train and run models on an edge device
Language: Python - Size: 75.2 KB - Last synced at: 2 days ago - Pushed at: about 2 months ago - Stars: 2 - Forks: 0

JohnClaw/chatllm.v
V-lang api wrapper for llm-inference chatllm.cpp
Language: C - Size: 804 KB - Last synced at: 29 days ago - Pushed at: 6 months ago - Stars: 5 - Forks: 0

get2kiran/llm-finetune
Fine-tune models from Hugging Face using libraries such as trl, peft, and transformers.
Language: Python - Size: 8.79 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

alemiaschi/LLM_profiling
[EMNLP 2024] Materials for the paper "Evaluating Large Language Models via Linguistic Profiling"
Size: 2.82 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

leliuga/cohere-configurations
Co:Here Inference configurations
Language: Go - Size: 9.48 MB - Last synced at: 3 days ago - Pushed at: 12 months ago - Stars: 10 - Forks: 1

proflead/google-ai-studio-tutorial
Google AI Studio Tutorial for Beginners
Size: 0 Bytes - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

PRITHIVSAKTHIUR/Gemma-3-Multimodal
Gemma 3 [ Image-text-text ] [ video inference ] [ multi image chat ]
Language: Python - Size: 0 Bytes - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

sitamgithub-MSIT/sveltekit-huggingface
A pirate-themed chatbot using Gemma2 9B-it via Groq, Vercel AI SDK, and SvelteKit.
Language: CSS - Size: 174 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

ShahDishank/neuron
Neuron is a conversational AI model using the Gemma LLM by Google from Hugging Face. It is designed to engage in a variety of topics and provide information on a wide range of subjects. With its ability to learn and adapt, this chatbot can provide a unique and engaging experience.
Language: Python - Size: 43.9 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

coding-chemist/GroqWarp
GroqWarp is a Streamlit app that compares the performance of RAG using Groq and Ollama models, visualizing response times and accuracy. It leverages FAISS for document retrieval and displays a side-by-side performance chart.
Language: Python - Size: 176 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

Opillion-GmbH-Co-KG/easy-stack-deepseek
Need a seamless AI model hosting setup with Ollama, efficient vector search with Qdrant, and an intuitive WebUI? This is your stack!
Language: Shell - Size: 866 KB - Last synced at: about 1 month ago - Pushed at: 2 months ago - Stars: 3 - Forks: 0

vs4vijay/AI-Playground
All-in-One AI Playground for LLM, Chat, RAG, Agents, etc.
Language: Python - Size: 165 KB - Last synced at: 3 days ago - Pushed at: 3 months ago - Stars: 4 - Forks: 1

Beomi/Gemma-EasyLM
Train GEMMA on TPU/GPU! (Codebase for training Gemma-Ko Series)
Language: Python - Size: 410 KB - Last synced at: 7 days ago - Pushed at: about 1 year ago - Stars: 47 - Forks: 10

marklysze/LlamaIndex-RAG-Linux-CUDA
Examples of RAG using Llamaindex with local LLMs in Linux - Gemma, Mixtral 8x7B, Llama 2, Mistral 7B, Orca 2, Phi-2, Neural 7B
Language: Jupyter Notebook - Size: 175 KB - Last synced at: 30 days ago - Pushed at: about 1 year ago - Stars: 9 - Forks: 4
