An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: gemma

LearningCircuit/local-deep-research

Local Deep Research is an AI-powered assistant that transforms complex questions into comprehensive, cited reports by conducting iterative analysis using any LLM across diverse knowledge sources including academic databases, scientific repositories, web content, and private document collections.

Language: Python - Size: 2.64 MB - Last synced at: about 6 hours ago - Pushed at: about 21 hours ago - Stars: 2,637 - Forks: 268

xorbitsai/inference

Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.

Language: Python - Size: 44.9 MB - Last synced at: about 5 hours ago - Pushed at: about 9 hours ago - Stars: 7,804 - Forks: 665

google-gemini/gemma-cookbook

A collection of guides and examples for the Gemma open models from Google.

Language: Jupyter Notebook - Size: 116 MB - Last synced at: about 9 hours ago - Pushed at: about 19 hours ago - Stars: 1,427 - Forks: 244

Relaxolotl17/gemma-3-tutorial

Detailed guide on Google's Gemma 3 AI

Language: Python - Size: 50.8 KB - Last synced at: about 22 hours ago - Pushed at: about 23 hours ago - Stars: 0 - Forks: 0

unslothai/unsloth

Finetune Qwen3, Llama 4, TTS, DeepSeek-R1 & Gemma 3 LLMs 2x faster with 70% less memory! 🦥

Language: Python - Size: 6.48 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 38,528 - Forks: 3,017

ollama/ollama

Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1 and other large language models.

Language: Go - Size: 41.9 MB - Last synced at: 1 day ago - Pushed at: 2 days ago - Stars: 140,249 - Forks: 11,716

mudler/LocalAI

🤖 The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more model architectures. Features: Generate Text, Audio, Video, Images, Voice Cloning, Distributed, P2P inference

Language: Go - Size: 18.6 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 32,524 - Forks: 2,475

LostRuins/koboldcpp Fork of ggml-org/llama.cpp

Run GGUF models easily with a KoboldAI UI. One File. Zero Install.

Language: C++ - Size: 250 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 7,289 - Forks: 466

seph1709/Wingman

Run AI language models locally on Android.

Language: Dart - Size: 381 KB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 0 - Forks: 0

google/gemma_pytorch

The official PyTorch implementation of Google's Gemma models

Language: Python - Size: 5.39 MB - Last synced at: about 2 hours ago - Pushed at: about 2 months ago - Stars: 5,441 - Forks: 538

KudoAI/googlegpt

🤖 AI chat & search summaries in Google Search, powered by the latest LLMs

Language: JavaScript - Size: 56.9 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 164 - Forks: 15

heilcheng/gemma-benchmark

A comprehensive benchmarking suite for Gemma language models (work-in-progress implementation)

Language: Python - Size: 28.3 KB - Last synced at: 3 days ago - Pushed at: about 1 month ago - Stars: 6 - Forks: 1

darrenburns/elia

A snappy, keyboard-centric terminal user interface for interacting with large language models. Chat with ChatGPT, Claude, Llama 3, Phi 3, Mistral, Gemma and more.

Language: Python - Size: 567 KB - Last synced at: 3 days ago - Pushed at: 7 months ago - Stars: 2,145 - Forks: 131

yangjianxin1/Firefly

Firefly: a large language model training tool that supports training Qwen2.5, Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, Llama, Qwen, Baichuan, ChatGLM2, InternLM, Ziya2, Vicuna, Bloom, and other large models

Language: Python - Size: 6.24 MB - Last synced at: 3 days ago - Pushed at: 7 months ago - Stars: 6,382 - Forks: 576

magpie-align/magpie

[ICLR 2025] Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing. Your efficient and high-quality synthetic data generation pipeline!

Language: Python - Size: 1.08 MB - Last synced at: 3 days ago - Pushed at: about 2 months ago - Stars: 695 - Forks: 61

beloveddie/AI-Craft

A collection of Jupyter notebook experiments and applications centered around Generative AI with LLMs.

Language: Jupyter Notebook - Size: 1.66 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 4 - Forks: 0

cgjosephlee/ollama-save-load

Save and load ollama models just like operating docker images.

Language: Python - Size: 1.95 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 16 - Forks: 4

Mobile-Artificial-Intelligence/llama_sdk

lcpp is a Dart implementation of llama.cpp used by the mobile artificial intelligence distribution (maid)

Language: Dart - Size: 1.64 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 82 - Forks: 20

clusterzx/paperless-ai

An automated document analyzer for Paperless-ngx using OpenAI API, Ollama, Deepseek-r1, Azure and all OpenAI API compatible Services to automatically analyze and tag your documents.

Language: JavaScript - Size: 14 MB - Last synced at: 5 days ago - Pushed at: 14 days ago - Stars: 3,081 - Forks: 112

tattn/LocalLLMClient

A local LLM client for iOS, macOS

Language: Swift - Size: 215 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 2 - Forks: 0

mlc-ai/web-llm-chat

Chat with AI large language models running natively in your browser. Enjoy private, server-free, seamless AI conversations.

Language: TypeScript - Size: 23.3 MB - Last synced at: 3 days ago - Pushed at: 9 days ago - Stars: 734 - Forks: 124

mohamedsaid-sd/3

A repository dedicated to exploring the significance of the number 3 in various cultures, mathematics, and symbolism. Delve into the mystical and mathematical properties of this enigmatic digit through code, analysis, and creative interpretations.

Size: 0 Bytes - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

NLPForUA/ZNO

Structured test tasks and model tuning scripts for multiple subjects from ZNO - the Ukrainian External Independent Evaluation (ЗНО)

Language: Python - Size: 2.27 MB - Last synced at: about 19 hours ago - Pushed at: about 20 hours ago - Stars: 4 - Forks: 0

SchBenedikt/ai-agent

Testing macOS AI Agent with Google Gemini Live Web API

Language: Python - Size: 38.1 KB - Last synced at: 5 days ago - Pushed at: about 1 month ago - Stars: 2 - Forks: 0

Picovoice/picollm

On-device LLM Inference Powered by X-Bit Quantization

Language: Python - Size: 94.2 MB - Last synced at: 3 days ago - Pushed at: 8 days ago - Stars: 237 - Forks: 13

AI-Hypercomputer/JetStream

JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs welcome).

Language: Python - Size: 6.35 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 322 - Forks: 39

saurabhnative/GithubProfileAnalysisLLM

GitHub Profile Analyzer using LLM and data science tools

Language: Jupyter Notebook - Size: 11.7 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

hihumanzone/AI-Discord-Bot-GEMM-X

AI Discord Bot (GEMM-X) is an intelligent assistant for Discord, leveraging AI technologies from multiple providers to generate images, create music, produce speech, and more. It supports custom personality settings and advanced user/server configurations.

Language: JavaScript - Size: 137 KB - Last synced at: about 1 hour ago - Pushed at: 9 months ago - Stars: 3 - Forks: 1

GaiZhenbiao/ChuanhuChatGPT

GUI for ChatGPT API and many LLMs. Supports agents, file-based QA, GPT finetuning and query with web search. All with a neat UI.

Language: Python - Size: 3.11 MB - Last synced at: 7 days ago - Pushed at: 2 months ago - Stars: 15,422 - Forks: 2,284

fly-apps/ollama-open-webui

Self-host a ChatGPT-style web interface for Ollama 🦙

Language: Shell - Size: 28.3 KB - Last synced at: 3 days ago - Pushed at: 9 months ago - Stars: 86 - Forks: 29

Tsai1030/rag-air-pollution

A RAG-based retrieval system for air pollution topics using LangChain and ChromaDB.

Language: Python - Size: 5.46 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 1 - Forks: 0

johnsonhk88/AI-Bank-Statement-Document-Automation-By-LLM-And-Personal-Finanical-Analysis-Prediction

AI Bank Statement Document Automation by LLM and Personal Financial Analysis

Language: Jupyter Notebook - Size: 42.5 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 23 - Forks: 5

InternLM/InternEvo

InternEvo is an open-source, lightweight training framework that aims to support model pre-training without the need for extensive dependencies.

Language: Python - Size: 6.78 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 386 - Forks: 64

muhammadawaisshaikh/ai-python-Javascript

Open-source, full-fledged AI recipes backed by Python, Angular, and React

Language: TypeScript - Size: 2.89 MB - Last synced at: 6 days ago - Pushed at: 10 days ago - Stars: 3 - Forks: 0

google/generative-ai-docs

Documentation for Google's Gen AI site - including the Gemini API and Gemma

Language: Jupyter Notebook - Size: 52.5 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 1,987 - Forks: 693

alphasecio/replicate

A Streamlit app for running text, image, code, audio, and music generation models on Replicate.

Language: Python - Size: 3.81 MB - Last synced at: 3 days ago - Pushed at: 9 days ago - Stars: 4 - Forks: 0

alphasecio/fireworks

A Streamlit app for running open-source text and image models on Fireworks AI.

Language: Python - Size: 2.53 MB - Last synced at: 3 days ago - Pushed at: 9 days ago - Stars: 0 - Forks: 0

RATHOD-SHUBHAM/Finetuning-LLMs

This repository contains experiments on fine-tuning LLMs (Llama, Llama3.1, Gemma). It includes notebooks for model tuning, data preprocessing, and hyperparameter optimization to enhance model performance.

Language: Jupyter Notebook - Size: 5.12 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 1 - Forks: 0

sozercan/aikit

🏗️ Fine-tune, build, and deploy open-source LLMs easily!

Language: Go - Size: 4.26 MB - Last synced at: 3 days ago - Pushed at: 9 days ago - Stars: 448 - Forks: 38

EmbeddedLLM/embeddedllm

EmbeddedLLM: API server for Embedded Device Deployment. Currently supports CUDA/OpenVINO/IpexLLM/DirectML/CPU

Language: Python - Size: 12.6 MB - Last synced at: 3 days ago - Pushed at: 7 months ago - Stars: 38 - Forks: 1

zake7749/Kyara

Lightweight yet Effective Chinese LLM.

Language: Jupyter Notebook - Size: 255 KB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 27 - Forks: 2

alphasecio/groq

A Streamlit chatbot for running open-source text models on Groq.

Language: Python - Size: 616 KB - Last synced at: 3 days ago - Pushed at: 11 days ago - Stars: 1 - Forks: 1

inferflow/inferflow

Inferflow is an efficient and highly configurable inference engine for large language models (LLMs).

Language: C++ - Size: 1.89 MB - Last synced at: 3 days ago - Pushed at: about 1 year ago - Stars: 243 - Forks: 25

Gholamrezadar/ollama-image-captioning

Captions images using Ollama and a multimodal model like Gemma3:4b.

Language: Python - Size: 1000 Bytes - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 0 - Forks: 0

VNG-Realisatie/GEMMA-TA-Archi-repository

Setup of the GEMMA technical architecture. The TA is part of GEMMA online. The most recent version is available via the link below

Size: 6.55 MB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 0 - Forks: 0

Genta-Technology/Kolosal

Kolosal AI is an OpenSource and Lightweight alternative to LM Studio to run LLMs 100% offline on your device.

Language: C++ - Size: 67.2 MB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 231 - Forks: 18

QuantiusBenignus/BlahST

Input text from speech in any Linux window, the lean, fast and accurate way, using whisper.cpp OFFLINE. Speak with local LLMs.

Language: Shell - Size: 1.05 MB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 75 - Forks: 6

RivuChk/GolpoAI

Demo GenAI project with KMP, Gemini and Gemma

Language: Kotlin - Size: 1.25 MB - Last synced at: 2 days ago - Pushed at: 16 days ago - Stars: 5 - Forks: 0

konyshevgmbh/epub-bilingual-bakery

German EPUB to bilingual German-Russian converter using NLLB and Gemma (via Ollama). For fun and experiments with NLP and translation.

Language: Python - Size: 5.61 MB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 0 - Forks: 0

ambideXtrous9/Streamlit-App

Streamlit App

Language: Jupyter Notebook - Size: 210 MB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 1 - Forks: 5

curious-rookiee/RAG-based-Legal-Document-Chatbot

Submission of the RAG-based Legal Document Chatbot project for CSI-509 Deep Learning Lab, by Avinesh Pambally & Chaturthi Naik. Class: MSc. Artificial Intelligence – Part 1

Language: Jupyter Notebook - Size: 763 KB - Last synced at: 14 days ago - Pushed at: 16 days ago - Stars: 0 - Forks: 0

KalpataruLabs/youtube-stocks-analyser-crewai-local-ollama

YouTube stocks analysis with Ollama and CrewAI on local machine

Language: Python - Size: 239 KB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 4 - Forks: 0

Ki-Seki/Awesome-Transformer-Visualization

Explore visualization tools for understanding Transformer-based large language models (LLMs)

Size: 23.4 MB - Last synced at: 5 days ago - Pushed at: 5 months ago - Stars: 12 - Forks: 2

KazKozDev/KazKozDev

⚡AI solutions with language models.

Size: 67.4 KB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 0 - Forks: 0

arafkarsh/ms-springboot-ai

Java 23, SpringBoot 3.4.1 Examples using Deep Learning 4 Java & LangChain4J for Generative AI using ChatGPT LLM, RAG and other open source LLMs. Sentiment Analysis, Application Context based ChatBots. Custom Data Handling. LLMs - GPT 3.5 / 4o, Gemini Pro 1.5, Claude 3, Llama 3.1, Phi-3, Gemma 2, Falcon 3, Qwen 2.5, Mistral Nemo, Wizard Math

Language: Java - Size: 22.3 MB - Last synced at: 3 days ago - Pushed at: 4 months ago - Stars: 27 - Forks: 13

eliranwong/groqchat

A terminal chatbot, powered by Groq Cloud API (Windows / macOS / Linux / Android / iOS)

Language: Python - Size: 77.1 KB - Last synced at: 3 days ago - Pushed at: 2 months ago - Stars: 15 - Forks: 3

AI-Hypercomputer/jetstream-pytorch

PyTorch/XLA integration with JetStream (https://github.com/google/JetStream) for LLM inference

Language: Python - Size: 1.41 MB - Last synced at: 7 days ago - Pushed at: about 2 months ago - Stars: 60 - Forks: 17

tanyuqian/redco

NAACL '24 (Best Demo Paper RunnerUp) / MlSys @ NeurIPS '23 - RedCoast: A Lightweight Tool to Automate Distributed Training and Inference

Language: Python - Size: 11.5 MB - Last synced at: 3 days ago - Pushed at: 5 months ago - Stars: 65 - Forks: 7

papersgpt/papersgpt-for-zotero

Zotero chat PDF with AI, DeepSeek, GPT 4.5, ChatGPT, Claude, Gemini, Llama 4

Language: JavaScript - Size: 21.4 MB - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 1,526 - Forks: 48

mdda/getting-to-aha-with-tpus

Reasoning-from-Zero using gemma.JAX.nnx on TPUs

Language: Python - Size: 292 KB - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 9 - Forks: 0

ariya/gamal

Research tool leveraging LLM for answers

Language: JavaScript - Size: 201 KB - Last synced at: 3 days ago - Pushed at: 6 months ago - Stars: 52 - Forks: 2

arssite/GeminiAi-

Using FineTune Models for Document Q/A and Chatbots

Language: Jupyter Notebook - Size: 4.54 MB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 3 - Forks: 0

luo-anthony/DeveloperGPT

DeveloperGPT is an LLM-powered command line tool that converts natural language to terminal commands and provides in-terminal chat.

Language: Python - Size: 6.98 MB - Last synced at: 5 days ago - Pushed at: 5 months ago - Stars: 42 - Forks: 5

jpmanson/llm_templates

Instruction/chat prompts creation library for text generation LLMs. It supports local and Hugging Face models.

Language: Python - Size: 302 KB - Last synced at: 3 days ago - Pushed at: 9 months ago - Stars: 29 - Forks: 1

SeoyeonPark1223/Gemma-FineTuning

Gemma FineTuning Project

Language: Jupyter Notebook - Size: 167 KB - Last synced at: 29 days ago - Pushed at: 29 days ago - Stars: 1 - Forks: 0

ganeshnikhil/J.A.R.V.I.S.2.0

Open source assistant using small models (2B-5B), with agentic and tool-calling capabilities and RAG integration with efficient memory. Android support using adb.

Language: Python - Size: 569 KB - Last synced at: 29 days ago - Pushed at: 29 days ago - Stars: 125 - Forks: 21

ibrahimhabibeg/Dahih-Al-Dofaa

Personalized AI assistant for university students who answers from the students' slides, textbooks, and notes.

Language: TypeScript - Size: 768 KB - Last synced at: 29 days ago - Pushed at: 29 days ago - Stars: 7 - Forks: 0

jakobhoeg/nextjs-ollama-llm-ui

Fully-featured web interface for Ollama LLMs

Language: TypeScript - Size: 5.87 MB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 1,184 - Forks: 284

albertstarfield/project-zephyrine

Introducing Project Zephyrine: Elevating Your Interaction Plug and Play, and Employing GPU Acceleration within a Modernized Automata Local Graphical User Interface.

Language: JavaScript - Size: 647 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 20 - Forks: 1

NotShrirang/LLM-Garden

Implementing different LLM architectures in single repo

Language: Python - Size: 30.3 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

marklysze/LlamaIndex-RAG-WSL-CUDA

Examples of RAG using Llamaindex with local LLMs - Gemma, Mixtral 8x7B, Llama 2, Mistral 7B, Orca 2, Phi-2, Neural 7B

Language: Jupyter Notebook - Size: 316 KB - Last synced at: 30 days ago - Pushed at: about 1 year ago - Stars: 125 - Forks: 13

AstraBert/qdurllm

Search your favorite websites and chat with them, on your desktop🌐

Language: Python - Size: 1.06 MB - Last synced at: 5 days ago - Pushed at: 4 months ago - Stars: 30 - Forks: 2

jhordyess/chat-bot-ollama

A simple chat bot application built using Next.js and Ollama with the chosen model.

Language: TypeScript - Size: 54.7 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

PavlidisLab/gemma.R

An R wrapper for the Gemma RESTful API

Language: R - Size: 84.1 MB - Last synced at: 15 days ago - Pushed at: about 1 month ago - Stars: 10 - Forks: 1

MaxMLang/RAG-nificent

Production-ready Chainlit RAG application with Pinecone pipeline offering all Groq and OpenAI Models, to chat with your documents.

Language: Python - Size: 35.4 MB - Last synced at: 3 days ago - Pushed at: 2 months ago - Stars: 11 - Forks: 0

QuantiusBenignus/Zshelf

Zsh-centric command-line interface for interacting with local Large Language Models (LLMs). Chat directly on the command line with non-contiguous command line calls.

Language: Shell - Size: 80.1 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 2 - Forks: 1

loong64/ollama

Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 2, and other large language models.

Language: Dockerfile - Size: 18.6 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 8 - Forks: 1

atomostechhq/google-gemma-for-web-exp

A web-based implementation of Google's Gemma language model using MediaPipe Tasks for GenAI.

Language: JavaScript - Size: 4.88 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

proflead/gemma-3-tutorial

Detailed guide on Google's Gemma 3 AI

Language: Python - Size: 49.8 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

johnsonhk88/Deep-Research-With-Web-Scraping-by-LLM-And-AI-Agent

Use an LLM/AI agent for web scraping (data collection) and data analysis with deep research

Language: Jupyter Notebook - Size: 217 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 3 - Forks: 1

snowkylin/refsheet_chat

Chat with a character via reference images!

Language: Python - Size: 24.4 KB - Last synced at: 20 days ago - Pushed at: about 1 month ago - Stars: 2 - Forks: 0

Ananyaiitbhilai/KGViz

[KGC '24] An application for visualising Knowledge Graphs. It employs a novel technique that uses an LLM-based agent for triple extraction from unstructured text. The work was also accepted at Text2KG 2024 (ESWC); this version uses a better prompting strategy, and the tool's backend can be considered an extension of that work.

Language: JavaScript - Size: 1.48 MB - Last synced at: 29 days ago - Pushed at: about 1 year ago - Stars: 10 - Forks: 2

kkdai/linebot-gemma

A LINE Bot demo showcasing how to use a local LLM (Gemma) via Groq to modify personal information and detect the need for LLM assistance.

Language: Python - Size: 1.72 MB - Last synced at: 7 days ago - Pushed at: 10 months ago - Stars: 17 - Forks: 4

iakashpaul/Ghudsavar

Ghudsavar (Horse rider) - Is a quick llama.cpp server for CPU only runtimes

Language: Dockerfile - Size: 16.6 KB - Last synced at: 3 days ago - Pushed at: about 1 year ago - Stars: 4 - Forks: 0

jorge-armando-navarro-flores/chat_with_your_docs

Discover and converse with advanced AI models like Mistral, LLAMA2, and GPT-3.5 from leading sources like OLLAMA, Hugging Face, and OpenAI. Easily extract insights from PDFs, web pages, and YouTube videos with our intuitive interface. Unlock the power of knowledge with seamless chat interactions.

Language: Python - Size: 10.1 MB - Last synced at: 3 days ago - Pushed at: 8 months ago - Stars: 147 - Forks: 14

akshat2602/Omistral

Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, and other large language models.

Language: Rust - Size: 37.1 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

smartloop-ai/smartloop

Smartloop is an open-source SLM platform to train and run models on an edge device

Language: Python - Size: 75.2 KB - Last synced at: 2 days ago - Pushed at: about 2 months ago - Stars: 2 - Forks: 0

JohnClaw/chatllm.v

V-lang api wrapper for llm-inference chatllm.cpp

Language: C - Size: 804 KB - Last synced at: 29 days ago - Pushed at: 6 months ago - Stars: 5 - Forks: 0

get2kiran/llm-finetune

Fine-tune models from Hugging Face using libraries such as trl, peft, and transformers.

Language: Python - Size: 8.79 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

alemiaschi/LLM_profiling

[EMNLP 2024] Materials for the paper "Evaluating Large Language Models via Linguistic Profiling"

Size: 2.82 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

leliuga/cohere-configurations

Co:Here Inference configurations

Language: Go - Size: 9.48 MB - Last synced at: 3 days ago - Pushed at: 12 months ago - Stars: 10 - Forks: 1

proflead/google-ai-studio-tutorial

Google AI Studio Tutorial for Beginners

Size: 0 Bytes - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

PRITHIVSAKTHIUR/Gemma-3-Multimodal

Gemma 3 [ Image-text-text ] [ video inference ] [ multi image chat ]

Language: Python - Size: 0 Bytes - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

sitamgithub-MSIT/sveltekit-huggingface

A pirate-themed chatbot using Gemma2 9B-it via Groq, Vercel AI SDK, and SvelteKit.

Language: CSS - Size: 174 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

ShahDishank/neuron

Neuron is a conversational AI model using the Gemma LLM by Google from Hugging Face. It is designed to engage in a variety of topics and provide information on a wide range of subjects. With its ability to learn and adapt, this chatbot can provide a unique and engaging experience.

Language: Python - Size: 43.9 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

coding-chemist/GroqWarp

GroqWarp is a Streamlit app that compares the performance of RAG using Groq and Ollama models, visualizing response times and accuracy. It leverages FAISS for document retrieval and displays a side-by-side performance chart.

Language: Python - Size: 176 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

Opillion-GmbH-Co-KG/easy-stack-deepseek

Need a seamless AI model hosting setup with Ollama, efficient vector search with Qdrant, and an intuitive WebUI? This is your stack!

Language: Shell - Size: 866 KB - Last synced at: about 1 month ago - Pushed at: 2 months ago - Stars: 3 - Forks: 0

vs4vijay/AI-Playground

All-in-One AI Playground for LLM, Chat, RAG, Agents, etc.

Language: Python - Size: 165 KB - Last synced at: 3 days ago - Pushed at: 3 months ago - Stars: 4 - Forks: 1

Beomi/Gemma-EasyLM

Train GEMMA on TPU/GPU! (Codebase for training Gemma-Ko Series)

Language: Python - Size: 410 KB - Last synced at: 7 days ago - Pushed at: about 1 year ago - Stars: 47 - Forks: 10

marklysze/LlamaIndex-RAG-Linux-CUDA

Examples of RAG using Llamaindex with local LLMs in Linux - Gemma, Mixtral 8x7B, Llama 2, Mistral 7B, Orca 2, Phi-2, Neural 7B

Language: Jupyter Notebook - Size: 175 KB - Last synced at: 30 days ago - Pushed at: about 1 year ago - Stars: 9 - Forks: 4