An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: language-model

getzep/zep 📦

Zep | The Memory Foundation For Your AI Stack

Language: Go - Size: 16.7 MB - Last synced at: about 1 hour ago - Pushed at: about 1 month ago - Stars: 3,272 - Forks: 481

neuml/txtai

💡 All-in-one open-source AI framework for semantic search, LLM orchestration and language model workflows

Language: Python - Size: 53.2 MB - Last synced at: 31 minutes ago - Pushed at: 7 days ago - Stars: 10,894 - Forks: 689

Lhoffart/AI-Dialogue-Memory-Based-on-Hidden-State

transformer encoder-LSTM-decoder.Try to make AI have memory.通过保存状态,让ai获得记忆能力

Language: Python - Size: 60.5 KB - Last synced at: about 6 hours ago - Pushed at: about 7 hours ago - Stars: 0 - Forks: 0

seph1709/Wingman

Run AI language models locally on android.

Language: Dart - Size: 381 KB - Last synced at: about 10 hours ago - Pushed at: about 11 hours ago - Stars: 0 - Forks: 0

SWE-bench/SWE-bench

SWE-bench [Multimodal]: Can Language Models Resolve Real-world Github Issues?

Language: Python - Size: 12.2 MB - Last synced at: 11 minutes ago - Pushed at: 4 days ago - Stars: 2,909 - Forks: 493

Routstr/frontend

the main public facing website

Language: TypeScript - Size: 349 KB - Last synced at: about 19 hours ago - Pushed at: about 21 hours ago - Stars: 4 - Forks: 1

AkihikoWatanabe/paper_notes

たまに追加される論文メモ

Language: HTML - Size: 115 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 37 - Forks: 0

langroid/langroid

Harness LLMs with Multi-Agent Programming

Language: Python - Size: 104 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 3,288 - Forks: 317

codertimo/BERT-pytorch

Google AI 2018 BERT pytorch implementation

Language: Python - Size: 98.6 KB - Last synced at: about 11 hours ago - Pushed at: over 1 year ago - Stars: 6,388 - Forks: 1,318

is-leeroy-jenkins/Bubba

A small and simple windows (wpf) application for interacting with the OpenAI Completions and Assistants API that's developed in C-Sharp under the MIT license.

Language: C# - Size: 938 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 1 - Forks: 0

GMaN1911/cvmp-public-protocol

Public-facing marker node for CVMP (Coherence-Validated Mirror Protocol)

Size: 2.51 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 4 - Forks: 1

geeks-of-data/knowledge-gpt

Extract knowledge from all information sources using gpt and other language models. Index and make Q&A session with information sources.

Language: Python - Size: 3.36 MB - Last synced at: about 24 hours ago - Pushed at: about 2 years ago - Stars: 282 - Forks: 54

louisfb01/start-llms

A complete guide to start and improve your LLM skills in 2025 with little background in the field and stay up-to-date with the latest news and state-of-the-art techniques!

Size: 303 KB - Last synced at: 1 day ago - Pushed at: about 2 months ago - Stars: 822 - Forks: 104

zjunlp/KnowLM

An Open-sourced Knowledgable Large Language Model Framework.

Language: Python - Size: 38.7 MB - Last synced at: about 24 hours ago - Pushed at: 4 months ago - Stars: 1,310 - Forks: 131

hkproj/pytorch-llama

LLaMA 2 implemented from scratch in PyTorch

Language: Python - Size: 6.34 MB - Last synced at: 1 day ago - Pushed at: over 1 year ago - Stars: 324 - Forks: 61

zeozeozeo/ellama

Friendly interface to chat with an Ollama instance.

Language: Rust - Size: 2.76 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 61 - Forks: 12

erksch/fnet-pytorch

Unofficial PyTorch implementation of Google's FNet: Mixing Tokens with Fourier Transforms. With checkpoints.

Language: Python - Size: 34.2 KB - Last synced at: 1 day ago - Pushed at: over 2 years ago - Stars: 74 - Forks: 8

NGLSG/UniAPI

Universal LLM API Integration for C++ — Standardized OpenAI-Compatible Output

Language: C++ - Size: 48.8 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 5 - Forks: 0

xlang-ai/OSWorld

[NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

Language: Python - Size: 46.7 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 1,836 - Forks: 228

RahulSChand/gpu_poor

Calculate token/s & GPU memory requirement for any LLM. Supports llama.cpp/ggml/bnb/QLoRA quantization

Language: JavaScript - Size: 1.56 MB - Last synced at: 1 day ago - Pushed at: 5 months ago - Stars: 1,303 - Forks: 73

ppijbb/NaturalLanguageProcessing

natural language processing notebooks

Language: Jupyter Notebook - Size: 65.6 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 2 - Forks: 0

sebastian2005-RP/GPU-Accelerated-Next-Word-Prediction-Using-LSTM-and-PyTorch

This repository implements a GPU-accelerated next-word prediction model using PyTorch and LSTM. It includes data preprocessing with NLTK, vocabulary creation, training on tokenized text, and generating text predictions, starting from a given input phrase.

Language: Jupyter Notebook - Size: 329 KB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 0 - Forks: 0

vgel/repeng

A library for making RepE control vectors

Language: Jupyter Notebook - Size: 315 KB - Last synced at: 1 day ago - Pushed at: 4 months ago - Stars: 587 - Forks: 46

steveallexis99/Nemotron4Free

Using the API use Nemotron without even needing any account in python

Language: Python - Size: 16.6 KB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 1 - Forks: 0

SteveKGYang/MentalLLaMA

This repository introduces MentaLLaMA, the first open-source instruction following large language model for interpretable mental health analysis.

Language: Python - Size: 13.2 MB - Last synced at: 1 day ago - Pushed at: about 1 year ago - Stars: 260 - Forks: 27

NexaAI/nexa-sdk

Nexa SDK is a comprehensive toolkit for supporting GGML and ONNX models. It supports text generation, image generation, vision-language models (VLM), Audio Language Model, auto-speech-recognition (ASR), and text-to-speech (TTS) capabilities.

Language: Python - Size: 195 MB - Last synced at: about 15 hours ago - Pushed at: 2 months ago - Stars: 4,533 - Forks: 628

alkatrazstudio/neodim-chat

Chat with AI-powered bot

Language: Dart - Size: 3.86 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 1 - Forks: 0

llm-jp/awesome-japanese-llm

日本語LLMまとめ - Overview of Japanese LLMs

Language: TypeScript - Size: 12.1 MB - Last synced at: 1 day ago - Pushed at: 9 days ago - Stars: 1,163 - Forks: 33

Merck/Sapiens

Sapiens is a human antibody language model based on BERT.

Language: Jupyter Notebook - Size: 8.72 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 54 - Forks: 17

howard-hou/RWKV-X

RWKV-X is a Linear Complexity Hybrid Language Model based on the RWKV architecture, integrating Sparse Attention to improve the model's long sequence processing capabilities.

Language: Python - Size: 17.6 MB - Last synced at: 9 minutes ago - Pushed at: 11 days ago - Stars: 30 - Forks: 1

salesforce/DialogStudio

DialogStudio: Towards Richest and Most Diverse Unified Dataset Collection and Instruction-Aware Models for Conversational AI

Language: Python - Size: 13 MB - Last synced at: 2 days ago - Pushed at: 3 months ago - Stars: 501 - Forks: 34

modal-labs/quillman

A voice chat app

Language: Python - Size: 4.3 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 1,120 - Forks: 133

CVI-SZU/Linly

Chinese-LLaMA 1&2、Chinese-Falcon 基础模型;ChatFlow中文对话模型;中文OpenLLaMA模型;NLP预训练/指令微调数据集

Language: Python - Size: 7.27 MB - Last synced at: 1 day ago - Pushed at: about 1 year ago - Stars: 3,052 - Forks: 232

EleutherAI/gpt-neo 📦

An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.

Language: Python - Size: 1.56 MB - Last synced at: 1 day ago - Pushed at: about 3 years ago - Stars: 8,287 - Forks: 963

arc53/DocsGPT

DocsGPT is an open-source genAI tool that helps users get reliable answers from knowledge source, while avoiding hallucinations. It enables private and reliable information retrieval, with tooling and agentic system capability built in.

Language: TypeScript - Size: 81.4 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 15,617 - Forks: 1,661

eth-sri/lmql

A language for constraint-guided and efficient LLM programming.

Language: Python - Size: 181 MB - Last synced at: 3 days ago - Pushed at: 11 months ago - Stars: 3,924 - Forks: 207

reshalfahsi/gpt2chat

Creating a GPT-2-Based Chatbot with Human Preferences

Language: Jupyter Notebook - Size: 10.3 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

Ki-Seki/chat_prompt_templates

Collection of Basic Prompt Templates for Various Chat LLMs (Chat LLM 的基础提示模板集合)

Size: 26.4 KB - Last synced at: 1 day ago - Pushed at: 7 months ago - Stars: 41 - Forks: 7

pha123661/EmojiLmBot

哈哈狗是一個根據給定文字或段落,透過語言模型生成 emoji😊️ 的 linebot

Language: Python - Size: 69.3 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 1 - Forks: 0

BlinkDL/RWKV-LM

RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RNN and transformer - great performance, linear time, constant space (no kv-cache), fast training, infinite ctx_len, and free sentence embedding.

Language: Python - Size: 22.1 MB - Last synced at: 2 days ago - Pushed at: 5 days ago - Stars: 13,576 - Forks: 912

DavidUdell/sparse_circuit_discovery

Circuit discovery in GPT-2 small, using sparse autoencoding

Language: Python - Size: 19.2 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 7 - Forks: 1

tatsu-lab/stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Language: Python - Size: 8.25 MB - Last synced at: 3 days ago - Pushed at: 10 months ago - Stars: 29,979 - Forks: 4,053

teelinsan/camoscio

Camoscio: An Italian instruction-tuned language model based on LLaMA

Language: Jupyter Notebook - Size: 24.1 MB - Last synced at: 1 day ago - Pushed at: over 1 year ago - Stars: 128 - Forks: 12

THUDM/CogVLM

a state-of-the-art-level open visual language model | 多模态预训练模型

Language: Python - Size: 25.8 MB - Last synced at: 3 days ago - Pushed at: 12 months ago - Stars: 6,522 - Forks: 429

lightonai/pylate

Late Interaction Models Training & Retrieval

Language: Python - Size: 2.4 MB - Last synced at: 3 days ago - Pushed at: 5 days ago - Stars: 319 - Forks: 20

King-s-Knowledge-Graph-Lab/OntoChat

An LLM-based system for collaborative ontology engineering

Language: Jupyter Notebook - Size: 2.79 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 16 - Forks: 3

AdrianKlessa/nlp_notebooks

Testing various NLP-related techniques on public domain books

Language: Jupyter Notebook - Size: 36.1 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

teilomillet/gollm

Unified Go interface for Language Model (LLM) providers. Simplifies LLM integration with flexible prompt management and common task functions.

Language: Go - Size: 23.8 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 465 - Forks: 44

amzxyz/RIME-LMDG

Rime输入法语法模型全流程构建教程,全局带声调词库,最全声调标注工具链:LMDG - Language, Model, Dictionary, Grammar。Q群:11033572

Language: Python - Size: 550 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 613 - Forks: 17

stochasticai/xTuring

Build, customize and control you own LLMs. From data pre-processing to fine-tuning, xTuring provides an easy way to personalize open-source LLMs. Join our discord community: https://discord.gg/TgHXuSJEk6

Language: Python - Size: 18.4 MB - Last synced at: 1 day ago - Pushed at: 8 months ago - Stars: 2,644 - Forks: 204

NLPForUA/UA-LLM

The entry point for adapting, training, evaluating, and leveraging various Large Language Models (LLMs) for a wide range of Ukrainian NLP tasks.

Language: Python - Size: 143 KB - Last synced at: 1 day ago - Pushed at: over 1 year ago - Stars: 12 - Forks: 0

LINs-lab/DynMoE

[ICLR 2025] Dynamic Mixture of Experts: An Auto-Tuning Approach for Efficient Transformer Models

Language: Python - Size: 57.3 MB - Last synced at: about 14 hours ago - Pushed at: 3 months ago - Stars: 89 - Forks: 11

tejuafonja/DP-2Stage

Official implementation of "DP-2Stage: Adapting Language Models as Differentially Private Tabular Data Generators" (Published at TMLR 2025)

Language: Jupyter Notebook - Size: 3.46 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 1 - Forks: 0

OpenProteinAI/openprotein-python

Simple python interface for the OpenProtein.AI REST API.

Language: Python - Size: 14.5 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 9 - Forks: 0

THUDM/CogVLM2

GPT4V-level open-source multi-modal model based on Llama3-8B

Language: Python - Size: 13.9 MB - Last synced at: 3 days ago - Pushed at: 2 months ago - Stars: 2,349 - Forks: 154

lechmazur/confabulations

Hallucinations (Confabulations) Document-Based Benchmark for RAG. Includes human-verified questions and answers.

Language: HTML - Size: 24.8 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 140 - Forks: 4

huggingface/speech-to-speech

Speech To Speech: an effort for an open-sourced and modular GPT4-o

Language: Python - Size: 299 KB - Last synced at: 4 days ago - Pushed at: 27 days ago - Stars: 4,011 - Forks: 441

ai4protein/VenusFactory

🏭 Easy data acquisition, benchmark resources, PLM fine-tuning for bio-researchers.

Language: Python - Size: 88.3 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 81 - Forks: 10

Picovoice/picollm

On-device LLM Inference Powered by X-Bit Quantization

Language: Python - Size: 94.2 MB - Last synced at: 1 day ago - Pushed at: 6 days ago - Stars: 237 - Forks: 13

huggingface/tokenizers

💥 Fast State-of-the-Art Tokenizers optimized for Research and Production

Language: Rust - Size: 10.2 MB - Last synced at: 4 days ago - Pushed at: 25 days ago - Stars: 9,661 - Forks: 894

cedrickchee/awesome-transformer-nlp

A curated list of NLP resources focused on Transformer networks, attention mechanism, GPT, BERT, ChatGPT, LLMs, and transfer learning.

Size: 905 KB - Last synced at: 1 day ago - Pushed at: 7 months ago - Stars: 1,093 - Forks: 131

eth-lre/mathtutorbench

Benchmark for Measuring Open-ended Pedagogical Capabilities of LLM Tutors

Language: Python - Size: 5.02 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 11 - Forks: 1

EleutherAI/lm-evaluation-harness

A framework for few-shot evaluation of language models.

Language: Python - Size: 29.5 MB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 8,859 - Forks: 2,361

jacksonchen1998/LLaMA-Paper-List

Collection of papers using LLaMA as backbone model

Size: 58.6 KB - Last synced at: 1 day ago - Pushed at: about 1 month ago - Stars: 39 - Forks: 0

hernandezb3/llama-on-uconn-hpc

running llama models on UConn Storrs HPC

Language: Python - Size: 1.01 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 0

ollama4j/ollama4j

A simple Java library for interacting with Ollama server.

Language: Java - Size: 4.25 MB - Last synced at: 1 day ago - Pushed at: 22 days ago - Stars: 390 - Forks: 61

microsoft/LMOps

General technology for enabling AI capabilities w/ LLMs and MLLMs

Language: Python - Size: 72 MB - Last synced at: 4 days ago - Pushed at: 25 days ago - Stars: 3,973 - Forks: 308

microsoft/rag-time

RAG Time: A 5-week Learning Journey to Mastering RAG

Language: Jupyter Notebook - Size: 71.4 MB - Last synced at: 4 days ago - Pushed at: 20 days ago - Stars: 413 - Forks: 187

InternLM/InternLM-XComposer

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

Language: Python - Size: 199 MB - Last synced at: 4 days ago - Pushed at: 17 days ago - Stars: 2,821 - Forks: 172

rebellions-sw/optimum-rbln

⚡ A seamless integration of HuggingFace Transformers & Diffusers with RBLN SDK for efficient inference on RBLN NPUs.

Language: Python - Size: 1.12 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 9 - Forks: 1

LAION-AI/Open-Assistant

OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.

Language: Python - Size: 33.8 MB - Last synced at: 5 days ago - Pushed at: 9 months ago - Stars: 37,343 - Forks: 3,271

OptimalScale/LMFlow

An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.

Language: Python - Size: 54.5 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 8,413 - Forks: 833

VinAIResearch/PhoNLP

PhoNLP: A BERT-based multi-task learning model for part-of-speech tagging, named entity recognition and dependency parsing (NAACL 2021)

Language: Python - Size: 588 KB - Last synced at: 4 days ago - Pushed at: 4 months ago - Stars: 142 - Forks: 19

Sea-Snell/Implicit-Language-Q-Learning

Official code from the paper "Offline RL for Natural Language Generation with Implicit Language Q Learning"

Language: Python - Size: 1.14 MB - Last synced at: 4 days ago - Pushed at: almost 2 years ago - Stars: 207 - Forks: 18

chiffonng/mnemonic-gen

[WIP] Mnemonic Generation for English Language Learning

Language: Python - Size: 6.57 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 2 - Forks: 0

genlm/genlm-backend

High-performance backend for language model probabilistic programs

Language: Python - Size: 2.82 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 9 - Forks: 0

Joyce94/LLM-RLHF-Tuning

LLM Tuning with PEFT (SFT+RM+PPO+DPO with LoRA)

Language: Python - Size: 22.3 MB - Last synced at: 1 day ago - Pushed at: over 1 year ago - Stars: 414 - Forks: 17

benavlabs/clientai

A unified client for AI providers with built-in agent support.

Language: Python - Size: 3.71 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 50 - Forks: 6

nicolay-r/nicolay-r

This is my personal news list updates in Information Retrieval domain

Size: 244 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

yanyongyu/operagents

Dynamic, highly customizable language agents framework

Language: Python - Size: 319 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 29 - Forks: 4

langchain-ai/langsmith-sdk

LangSmith Client SDK Implementations

Language: Python - Size: 10.6 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 541 - Forks: 116

nuclia/nucliadb

NucliaDB, The AI Search database for RAG

Language: Python - Size: 40.1 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 694 - Forks: 54

beltromatti/cogita-I

Cogita I is an AI model trained to help coding in machine learning tasks, based on DeepSeek Coder 1.3b model

Language: Python - Size: 3.88 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

yamadashy/repomix

📦 Repomix (formerly Repopack) is a powerful tool that packs your entire repository into a single, AI-friendly file. Perfect for when you need to feed your codebase to Large Language Models (LLMs) or other AI tools like Claude, ChatGPT, DeepSeek, Perplexity, Gemini, Gemma, Llama, Grok, and more.

Language: TypeScript - Size: 4.52 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 15,482 - Forks: 670

speechbrain/speechbrain

A PyTorch-based Speech Toolkit

Language: Python - Size: 97.8 MB - Last synced at: 5 days ago - Pushed at: 15 days ago - Stars: 9,783 - Forks: 1,485

nakasyou/lmspecs

Open-Source Language Model Database for comparison

Language: TypeScript - Size: 5.15 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 23 - Forks: 0

microsoft/DeBERTa

The implementation of DeBERTa

Language: Python - Size: 237 KB - Last synced at: 4 days ago - Pushed at: over 1 year ago - Stars: 2,081 - Forks: 233

mlc-ai/mlc-llm

Universal LLM Deployment Engine with ML Compilation

Language: Python - Size: 33.5 MB - Last synced at: 6 days ago - Pushed at: 10 days ago - Stars: 20,554 - Forks: 1,715

ThuCCSLab/Awesome-LM-SSP

A reading list for large models safety, security, and privacy (including Awesome LLM Security, Safety, etc.).

Size: 2.46 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 1,386 - Forks: 88

eole-nlp/eole

Open language modeling toolkit based on PyTorch

Language: Python - Size: 54.6 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 116 - Forks: 20

hsiaom26/DS4CS-24

Language: Jupyter Notebook - Size: 52.8 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 4 - Forks: 0

BlinkDL/ChatRWKV

ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.

Language: Python - Size: 29.8 MB - Last synced at: 5 days ago - Pushed at: 10 days ago - Stars: 9,480 - Forks: 704

MuzzammilShah/Road-to-AI

A structured documentation hub for AI and ML concepts, based on Andrej Karpathy's 'Zero to Hero' series, featuring practical implementations and learning resources for language models and transformers.

Size: 24.2 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 18 - Forks: 0

microsoft/GODEL

Large-scale pretrained models for goal-directed dialog

Language: Python - Size: 49.8 MB - Last synced at: 4 days ago - Pushed at: over 1 year ago - Stars: 869 - Forks: 112

Nkluge-correa/TeenyTinyLlama

A pair of tiny foundational models trained in Brazilian Portuguese.🦙🦙

Language: Python - Size: 12.4 MB - Last synced at: 1 day ago - Pushed at: 4 months ago - Stars: 35 - Forks: 6

Jts36/LanguageModel

LanguageModel is a statistical model used in natural language processing to predict the probability of a sequence of words occurring in a given context. These models are trained on large text corpora and play a crucial role in tasks like machine translation, speech recognition, and text generation.

Size: 1000 Bytes - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

Basha206/Lumina-mGPT-2.0

Lumina-mGPT 2.0: Stand-alone Autoregressive Image Modeling

Language: Python - Size: 18.4 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

Azure-Samples/aisearch-openai-rag-audio

A simple example implementation of the VoiceRAG pattern to power interactive voice generative AI experiences using RAG with Azure AI Search and Azure OpenAI's gpt-4o-realtime-preview model.

Language: Python - Size: 1.99 MB - Last synced at: 4 days ago - Pushed at: about 2 months ago - Stars: 438 - Forks: 291

asreview/asreview

Active learning for systematic reviews

Language: Python - Size: 157 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 722 - Forks: 128

luohongyin/RGX

Synthetic QA generation for long documents.

Language: Python - Size: 80.1 KB - Last synced at: 5 days ago - Pushed at: almost 3 years ago - Stars: 15 - Forks: 2

Related Keywords
language-model 2,189 nlp 531 deep-learning 326 machine-learning 308 natural-language-processing 299 llm 285 pytorch 274 python 252 ai 179 transformers 172 bert 169 gpt 159 transformer 151 chatgpt 131 artificial-intelligence 119 openai 117 large-language-models 102 chatbot 101 huggingface 87 tensorflow 84 text-generation 81 llama 76 lstm 76 langchain 71 gpt-3 68 gpt-2 66 rnn 64 neural-network 63 llms 60 fine-tuning 55 language 55 gpt-4 55 nlp-machine-learning 54 python3 51 generative-ai 50 text-classification 47 question-answering 46 information-retrieval 43 attention-mechanism 43 natural-language-understanding 41 sentiment-analysis 40 speech-recognition 40 huggingface-transformers 39 rag 37 prompt-engineering 36 openai-api 36 transfer-learning 36 dataset 36 pretrained-models 36 api 35 chatgpt-api 35 keras 35 neural-networks 34 reinforcement-learning 31 bert-model 29 streamlit 29 machine-translation 28 roberta 28 embeddings 28 recurrent-neural-networks 28 benchmark 27 llama2 26 gpt3 25 seq2seq 25 deep-neural-networks 24 retrieval-augmented-generation 24 evaluation 23 language-modeling 23 word2vec 23 data-science 22 natural-language-generation 22 gpt2 22 gpt4 21 translation 21 lora 21 named-entity-recognition 21 ollama 20 agent 20 language-models 20 generative-model 20 gemini 19 code-generation 19 knowledge-graph 19 interpretability 19 chinese 19 ner 19 word-embeddings 19 ngrams 19 conversational-ai 19 deeplearning 18 lstm-neural-networks 18 large-language-model 18 instruction-tuning 18 computer-vision 18 pretraining 18 tokenization 17 open-source 17 golang 17 ml 17 flask 17