An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: embedding

modelscope/ms-swift

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen2.5-VL, Qwen2.5-Omni, Qwen2-Audio, Ovis2, InternVL3, Llava, GLM4v, Phi4, ...) (AAAI 2025).

Language: Python - Size: 63.3 MB - Last synced at: about 3 hours ago - Pushed at: about 4 hours ago - Stars: 8,286 - Forks: 711

TIGER-AI-Lab/VLM2Vec

This repo contains the code for "VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasks" [ICLR25]

Language: Python - Size: 10.3 MB - Last synced at: about 7 hours ago - Pushed at: about 8 hours ago - Stars: 270 - Forks: 19

pg-space/panspace

Embedding-based indexing for compact storage, rapid querying, and curation of bacterial pan-genomes

Language: Jupyter Notebook - Size: 23.9 MB - Last synced at: about 7 hours ago - Pushed at: about 8 hours ago - Stars: 10 - Forks: 0

ContextualAI/gritlm

Generative Representational Instruction Tuning

Language: Jupyter Notebook - Size: 11.3 MB - Last synced at: about 11 hours ago - Pushed at: about 12 hours ago - Stars: 654 - Forks: 47

llm-tools/embedJs

A NodeJS RAG framework to easily work with LLMs and embeddings

Language: TypeScript - Size: 3.13 MB - Last synced at: about 18 hours ago - Pushed at: 10 days ago - Stars: 529 - Forks: 64

zach10w/IA-Classification-Analysis

This GitHub repository hosts my AI evaluation work, featuring a Kaggle dataset analysis, experiments with three ML algorithms (including hyperparameter tuning), and a detailed exploration of wine quality data through outlier detection, correlation, and normalization techniques.

Language: Jupyter Notebook - Size: 5.59 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 0 - Forks: 0

redis/redis-vl-python

Redis Vector Library (RedisVL) -- the AI-native Python client for Redis.

Language: Python - Size: 79.3 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 308 - Forks: 53

vercel/modelfusion

The TypeScript library for building AI applications.

Language: TypeScript - Size: 15.6 MB - Last synced at: 1 day ago - Pushed at: 11 months ago - Stars: 1,279 - Forks: 90

withcatai/node-llama-cpp

Run AI models locally on your machine with node.js bindings for llama.cpp. Enforce a JSON schema on the model output on the generation level

Language: TypeScript - Size: 21.9 MB - Last synced at: 2 days ago - Pushed at: 14 days ago - Stars: 1,552 - Forks: 133

LnYo-Cly/ai4j

一款JavaSDK用于快速接入AI大模型应用,整合多平台大模型,如OpenAi、智谱Zhipu(ChatGLM)、深度求索DeepSeek、月之暗面Moonshot(Kimi)、腾讯混元Hunyuan、零一万物(01)等等,提供统一的输入输出(对齐OpenAi)消除差异化,优化函数调用(Tool Call),优化RAG调用、支持向量数据库(Pinecone)、内置联网增强,并且支持JDK1.8,为用户提供快速整合AI的能力。

Language: Java - Size: 313 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 317 - Forks: 40

chatchat-space/Langchain-Chatchat

Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and Llama) RAG and Agent app with langchain

Language: TypeScript - Size: 138 MB - Last synced at: 4 days ago - Pushed at: 3 months ago - Stars: 35,353 - Forks: 5,923

benedekrozemberczki/awesome-community-detection

A curated list of community detection research papers with implementations.

Language: Python - Size: 2.21 MB - Last synced at: 1 day ago - Pushed at: over 1 year ago - Stars: 2,382 - Forks: 362

groupultra/telegram-search

🔍 一个功能强大的 Telegram 聊天记录搜索工具,支持向量搜索和语义匹配。A powerful Telegram chat search tool with vector search and semantic matching capabilities.

Language: TypeScript - Size: 6.94 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 2,350 - Forks: 167

sebastian2005-RP/GPU-Accelerated-Next-Word-Prediction-Using-LSTM-and-PyTorch

This repository implements a GPU-accelerated next-word prediction model using PyTorch and LSTM. It includes data preprocessing with NLTK, vocabulary creation, training on tokenized text, and generating text predictions, starting from a given input phrase.

Language: Jupyter Notebook - Size: 329 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

titanium0202/Coffee_Shop_AI_Agents

About This project is an innovative coffee shop application designed to bring an engaging and personalized experience to coffee lovers. The app leverages AI-powered agents for chat-based interactions and integrates modern web and mobile development techniques to provide seamless ordering and delivery services.

Size: 31.9 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

MakerCorn/nbedr

Embedding Toolkit for RAG Applications

Language: Python - Size: 895 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 1 - Forks: 0

shell-nlp/gpt_server

gpt_server是一个用于生产级部署LLMs、Embedding、Reranker、ASR和TTS的开源框架。

Language: Python - Size: 4.28 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 193 - Forks: 16

devflowinc/trieve

All-in-one platform for search, recommendations, RAG, and analytics offered via API

Language: TypeScript - Size: 172 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 2,301 - Forks: 201

techdomegh/ai-news-scraper

AI News Scraper & Semantic Search: A Python application that scrapes news articles, uses GenAI to generate summaries and identify topics, and provides semantic search capabilities through vector embeddings

Language: Python - Size: 258 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

tomcz/golang-webapp

Basic golang web application skeleton

Language: JavaScript - Size: 695 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 6 - Forks: 1

apple/embedding-atlas

Embedding Atlas is a tool that provides interactive visualizations for large embeddings. It allows you to visualize, cross-filter, and search embeddings and metadata.

Language: TypeScript - Size: 18.5 MB - Last synced at: 6 days ago - Pushed at: 14 days ago - Stars: 161 - Forks: 5

infiniflow/infinity

The AI-native database built for LLM applications, providing incredibly fast hybrid search of dense vector, sparse vector, tensor (multi-vector), and full-text

Language: C++ - Size: 64.4 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 3,755 - Forks: 352

quarkiverse/quarkus-docling

Docling simplifies document processing, parsing diverse formats — including advanced PDF understanding — and providing seamless integrations with the gen AI ecosystem

Language: Java - Size: 96.7 KB - Last synced at: 7 days ago - Pushed at: 12 days ago - Stars: 2 - Forks: 0

PaddlePaddle/ERNIE-SDK

ERNIE Bot Agent is a Large Language Model (LLM) Agent Framework, powered by the advanced capabilities of ERNIE Bot and the platform resources of Baidu AI Studio.

Language: Jupyter Notebook - Size: 3.3 MB - Last synced at: 7 days ago - Pushed at: 10 months ago - Stars: 366 - Forks: 54

PaddlePaddle/PaddleNLP

Easy-to-use and powerful LLM and SLM library with awesome model zoo.

Language: Python - Size: 110 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 12,637 - Forks: 3,046

acelee0621/MemeMind

FastAPI Local RAG Q&A Demo

Language: Python - Size: 839 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 1 - Forks: 0

DevExpress-Examples/wpf-dashboard-how-to-embed-devexpress-control

This example demonstrates how to embed a DevExpress Gantt Control in a dashboard loaded in WPF Viewer and bind the Gantt control to the dashboard's data.

Language: C# - Size: 210 KB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 3 - Forks: 3

run-llama/LlamaIndexTS

Data framework for your LLM applications. Focus on server side solution

Language: TypeScript - Size: 76.1 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 2,690 - Forks: 449

marcominerva/ChatGptNet

A ChatGPT integration library for .NET, supporting both OpenAI and Azure OpenAI Service

Language: C# - Size: 4.29 MB - Last synced at: about 8 hours ago - Pushed at: 8 months ago - Stars: 316 - Forks: 38

myreader-io/myGPTReader

A community-driven way to read and chat with AI bots - powered by chatGPT.

Language: Python - Size: 9.76 MB - Last synced at: 7 days ago - Pushed at: about 1 year ago - Stars: 4,439 - Forks: 452

hyeonsangjeon/AWS-LLM-SageMaker

SageMaker Ployglot based RAG opensearch

Language: Jupyter Notebook - Size: 2.81 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 16 - Forks: 2

PotatoHD404/QwenRag

A powerful RAG system for querying code repositories using tree-sitter parsing, LanceDB vector storage, and Qwen models

Language: Python - Size: 35.2 KB - Last synced at: 9 days ago - Pushed at: 10 days ago - Stars: 1 - Forks: 0

opensolon/solon-ai

Java AI(智能体) 全场景应用开发框架(LLM,Function Call,RAG,Embedding,Reranking,Flow,MCP Server,Mcp Client,Mcp Proxy)。同时兼容 java8 ~ java24。

Language: Java - Size: 10.2 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 77 - Forks: 13

bitwise74/vxinst

Blazing fast embedder for instagram posts

Language: Go - Size: 330 KB - Last synced at: 2 days ago - Pushed at: 3 months ago - Stars: 10 - Forks: 1

microsoft/rag-experiment-accelerator

The RAG Experiment Accelerator is a versatile tool designed to expedite and facilitate the process of conducting experiments and evaluations using Azure Cognitive Search and RAG pattern.

Language: Python - Size: 4.36 MB - Last synced at: 6 days ago - Pushed at: 3 months ago - Stars: 257 - Forks: 90

Michael-JB/bm25

A BM25 embedder, scorer, and search engine, written in Rust.

Language: Rust - Size: 204 KB - Last synced at: 10 days ago - Pushed at: 11 days ago - Stars: 22 - Forks: 3

NicolasSiver/nodebb-plugin-ns-embed

Embed media and rich content in posts: YouTube, Vimeo, Twitch and more.

Language: JavaScript - Size: 1.07 MB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 30 - Forks: 18

sysread/fnord

fnord is an AI-powered conversational interface to your code, docs, and notes.

Language: Elixir - Size: 22.9 MB - Last synced at: 6 days ago - Pushed at: 15 days ago - Stars: 25 - Forks: 3

metehan777/embedding-analysis-with-gemini-claude

Run a content embedding analysis with Gemini's 3k dimensions model with Claude 3.7 Sonnet Thinking

Language: Python - Size: 13.7 KB - Last synced at: 12 days ago - Pushed at: 2 months ago - Stars: 17 - Forks: 1

kexi-bq/embedding-explainer

Interactive editor for text meaning via embedding vector control

Language: Python - Size: 238 MB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 0 - Forks: 0

amansrivastava17/embedding-as-service

One-Stop Solution to encode sentence to fixed length vectors from various embedding techniques

Language: Python - Size: 1.93 MB - Last synced at: 7 days ago - Pushed at: about 2 years ago - Stars: 207 - Forks: 30

microsoft/MSVBASE

MSVBASE is a system that efficiently supports complex queries of both approximate similarity search and relational operators. It integrates high-dimensional vector indices into PostgreSQL, a relational database to facilitate complex approximate similarity queries.

Language: C++ - Size: 39.3 MB - Last synced at: 6 days ago - Pushed at: 7 months ago - Stars: 94 - Forks: 12

Ash-World/ash-space

Ash's Space

Size: 80.7 MB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 0 - Forks: 0

OpenBMB/UltraRAG

Build & Optimize your RAG.

Language: Python - Size: 20.3 MB - Last synced at: 18 days ago - Pushed at: about 1 month ago - Stars: 688 - Forks: 51

pr0mila/MediRag-Guard

A RAG Proof of Concept that delivers comprehensive, context-aware insights on healthcare data privacy through a novel knowledge tree.

Language: Python - Size: 6.05 MB - Last synced at: 1 day ago - Pushed at: 26 days ago - Stars: 10 - Forks: 0

RealAlexandreAI/json-repair

🔧 Repair JSON!Solution for JSON Anomalies from LLMs.

Language: Go - Size: 236 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 257 - Forks: 11

Shakiba-Alipour/Data-Mining-Project

Data mining on university of twente website

Language: Python - Size: 48.8 KB - Last synced at: 20 days ago - Pushed at: 20 days ago - Stars: 1 - Forks: 0

Anishgoswamicode/wikipedia-semantic-clustering

Unsupervised semantic clustering of Wikipedia topics using Sentence-BERT embeddings, UMAP for visualization, and DBSCAN for topic discovery

Language: Jupyter Notebook - Size: 149 KB - Last synced at: 20 days ago - Pushed at: 20 days ago - Stars: 0 - Forks: 0

gusye1234/nano-vectordb

A simple, easy-to-hack Vector Database

Language: Python - Size: 24.4 KB - Last synced at: 18 days ago - Pushed at: 8 months ago - Stars: 144 - Forks: 6

tomMEM/RAG_with_LM-studio

RAG with LM studio, local LLMs, Scientific PDF text extraction,

Language: Jupyter Notebook - Size: 537 KB - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 2 - Forks: 0

TilmanGriesel/chipper

✨ AI interface for tinkerers (Ollama, Haystack RAG, Python)

Language: Python - Size: 84.5 MB - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 450 - Forks: 42

wallehxz/goods_vectors

图片向量转化,通过图片查找相似的商品,商品后台向量管理,图片物品识别擦肩,Yolo 模型微调训练增强

Language: JavaScript - Size: 82.3 MB - Last synced at: 22 days ago - Pushed at: 22 days ago - Stars: 4 - Forks: 0

chataize/semantic-index

Lightweight, single-file vector database for experiments and small projects.

Language: C# - Size: 66.4 KB - Last synced at: 22 days ago - Pushed at: 22 days ago - Stars: 1 - Forks: 0

OguzhanUmutlu/html-node

This module lets you to run node codes in your html!

Language: JavaScript - Size: 7.81 KB - Last synced at: 14 days ago - Pushed at: about 2 years ago - Stars: 3 - Forks: 0

chrislemke/autoembedder

PyTorch autoencoder with additional embeddings layer for categorical data 🚘

Language: Python - Size: 935 KB - Last synced at: 5 days ago - Pushed at: 9 days ago - Stars: 8 - Forks: 0

AleksNeStu/ai-news-scraper

AI News Scraper & Semantic Search: A Python application that scrapes news articles, uses GenAI to generate summaries and identify topics, and provides semantic search capabilities through vector embeddings

Language: Python - Size: 18.5 MB - Last synced at: 24 days ago - Pushed at: 24 days ago - Stars: 0 - Forks: 0

cmccomb/map-of-eng

Language: HTML - Size: 55.7 MB - Last synced at: 24 days ago - Pushed at: 24 days ago - Stars: 0 - Forks: 0

tiansztiansz/python-data-science

b站 AI日日新 不定期更新使用Python框架完成机器学习、深度学习、数据科学任务

Language: Jupyter Notebook - Size: 4.78 MB - Last synced at: 25 days ago - Pushed at: 25 days ago - Stars: 3 - Forks: 0

MrRobot-t/image-search-by-image

Image Search by Image, Image Search Engine Based on Graph Vector And Vector Database(以图搜图,基于图向量和向量数据库的图片搜索引擎)

Language: Python - Size: 11.3 MB - Last synced at: 27 days ago - Pushed at: 27 days ago - Stars: 11 - Forks: 3

awslabs/amazon-denseclus

Clustering for mixed-type data

Language: Jupyter Notebook - Size: 4.53 MB - Last synced at: 3 days ago - Pushed at: 11 months ago - Stars: 99 - Forks: 21

adambielski/siamese-triplet

Siamese and triplet networks with online pair/triplet mining in PyTorch

Language: Python - Size: 12.3 MB - Last synced at: 28 days ago - Pushed at: about 2 years ago - Stars: 3,141 - Forks: 634

zhezhaoa/ngram2vec

Four word embedding models implemented in Python. Supporting arbitrary context features

Language: Python - Size: 722 KB - Last synced at: 22 days ago - Pushed at: almost 6 years ago - Stars: 851 - Forks: 174

WoodFY/MSMCE

The GitHub repository for the paper MSMCE.

Language: Jupyter Notebook - Size: 1.76 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

marl/openl3

OpenL3: Open-source deep audio and image embeddings

Language: Jupyter Notebook - Size: 687 MB - Last synced at: 27 days ago - Pushed at: about 2 years ago - Stars: 517 - Forks: 60

smx-smx/EzDotnet

Load a C# assembly from a native executable or a shared library

Language: CMake - Size: 120 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 13 - Forks: 4

EvilFreelancer/docker-llama.cpp-rpc

Данный проект основан на llama.cpp и компилирует только RPC-сервер, а так же вспомогательные утилиты, работающие в режиме RPC-клиента, необходимые для реализации распределённого инференса конвертированных в GGUF формат Больших Языковых Моделей (БЯМ) и Эмбеддинговых Моделей.

Language: Shell - Size: 294 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 17 - Forks: 3

Project-Unicron/Monocle

Monocle is a multi-modal embedding service designed for easy integration into modern applications. It provides HTTP API endpoints for generating text and image embeddings using state-of-the-art models. Monocle is ideal for semantic search, recommendation, and AI-powered content understanding.

Language: Python - Size: 22.5 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

harmonydata/harmony

The Harmony Python library: a research tool for psychologists to harmonise data and questionnaire items. Open source.

Language: Python - Size: 23.7 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 28 - Forks: 45

cair/pyTsetlinMachine

Implements the Tsetlin Machine, Convolutional Tsetlin Machine, Regression Tsetlin Machine, Weighted Tsetlin Machine, and Embedding Tsetlin Machine, with support for continuous features, multigranularity, clause indexing, and literal budget

Language: C - Size: 611 KB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 138 - Forks: 29

zhaodiwa/findme

一款电脑上通过自然语言语义来搜索文件的工具,注重数据隐私和数据安全

Language: JavaScript - Size: 118 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

cblearn/cblearn

Comparison-based Machine Learning in Python

Language: Python - Size: 2.41 MB - Last synced at: 10 days ago - Pushed at: about 1 year ago - Stars: 19 - Forks: 9

geeks-of-data/knowledge-gpt

Extract knowledge from all information sources using gpt and other language models. Index and make Q&A session with information sources.

Language: Python - Size: 3.36 MB - Last synced at: 2 days ago - Pushed at: about 2 years ago - Stars: 281 - Forks: 54

Dannyj1/mistral-java-client

Mistral-java-client is a client for the Mistral.ai API. It allows you to easily interact with the Mistral AI models. Currently supports all mistral chat completion, OCR and embedding models.

Language: Java - Size: 708 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 7 - Forks: 1

Charmve/StegaStamp-plus

Improved the original repo, 'Invisible Hyperlinks in Physical Photographs', embedded with longer string than the original

Language: Jupyter Notebook - Size: 373 KB - Last synced at: 5 days ago - Pushed at: about 1 year ago - Stars: 35 - Forks: 0

Embedding/Chinese-Word-Vectors

100+ Chinese Word Vectors 上百种预训练中文词向量

Language: Python - Size: 1.42 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 12,010 - Forks: 2,329

pavlin-policar/openTSNE

Extensible, parallel implementations of t-SNE

Language: Python - Size: 70 MB - Last synced at: about 1 month ago - Pushed at: 8 months ago - Stars: 1,530 - Forks: 173

Ryota-Kawamura/LangChain-Chat-with-Your-Data

Start building practical applications that allow you to interact with data using LangChain and LLMs.

Language: Jupyter Notebook - Size: 71.8 MB - Last synced at: 5 days ago - Pushed at: almost 2 years ago - Stars: 44 - Forks: 43

greener-group/progres

Fast protein structure searching or your money back

Language: Python - Size: 217 MB - Last synced at: 13 days ago - Pushed at: 3 months ago - Stars: 103 - Forks: 4

deepomicslab/GCNFrame

This is a python package for genomics study with a GCN framework.

Language: Python - Size: 2.46 MB - Last synced at: 7 days ago - Pushed at: 7 months ago - Stars: 41 - Forks: 8

bunyaminergen/WavLMMSDD

This repository combines `WavLM`, a powerful speech representation model from Microsoft, with `MSDD` (Multi-Scale Diarization Decoder), a state-of-the-art approach for speaker diarization from Nvidia.

Language: Jupyter Notebook - Size: 1.8 MB - Last synced at: 14 days ago - Pushed at: 4 months ago - Stars: 7 - Forks: 3

Divyansh6799/Chat-with-Multiple-Pdfs

This app is a Python application that allows you to chat with multiple PDF documents. You can ask questions about the PDFs using natural language, and the application will provide relevant responses based on the content of the documents. This app utilizes a language model to generate accurate answers to your queries.

Language: Python - Size: 1.56 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

DevsHero/db2vec

db2vec: High-performance Rust CLI to parse database dumps (.sql, .surql), generate vector embeddings via Ollama, TEI, Gemini, and load into vector databases (Pinecone, Redis, Chroma, Milvus, Qdrant, SurrealDB). Optimized for speed on large datasets.

Language: Rust - Size: 31.7 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 20 - Forks: 1

milvus-io/milvus-model

A library integrating embedding and reranker models from OpenAI, SentenceTransformers etc for semantic search in vector database.

Language: Python - Size: 127 KB - Last synced at: 19 days ago - Pushed at: 3 months ago - Stars: 42 - Forks: 27

guangzhengli/vectorhub

Quickly and easily build AI website or application by using embeddings!

Language: TypeScript - Size: 3.42 MB - Last synced at: about 1 month ago - Pushed at: 12 months ago - Stars: 379 - Forks: 44

fredsiika/huxley-pdf

Upload personal docs and Chat with your PDF files with this GPT4-powered app. Built with LangChain, Pinecone Vector Database, deployed on Streamlit

Language: Python - Size: 1.62 MB - Last synced at: 25 days ago - Pushed at: 6 months ago - Stars: 37 - Forks: 10

Echtzeitsysteme/iflye

Incremental Fast Lightweight (y) virtual network Embedding framework

Language: Java - Size: 48.6 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 4 - Forks: 3

yongzhuo/Macadam

Macadam是一个以Tensorflow(Keras)和bert4keras为基础,专注于文本分类、序列标注和关系抽取的自然语言处理工具包。支持RANDOM、WORD2VEC、FASTTEXT、BERT、ALBERT、ROBERTA、NEZHA、XLNET、ELECTRA、GPT-2等EMBEDDING嵌入; 支持FineTune、FastText、TextCNN、CharCNN、BiRNN、RCNN、DCNN、CRNN、DeepMoji、SelfAttention、HAN、Capsule等文本分类算法; 支持CRF、Bi-LSTM-CRF、CNN-LSTM、DGCNN、Bi-LSTM-LAN、Lattice-LSTM-Batch、MRC等序列标注算法。

Language: Python - Size: 975 KB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 325 - Forks: 39

svjack/Genshin-Impact-RAG

A Genshin Impact Question Answer Project supported by Qwen1.5-14B-Chat

Language: Python - Size: 83 KB - Last synced at: 2 days ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 0

cvxgrp/pymde

Minimum-distortion embedding with PyTorch

Language: Python - Size: 46.8 MB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 548 - Forks: 26

centre-for-humanities-computing/embedding-explorer

Tools for interactive visual exploration of semantic embeddings.

Language: Python - Size: 3.01 MB - Last synced at: 30 days ago - Pushed at: 10 months ago - Stars: 33 - Forks: 4

UCL-CCS/Nbed

Python package to reduce the qubit requirements of quantum simulation by embedding into DFT.

Language: Python - Size: 134 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 8 - Forks: 2

snap-stanford/KGReasoning

Multi-Hop Logical Reasoning in Knowledge Graphs

Language: Python - Size: 20.5 KB - Last synced at: 17 days ago - Pushed at: about 3 years ago - Stars: 296 - Forks: 59

VDA231-301/VDA231-300

description and embedding of material and surface requirements in the 3D data set

Size: 115 KB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

setanarut/fif

A command line tool for embedding any file into PNG, GIF, WAV file formats and decoding back.

Language: Python - Size: 609 KB - Last synced at: 12 days ago - Pushed at: over 1 year ago - Stars: 11 - Forks: 0

benedekrozemberczki/TADW

An implementation of "Network Representation Learning with Rich Text Information" (IJCAI '15).

Language: Python - Size: 1.45 MB - Last synced at: 1 day ago - Pushed at: almost 3 years ago - Stars: 61 - Forks: 14

kdjlyy/LangGraphChatBot

基于 LangGraph 实现的聊天机器人,接入 DeepSeek、Qwen、智谱 AI 多个 LLM 模型,支持在线搜索和文件解析。

Language: Python - Size: 681 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 4 - Forks: 0

luyug/GradCache

Run Effective Large Batch Contrastive Learning Beyond GPU/TPU Memory Constraint

Language: Python - Size: 43.9 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 387 - Forks: 24

arkenidar/luajit_embed

basic luajit embedded into C app ( maybe NDK too ) . see also Sol for Lua . MiniLua too . Bellard's TCC too .

Language: C - Size: 2.05 MB - Last synced at: 5 days ago - Pushed at: 10 months ago - Stars: 1 - Forks: 0

basic-1/basic-1c

BASIC language compiler for STM8 microcontrollers

Language: C++ - Size: 41.4 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 3 - Forks: 1

MUHAMMADAKMAL137/GPU-Accelerated-Next-Word-Prediction-Using-LSTM-and-PyTorch

This repository implements a GPU-accelerated next-word prediction model using PyTorch and LSTM. It includes data preprocessing with NLTK, vocabulary creation, training on tokenized text, and generating text predictions, starting from a given input phrase.

Language: Jupyter Notebook - Size: 332 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0