Topic: "embedding"
chatchat-space/Langchain-Chatchat
Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and Llama) RAG and Agent app with langchain
Language: Python - Size: 138 MB - Last synced at: 28 days ago - Pushed at: about 2 months ago - Stars: 36,661 - Forks: 6,070
PaddlePaddle/PaddleNLP
Easy-to-use and powerful LLM and SLM library with awesome model zoo.
Language: Python - Size: 112 MB - Last synced at: about 4 hours ago - Pushed at: 8 days ago - Stars: 12,887 - Forks: 3,073
Embedding/Chinese-Word-Vectors
100+ Chinese Word Vectors 上百种预训练中文词向量
Language: Python - Size: 1.42 MB - Last synced at: 7 months ago - Pushed at: about 2 years ago - Stars: 12,010 - Forks: 2,329
modelscope/ms-swift
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, Llava, GLM4v, Phi4, ...) (AAAI 2025).
Language: Python - Size: 72.4 MB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 11,565 - Forks: 1,041
zilliztech/claude-context
Code search MCP for Claude Code. Make entire codebase the context for any coding agent.
Language: TypeScript - Size: 7.4 MB - Last synced at: 4 days ago - Pushed at: 3 months ago - Stars: 4,816 - Forks: 440
apple/embedding-atlas
Embedding Atlas is a tool that provides interactive visualizations for large embeddings. It allows you to visualize, cross-filter, and search embeddings and metadata.
Language: TypeScript - Size: 19.2 MB - Last synced at: 6 days ago - Pushed at: 9 days ago - Stars: 4,463 - Forks: 237
myreader-io/myGPTReader
A community-driven way to read and chat with AI bots - powered by chatGPT.
Language: Python - Size: 9.76 MB - Last synced at: 4 days ago - Pushed at: over 1 year ago - Stars: 4,435 - Forks: 449
infiniflow/infinity
The AI-native database built for LLM applications, providing incredibly fast hybrid search of dense vector, sparse vector, tensor (multi-vector), and full-text.
Language: C++ - Size: 74.9 MB - Last synced at: 2 days ago - Pushed at: 3 days ago - Stars: 4,276 - Forks: 406
groupultra/telegram-search
🔍 导出并模糊搜索 Telegram 聊天记录 | Export and fuzzy search your Telegram chat history
Language: TypeScript - Size: 12.1 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 3,613 - Forks: 233
adambielski/siamese-triplet
Siamese and triplet networks with online pair/triplet mining in PyTorch
Language: Python - Size: 12.3 MB - Last synced at: 7 months ago - Pushed at: over 2 years ago - Stars: 3,141 - Forks: 634
run-llama/LlamaIndexTS
Data framework for your LLM applications. Focus on server side solution
Language: TypeScript - Size: 82.7 MB - Last synced at: 26 days ago - Pushed at: 28 days ago - Stars: 2,960 - Forks: 498
devflowinc/trieve
All-in-one platform for search, recommendations, RAG, and analytics offered via API
Language: Rust - Size: 177 MB - Last synced at: 4 days ago - Pushed at: 3 months ago - Stars: 2,578 - Forks: 230
benedekrozemberczki/awesome-community-detection
A curated list of community detection research papers with implementations.
Language: Python - Size: 2.22 MB - Last synced at: 3 days ago - Pushed at: 5 days ago - Stars: 2,422 - Forks: 358
datawhalechina/all-in-rag
🔍大模型应用开发实战一:RAG 技术全栈指南,在线阅读地址:https://datawhalechina.github.io/all-in-rag/
Language: Python - Size: 58.3 MB - Last synced at: 5 days ago - Pushed at: 8 days ago - Stars: 2,354 - Forks: 1,098
OpenBMB/UltraRAG
UltraRAG v2: A Low-Code MCP Framework for Building Complex and Innovative RAG Pipelines
Language: Python - Size: 44 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 2,342 - Forks: 201
withcatai/node-llama-cpp
Run AI models locally on your machine with node.js bindings for llama.cpp. Enforce a JSON schema on the model output on the generation level
Language: TypeScript - Size: 23 MB - Last synced at: 5 days ago - Pushed at: 15 days ago - Stars: 1,792 - Forks: 159
pavlin-policar/openTSNE
Extensible, parallel implementations of t-SNE
Language: Python - Size: 70 MB - Last synced at: 13 days ago - Pushed at: about 1 month ago - Stars: 1,581 - Forks: 175
vercel/modelfusion
The TypeScript library for building AI applications.
Language: TypeScript - Size: 15.6 MB - Last synced at: 9 days ago - Pushed at: over 1 year ago - Stars: 1,305 - Forks: 91
onestardao/WFGY
WFGY 2.0. Semantic Reasoning Engine for LLMs (MIT). Fixes RAG/OCR drift, collapse & “ghost matches” via symbolic overlays + logic patches. Autoboot; OneLine & Flagship. ⭐ Star if you explore semantic RAG or hallucination mitigation.
Language: Python - Size: 285 MB - Last synced at: 22 days ago - Pushed at: 2 months ago - Stars: 1,266 - Forks: 106
myscale/MyScaleDB
A @ClickHouse fork that supports high-performance vector search and full-text search.
Language: C++ - Size: 818 MB - Last synced at: 6 months ago - Pushed at: 11 months ago - Stars: 975 - Forks: 61
SkywalkerDarren/chatWeb
ChatWeb can crawl web pages, read PDF, DOCX, TXT, and extract the main content, then answer your questions based on the content, or summarize the key points.
Language: Python - Size: 98.6 KB - Last synced at: 2 months ago - Pushed at: over 1 year ago - Stars: 910 - Forks: 137
zhezhaoa/ngram2vec
Four word embedding models implemented in Python. Supporting arbitrary context features
Language: Python - Size: 722 KB - Last synced at: 7 months ago - Pushed at: over 6 years ago - Stars: 851 - Forks: 174
ContextualAI/gritlm
Generative Representational Instruction Tuning
Language: Jupyter Notebook - Size: 13.3 MB - Last synced at: 4 months ago - Pushed at: 6 months ago - Stars: 668 - Forks: 48
llm-tools/embedJs
A NodeJS RAG framework to easily work with LLMs and embeddings
Language: TypeScript - Size: 3.6 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 587 - Forks: 71
cvxgrp/pymde
Minimum-distortion embedding with PyTorch
Language: Python - Size: 46.8 MB - Last synced at: about 2 months ago - Pushed at: 6 months ago - Stars: 562 - Forks: 27
OysterQAQ/ACG2vec
ACG2vec (Anime Comics Games to vector) are committed to creating a playground that combines ACG and Deep learning.(文本语义检索、以图搜图、语义搜图、图片超分辨率、推荐系统)
Size: 77.1 KB - Last synced at: 8 months ago - Pushed at: almost 2 years ago - Stars: 560 - Forks: 25
shawroad/NLP_pytorch_project
Embedding, NMT, Text_Classification, Text_Generation, NER etc.
Language: Python - Size: 164 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 536 - Forks: 113
TIGER-AI-Lab/VLM2Vec
This repo contains the code for "VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasks" [ICLR 2025]
Language: Python - Size: 12.2 MB - Last synced at: 3 days ago - Pushed at: 6 days ago - Stars: 520 - Forks: 48
marl/openl3
OpenL3: Open-source deep audio and image embeddings
Language: Jupyter Notebook - Size: 687 MB - Last synced at: 7 months ago - Pushed at: over 2 years ago - Stars: 517 - Forks: 60
cvqluu/Angular-Penalty-Softmax-Losses-Pytorch
Angular penalty loss functions in Pytorch (ArcFace, SphereFace, Additive Margin, CosFace)
Language: Python - Size: 9.35 MB - Last synced at: 9 months ago - Pushed at: about 2 years ago - Stars: 487 - Forks: 93
xing61/xiaoyi-robot
优质稳定的OpenAI的API接口-For企业和开发者。OpenAI的api proxy,支持ChatGPT的API调用,支持openai的API接口,支持:gpt-4,gpt-3.5。不需要openai Key, 不需要买openai的账号,不需要美元的银行卡,通通不用的,直接调用就行,稳定好用!!智增增
Language: PHP - Size: 384 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 465 - Forks: 35
TilmanGriesel/chipper
✨ AI interface for tinkerers (Ollama, Haystack RAG, Python)
Language: Python - Size: 84.5 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 450 - Forks: 42
luyug/GradCache
Run Effective Large Batch Contrastive Learning Beyond GPU/TPU Memory Constraint
Language: Python - Size: 43.9 KB - Last synced at: 3 months ago - Pushed at: almost 2 years ago - Stars: 409 - Forks: 25
Aquila-Network/aquila
An easy to use Neural Search Engine. Index latent vectors along with JSON metadata and do efficient k-NN search.
Language: HTML - Size: 1.5 MB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 380 - Forks: 25
PaddlePaddle/ERNIE-SDK
ERNIE Bot Agent is a Large Language Model (LLM) Agent Framework, powered by the advanced capabilities of ERNIE Bot and the platform resources of Baidu AI Studio.
Language: Jupyter Notebook - Size: 3.3 MB - Last synced at: 12 days ago - Pushed at: over 1 year ago - Stars: 379 - Forks: 54
guangzhengli/vectorhub
Quickly and easily build AI website or application by using embeddings!
Language: TypeScript - Size: 3.42 MB - Last synced at: 7 months ago - Pushed at: over 1 year ago - Stars: 379 - Forks: 44
redis/redis-vl-python
Redis Vector Library (RedisVL) -- the AI-native Python client for Redis.
Language: Python - Size: 79.1 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 356 - Forks: 64
LnYo-Cly/ai4j
一款JavaSDK用于快速接入AI大模型应用,整合多平台大模型,如OpenAi、智谱Zhipu(ChatGLM)、深度求索DeepSeek、月之暗面Moonshot(Kimi)、腾讯混元Hunyuan、零一万物(01)等等,提供统一的输入输出(对齐OpenAi)消除差异化,优化函数调用(Tool Call),优化RAG调用、支持向量数据库(Pinecone)、内置联网增强,并且支持JDK1.8,为用户提供快速整合AI的能力。
Language: Java - Size: 411 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 353 - Forks: 47
askaitools/askaitools-community-edition
A cutting-edge search engine project tailored specifically for the AI product
Language: TypeScript - Size: 742 KB - Last synced at: 9 months ago - Pushed at: over 1 year ago - Stars: 347 - Forks: 29
yongzhuo/Macadam
Macadam是一个以Tensorflow(Keras)和bert4keras为基础,专注于文本分类、序列标注和关系抽取的自然语言处理工具包。支持RANDOM、WORD2VEC、FASTTEXT、BERT、ALBERT、ROBERTA、NEZHA、XLNET、ELECTRA、GPT-2等EMBEDDING嵌入; 支持FineTune、FastText、TextCNN、CharCNN、BiRNN、RCNN、DCNN、CRNN、DeepMoji、SelfAttention、HAN、Capsule等文本分类算法; 支持CRF、Bi-LSTM-CRF、CNN-LSTM、DGCNN、Bi-LSTM-LAN、Lattice-LSTM-Batch、MRC等序列标注算法。
Language: Python - Size: 975 KB - Last synced at: 3 months ago - Pushed at: almost 3 years ago - Stars: 326 - Forks: 39
marcominerva/ChatGptNet
A ChatGPT integration library for .NET, supporting both OpenAI and Azure OpenAI Service
Language: C# - Size: 4.29 MB - Last synced at: 17 days ago - Pushed at: about 1 year ago - Stars: 317 - Forks: 38
LucaOne/LucaOne
The resources of LucaOne, including: the model code, training scripts, embedding inference code, and trained checkpoints.
Language: Python - Size: 15.2 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 311 - Forks: 34
snap-stanford/KGReasoning
Multi-Hop Logical Reasoning in Knowledge Graphs
Language: Python - Size: 20.5 KB - Last synced at: about 2 months ago - Pushed at: over 3 years ago - Stars: 303 - Forks: 63
geeks-of-data/knowledge-gpt
Extract knowledge from all information sources using gpt and other language models. Index and make Q&A session with information sources.
Language: Python - Size: 3.36 MB - Last synced at: 7 days ago - Pushed at: over 2 years ago - Stars: 288 - Forks: 54
microsoft/rag-experiment-accelerator
The RAG Experiment Accelerator is a versatile tool designed to expedite and facilitate the process of conducting experiments and evaluations using Azure Cognitive Search and RAG pattern.
Language: Python - Size: 4.36 MB - Last synced at: 14 days ago - Pushed at: 9 months ago - Stars: 286 - Forks: 101
wzdavid/ThinkRAG
A LLM RAG system runs on your laptop. 大模型检索增强生成系统,可以轻松部署在笔记本电脑上,实现本地知识库智能问答。
Language: Python - Size: 7.98 MB - Last synced at: 3 months ago - Pushed at: 5 months ago - Stars: 275 - Forks: 40
RealAlexandreAI/json-repair
🔧 Repair JSON!Solution for JSON Anomalies from LLMs.
Language: Go - Size: 218 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 273 - Forks: 12
benedekrozemberczki/GEMSEC
The TensorFlow reference implementation of 'GEMSEC: Graph Embedding with Self Clustering' (ASONAM 2019).
Language: Python - Size: 10.4 MB - Last synced at: 5 months ago - Pushed at: about 3 years ago - Stars: 260 - Forks: 52
shell-nlp/gpt_server
gpt_server是一个用于生产级部署LLMs、Embedding、Reranker、ASR、TTS、文生图、图片编辑和文生视频的开源框架。
Language: Python - Size: 5.28 MB - Last synced at: 4 days ago - Pushed at: 7 days ago - Stars: 241 - Forks: 21
amansrivastava17/embedding-as-service
One-Stop Solution to encode sentence to fixed length vectors from various embedding techniques
Language: Python - Size: 1.93 MB - Last synced at: about 2 months ago - Pushed at: over 2 years ago - Stars: 210 - Forks: 32
shahsohil/DCC
This repository contains the source code and data for reproducing results of Deep Continuous Clustering paper
Language: Python - Size: 50.8 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 206 - Forks: 53
benedekrozemberczki/DANMF
A sparsity aware implementation of "Deep Autoencoder-like Nonnegative Matrix Factorization for Community Detection" (CIKM 2018).
Language: Python - Size: 2.31 MB - Last synced at: 8 months ago - Pushed at: almost 3 years ago - Stars: 205 - Forks: 41
sajari/word2vec
Go library for performing computations in word2vec binary models
Language: Go - Size: 58.6 KB - Last synced at: 7 months ago - Pushed at: over 3 years ago - Stars: 200 - Forks: 35
opensolon/solon-ai
Java AI(智能体) 全场景应用开发框架(LLM,Function Call,RAG,Embedding,Reranking,Flow,MCP Server,Mcp Client,Mcp Proxy)。同时兼容 java8 ~ java25。也可嵌入到 SpringBoot2、jFinal、Vert.x 等框架中使用。。支持 MCP_2025_06_18(mcp streamable)
Language: Java - Size: 11.4 MB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 195 - Forks: 29
dilolabs/nosia
Self-hosted AI RAG + MCP Platform
Language: Ruby - Size: 673 KB - Last synced at: 3 days ago - Pushed at: 9 days ago - Stars: 188 - Forks: 17
benedekrozemberczki/GraphWaveMachine
A scalable implementation of "Learning Structural Node Embeddings Via Diffusion Wavelets (KDD 2018)".
Language: Python - Size: 831 KB - Last synced at: 4 months ago - Pushed at: almost 3 years ago - Stars: 187 - Forks: 34
benedekrozemberczki/MUSAE
The reference implementation of "Multi-scale Attributed Node Embedding". (Journal of Complex Networks 2021)
Language: Python - Size: 19.4 MB - Last synced at: 9 months ago - Pushed at: over 3 years ago - Stars: 168 - Forks: 21
gusye1234/nano-vectordb
A simple, easy-to-hack Vector Database
Language: Python - Size: 24.4 KB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 166 - Forks: 6
malllabiisc/HyTE
EMNLP 2018: HyTE: Hyperplane-based Temporally aware Knowledge Graph Embedding
Language: Python - Size: 420 KB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 166 - Forks: 49
xiaoxiong74/Cool-NLPCV
Some Cool NLP and CV Repositories and Solutions (收集NLP中常见任务的开源解决方案、数据集、工具、学习资料等)
Size: 52.7 KB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 152 - Forks: 47
cair/pyTsetlinMachine
Implements the Tsetlin Machine, Convolutional Tsetlin Machine, Regression Tsetlin Machine, Weighted Tsetlin Machine, and Embedding Tsetlin Machine, with support for continuous features, multigranularity, clause indexing, and literal budget
Language: C - Size: 611 KB - Last synced at: 4 months ago - Pushed at: 9 months ago - Stars: 145 - Forks: 31
cvqluu/Factorized-TDNN
PyTorch implementation of the Factorized TDNN (TDNN-F) from "Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks" and Kaldi
Language: Python - Size: 278 KB - Last synced at: about 2 years ago - Pushed at: almost 6 years ago - Stars: 140 - Forks: 34
benedekrozemberczki/diff2vec
Reference implementation of Diffusion2Vec (Complenet 2018) built on Gensim and NetworkX.
Language: Python - Size: 3.9 MB - Last synced at: 9 months ago - Pushed at: about 3 years ago - Stars: 126 - Forks: 20
kevinzakka/tsne-viz 📦
Python Wrapper for t-SNE Visualization
Language: Python - Size: 7.62 MB - Last synced at: about 2 years ago - Pushed at: almost 8 years ago - Stars: 119 - Forks: 22
MAGICS-LAB/DNABERT_S
[ISMB 2025] DNABERT_S: Learning Species-Aware DNA Embedding with Genome Foundation Models
Language: Python - Size: 2.72 MB - Last synced at: 17 days ago - Pushed at: 11 months ago - Stars: 118 - Forks: 29
greener-group/progres
Fast protein structure searching or your money back
Language: Python - Size: 217 MB - Last synced at: about 2 months ago - Pushed at: 2 months ago - Stars: 114 - Forks: 5
naksyn/Embedder
Embedder is a collection of sources in different languages to embed Python interpreter with minimal dependencies
Language: C++ - Size: 25.4 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 106 - Forks: 13
benedekrozemberczki/walklets
A lightweight implementation of Walklets from "Don't Walk Skip! Online Learning of Multi-scale Network Embeddings" (ASONAM 2017).
Language: Python - Size: 2.99 MB - Last synced at: 9 months ago - Pushed at: almost 3 years ago - Stars: 105 - Forks: 22
microsoft/MSVBASE
MSVBASE is a system that efficiently supports complex queries of both approximate similarity search and relational operators. It integrates high-dimensional vector indices into PostgreSQL, a relational database to facilitate complex approximate similarity queries.
Language: C++ - Size: 39.3 MB - Last synced at: 14 days ago - Pushed at: about 1 year ago - Stars: 101 - Forks: 12
benedekrozemberczki/tigerlily
TigerLily: Finding drug interactions in silico with the Graph.
Language: Jupyter Notebook - Size: 14.3 MB - Last synced at: 28 days ago - Pushed at: about 3 years ago - Stars: 100 - Forks: 9
awslabs/amazon-denseclus
Clustering for mixed-type data
Language: Jupyter Notebook - Size: 4.53 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 99 - Forks: 21
firstbatchxyz/hollowdb-vector
A decentralized vector database for building vector search applications
Language: TypeScript - Size: 872 KB - Last synced at: 12 days ago - Pushed at: almost 2 years ago - Stars: 99 - Forks: 8
lijqhs/text-classification-cn
中文文本分类实践,基于搜狗新闻语料库,采用传统机器学习方法以及预训练模型等方法
Language: Python - Size: 933 KB - Last synced at: over 2 years ago - Pushed at: about 5 years ago - Stars: 96 - Forks: 26
VQLite/VQLite
VQLite - Simple and Lightweight Vector Search Engine based on Google ScaNN
Language: Go - Size: 89.8 KB - Last synced at: 9 months ago - Pushed at: over 1 year ago - Stars: 90 - Forks: 6
benedekrozemberczki/BANE
A sparsity aware implementation of "Binarized Attributed Network Embedding" (ICDM 2018).
Language: Python - Size: 797 KB - Last synced at: 5 months ago - Pushed at: almost 3 years ago - Stars: 87 - Forks: 17
benedekrozemberczki/ASNE
A sparsity aware and memory efficient implementation of "Attributed Social Network Embedding" (TKDE 2018).
Language: Python - Size: 1.01 MB - Last synced at: 9 months ago - Pushed at: about 3 years ago - Stars: 82 - Forks: 27
HITsz-TMG/KaLM-Embedding
Code for KaLM-Embedding models
Language: Python - Size: 319 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 81 - Forks: 6
julien040/hn-recommendation-api
A recommendation system for Hacker News. Get the most similar posts for a given URL
Language: TypeScript - Size: 144 KB - Last synced at: 8 months ago - Pushed at: over 2 years ago - Stars: 81 - Forks: 9
ikergarcia1996/MetaVec
A monolingual and cross-lingual meta-embedding generation and evaluation framework
Language: Python - Size: 69.3 KB - Last synced at: 3 months ago - Pushed at: over 3 years ago - Stars: 80 - Forks: 6
maja42/ember
Embed arbitrary resources into a go executable at runtime, after the executable has been built.
Language: Go - Size: 45.9 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 79 - Forks: 7
tech1024/goai
A friendly API and abstractions for developing AI applications.
Language: Go - Size: 14.6 KB - Last synced at: 4 months ago - Pushed at: 9 months ago - Stars: 77 - Forks: 30
theSage21/lorentz-embeddings
Embed arbitrary graphs in Hyperbolic space
Language: Python - Size: 188 KB - Last synced at: 8 months ago - Pushed at: almost 4 years ago - Stars: 74 - Forks: 17
Lapis-Hong/TransE-Knowledge-Graph-Embedding
TensorFlow implementation of TransE and its extended models for Knowledge Representation Learning
Language: Python - Size: 8.77 MB - Last synced at: almost 3 years ago - Pushed at: over 7 years ago - Stars: 73 - Forks: 20
benedekrozemberczki/RolX
An alternative implementation of Recursive Feature and Role Extraction (KDD11 & KDD12)
Language: Python - Size: 5.06 MB - Last synced at: 2 months ago - Pushed at: about 3 years ago - Stars: 69 - Forks: 20
LucaOne/LucaOneTasks
The project of the downstream tasks based on LucaOne's Embedding.
Language: Python - Size: 3.64 MB - Last synced at: 20 days ago - Pushed at: 23 days ago - Stars: 68 - Forks: 7
littleredxh/DREML
PyTorch implementation of Deep Randomized Ensembles for Metric Learning(ECCV2018)
Language: Python - Size: 80.1 KB - Last synced at: almost 3 years ago - Pushed at: over 5 years ago - Stars: 67 - Forks: 14
benedekrozemberczki/GraRep
A SciPy implementation of "GraRep: Learning Graph Representations with Global Structural Information" (WWW 2015).
Language: Python - Size: 6.88 MB - Last synced at: 6 months ago - Pushed at: about 3 years ago - Stars: 65 - Forks: 25
thiswillbeyourgithub/AnnA_Anki_neuronal_Appendix
Using machine learning on your anki collection to enhance the scheduling via semantic clustering and semantic similarity
Language: Python - Size: 3.89 MB - Last synced at: 9 months ago - Pushed at: about 1 year ago - Stars: 64 - Forks: 1
snakers4/playing_with_vae
Comparing FC VAE / FCN VAE / PCA / UMAP on MNIST / FMNIST
Language: Jupyter Notebook - Size: 4.48 MB - Last synced at: 5 months ago - Pushed at: over 7 years ago - Stars: 64 - Forks: 13
LogicJake/tuling-video-click-top3
图灵联邦视频点击预测大赛线上第三-【ctr, embedding, 穿越特征】
Language: Jupyter Notebook - Size: 43.9 KB - Last synced at: 5 months ago - Pushed at: almost 6 years ago - Stars: 63 - Forks: 18
benedekrozemberczki/TADW
An implementation of "Network Representation Learning with Rich Text Information" (IJCAI '15).
Language: Python - Size: 1.45 MB - Last synced at: about 2 months ago - Pushed at: over 3 years ago - Stars: 61 - Forks: 14
CyberZHG/keras-pos-embd 📦
Position embedding layers in Keras
Language: Python - Size: 24.4 KB - Last synced at: 4 months ago - Pushed at: almost 4 years ago - Stars: 58 - Forks: 23
edenartlab/sd-lora-trainer
LoRa trainer for SDXL and SD15
Language: Python - Size: 6.42 MB - Last synced at: 4 months ago - Pushed at: 5 months ago - Stars: 57 - Forks: 12
milvus-io/milvus-model
A library integrating embedding and reranker models from OpenAI, SentenceTransformers etc for semantic search in vector database.
Language: Python - Size: 127 KB - Last synced at: 2 months ago - Pushed at: 9 months ago - Stars: 57 - Forks: 30
lumpenspace/raft
RAFT, or Retrieval-Augmented Fine-Tuning, is a method comprising of a fine-tuning and a RAG-based retrieval phase. It is particularly suited for the creation of agents that realistically emulate a specific human target.
Language: Python - Size: 4.4 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 57 - Forks: 5
nianticlabs/image-box-overlap
[ECCV 2020] Training neural networks to predict visual overlap of images, through interpretable non-metric box embeddings
Language: Jupyter Notebook - Size: 11.6 MB - Last synced at: 9 months ago - Pushed at: over 5 years ago - Stars: 56 - Forks: 6
LucaOne/LucaOneApp
LucaOne’s representational inference code. Use this project for embedding inference.
Language: Python - Size: 3.23 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 54 - Forks: 4
fatsnk/forksilly.doc
Documents repo of ForkSilly. ForkSilly:兼容sillytavern(酒馆)角色卡、世界书、正则、预设、聊天记录的安卓移动端应用;同时也可作为stable diffusion客户端使用。
Language: Kotlin - Size: 59.3 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 47 - Forks: 10
benedekrozemberczki/NMFADMM
A sparsity aware implementation of "Alternating Direction Method of Multipliers for Non-Negative Matrix Factorization with the Beta-Divergence" (ICASSP 2014).
Language: Python - Size: 7.13 MB - Last synced at: 5 months ago - Pushed at: over 3 years ago - Stars: 47 - Forks: 11
zhangyafeikimi/word2vec-win32
A word2vec port for Windows.
Language: C - Size: 144 KB - Last synced at: almost 2 years ago - Pushed at: over 8 years ago - Stars: 47 - Forks: 42