An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: mteb

embeddings-benchmark/mteb

MTEB: Massive Text Embedding Benchmark

Language: Python - Size: 43.7 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 2,810 - Forks: 462

ContextualAI/gritlm

Generative Representational Instruction Tuning

Language: Jupyter Notebook - Size: 13.3 MB - Last synced at: 5 days ago - Pushed at: 2 months ago - Stars: 668 - Forks: 48

jina-ai/mlx-retrieval

Train embedding and reranker models for retrieval tasks on Apple Silicon with MLX

Language: Python - Size: 195 KB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 149 - Forks: 7

SeanLee97/AnglE

Train and Infer Powerful Sentence Embeddings with AnglE | 🔥 SOTA on STS and MTEB Leaderboard

Language: Python - Size: 889 KB - Last synced at: 6 days ago - Pushed at: 6 months ago - Stars: 552 - Forks: 37

stanleylsx/text_embedding

A tool for training sentence embeddings, with support for CoSENT and SimCSE

Language: Python - Size: 13.1 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 18 - Forks: 1

worldbank/GISTEmbed

GISTEmbed: Guided In-sample Selection of Training Negatives for Text Embeddings

Language: Python - Size: 1.3 MB - Last synced at: 5 months ago - Pushed at: over 1 year ago - Stars: 42 - Forks: 3

su-park/mteb_ko_leaderboard

Korean text embedding model leaderboard

Size: 2.51 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 17 - Forks: 1

louisbrulenaudet/tax-retrieval-benchmark

An implementation of the TaxRetrievalBenchmark task for the 🤗 Massive Text Embedding Benchmark (MTEB) framework.

Language: Jupyter Notebook - Size: 85 KB - Last synced at: 3 days ago - Pushed at: 4 months ago - Stars: 1 - Forks: 1

rishisim/Chitti-Chatbot

Chitti is a retrieval-augmented generation (RAG) application that uses a Mistral large language model (LLM) for generation and BAAI's bge-m3 model for retrieval. Chitti can help answer questions about the IoT Summer Program, projects in the program curriculum, innovations of AIoT SMART Labs, the certification process, and more.

Language: Python - Size: 50.8 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

devflowinc/openembeddings 📦

Self-hostable, pay-for-what-you-use embedding server for bge-large-en and arbitrary embedding models using crypto

Language: JavaScript - Size: 282 KB - Last synced at: 10 days ago - Pushed at: about 2 years ago - Stars: 4 - Forks: 0

llmrails/ember-v1

State-of-the-Art Ember embedding model for retrieval augmented generation

Size: 2.93 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0