An open API service providing repository metadata for many open source software ecosystems.

Topic: "text-embeddings"

michaelfeil/infinity

Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali

Language: Python - Size: 12.1 MB - Last synced at: 4 days ago - Pushed at: 30 days ago - Stars: 2,153 - Forks: 145

linkedin/detext

DeText: A Deep Neural Text Understanding Framework for Ranking and Classification Tasks

Language: Python - Size: 10.7 MB - Last synced at: 2 days ago - Pushed at: about 2 years ago - Stars: 1,264 - Forks: 135

nomic-ai/contrastors

Train Models Contrastively in Pytorch

Language: Python - Size: 3.75 MB - Last synced at: 2 days ago - Pushed at: about 2 months ago - Stars: 706 - Forks: 56

ZachNagengast/similarity-search-kit

๐Ÿ”Ž SimilaritySearchKit is a Swift package providing on-device text embeddings and semantic search functionality for iOS and macOS applications.

Language: Swift - Size: 175 MB - Last synced at: 8 days ago - Pushed at: 12 months ago - Stars: 454 - Forks: 43

yusufhilmi/client-vector-search

A client side vector search library that can embed, store, search, and cache vectors. Works on the browser and node. It outperforms OpenAI's text-embedding-ada-002 and is way faster than Pinecone and other VectorDBs.

Language: TypeScript - Size: 314 KB - Last synced at: 5 days ago - Pushed at: 12 months ago - Stars: 202 - Forks: 14

limcheekin/open-text-embeddings

Open Source Text Embedding Models with OpenAI Compatible API

Language: Python - Size: 224 KB - Last synced at: about 1 month ago - Pushed at: 10 months ago - Stars: 151 - Forks: 20

milosgajdos/go-embeddings

Go module for fetching embeddings from embeddings providers

Language: Go - Size: 4.35 MB - Last synced at: 26 days ago - Pushed at: 26 days ago - Stars: 53 - Forks: 0

md-experiments/picture_text

Interactive tree-maps with SBERT & Hierarchical Clustering (HAC)

Language: Python - Size: 39.2 MB - Last synced at: 8 days ago - Pushed at: 5 months ago - Stars: 30 - Forks: 9

Sid2697/Word-recognition-EmbedNet-CAB

Code implementation for our ICPR, 2020 paper titled "Improving Word Recognition using Multiple Hypotheses and Deep Embeddings"

Language: Python - Size: 172 MB - Last synced at: 12 months ago - Pushed at: almost 4 years ago - Stars: 21 - Forks: 5

lakeraai/canica

A text embedding viewer for the Jupyter environment

Language: TypeScript - Size: 1.68 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 19 - Forks: 1

deadbits/vector-embedding-api

Flask API for generating text embeddings using OpenAI or sentence_transformers

Language: Python - Size: 36.1 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 14 - Forks: 1

amazon-science/text_generation_diffusion_llm_topic

Topic Embedding, Text Generation and Modeling using diffusion

Language: Python - Size: 154 KB - Last synced at: 14 days ago - Pushed at: 29 days ago - Stars: 12 - Forks: 3

easonlai/product_recommendations_with_gpt

I have improved the demo by using Azure OpenAIโ€™s Embedding model (text-embedding-ada-002), which has a powerful word embedding capability. This model can also vectorize product key phrases and recommend products based on cosine similarity, but with better results. You can find the updated repo here.

Language: Jupyter Notebook - Size: 72.3 KB - Last synced at: 21 days ago - Pushed at: about 2 years ago - Stars: 9 - Forks: 6

tlack/semantics

Semantic similarity via text embeddings in Elixir - powered by SentenceTransformers by SBert.net

Language: Elixir - Size: 26.4 KB - Last synced at: about 1 month ago - Pushed at: over 3 years ago - Stars: 8 - Forks: 1

zer0int/CLIP-gradient-ascent-embeddings

Use CLIP to create matching texts + embeddings for given images; useful for XAI, adversarial training

Language: Python - Size: 5.64 MB - Last synced at: 17 days ago - Pushed at: 5 months ago - Stars: 6 - Forks: 0

Navy10021/KRLawGPT

KRLawGPT : Generative Pre-trained Transformer for producing Korean Legal Text

Language: Python - Size: 111 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 6 - Forks: 1

lh0x00/docsifer

Docsifer is a powerful tool for converting various data formats into Markdown for applications such as indexing, text analysis, and more. It supports PDF, PowerPoint, Word, Excel, Images, Audio, HTML, and other text-based formats, and leverages LLMs to enhance performance.

Language: Python - Size: 150 KB - Last synced at: 24 days ago - Pushed at: 3 months ago - Stars: 5 - Forks: 0

themaximalist/embedding.js ๐Ÿ“ฆ

Easy embeddings for LLMs like gpt-3.5-turbo and gpt-4 using text-embedding-ada-002

Language: JavaScript - Size: 11.7 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 0

BjornMelin/stardex

๐ŸŒŸ Stardex: Explore GitHub Stars Intelligently. Stardex is a powerful web app that lets you search, filter, and cluster any GitHub user's starred repositories. Discover hidden patterns and find your next favorite project with intelligent, AI-powered exploration.

Language: TypeScript - Size: 549 KB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 4 - Forks: 0

amirmasoudaz/chatgpt-history-search

A Python-based search engine for OpenAI's ChatGPT conversation history, enabling efficient semantic search and interactive engagement with archived chats using text embeddings

Language: Python - Size: 65.4 KB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 4 - Forks: 1

lh0x00/lightweight-embeddings

LightweightEmbeddings is a fast, free, and unlimited API service for multilingual embeddings and reranking, with support for both text and images and guaranteed uptime.

Language: Python - Size: 107 KB - Last synced at: 19 days ago - Pushed at: 3 months ago - Stars: 4 - Forks: 2

ksm26/Understanding-and-Applying-Text-Embeddings

Dive into the world of text embeddings. This course will guide you through leveraging text embeddings to enhance various natural language processing (NLP) tasks.

Language: Jupyter Notebook - Size: 4.58 MB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 6

cjboy76/askpdf

Read PDF with AI.

Language: Vue - Size: 3.35 MB - Last synced at: about 21 hours ago - Pushed at: 3 months ago - Stars: 3 - Forks: 0

salgadev/dolly-expert-lite

A lightweight Dolly-v2 powered assistant that can answer domain-specific questions and keep a conversation. It's expert systems in the era of LLMs.

Language: Jupyter Notebook - Size: 46.3 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 3 - Forks: 2

vinaykanigicherla/amazon_reviews_sentiment

Sentiment Analysis on the Amazon Reviews Dataset using BERT-based transfer learning approach.

Language: Jupyter Notebook - Size: 49.8 KB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 3 - Forks: 0

joshle298/Debrief

A news insight synthesizer designed to cut the noise out of media consumption (using LLMs & text-embeddings)

Language: JavaScript - Size: 24.1 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 1

seonglae/tei

Text Embeddings Inference (TEI)'s unofficial python wrapper library for batch processing with asyncio

Language: Python - Size: 9.77 KB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

dice-group/GATES

Graph Attention Networks for Entity Summarization is the model that applies deep learning on graphs and ensemble learning on entity summarization tasks.

Language: Python - Size: 137 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 2 - Forks: 0

Rana-Shankani/Customer-Support-Bot

A customer support chatbot using Retrieval Augmented Generation (RAG) to answer questions from documentation. Upload PDFs or text files, and let the system handle document processing, embedding generation, and semantic search. Built with LangChain, FAISS vector database, and HuggingFace models with a simple Flask web interface.

Language: Python - Size: 7.81 KB - Last synced at: 18 days ago - Pushed at: 18 days ago - Stars: 1 - Forks: 0

masaad01/website-categorizer

The Website Categorizer is a service that classifies websites by extracting metadata and content, generating embeddings, and matching them to predefined tags using cosine similarity.

Language: Python - Size: 33.2 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

gurbaj5124871/rag-app-deepseek

A RAG (Retrieval-Augmented Generation) application which combines retrieval-based and generative approaches to improve the accuracy and relevance of AI-generated responses.

Language: Python - Size: 1.16 MB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

Gupta-Aryaman/MediMate

XAI Medical Chatbot for Prescribing Medications and Treatment Plans

Language: Jupyter Notebook - Size: 7.04 MB - Last synced at: about 1 month ago - Pushed at: 7 months ago - Stars: 1 - Forks: 3

mahadev0811/CollegeChatbot

This project is a Q&A chatbot designed for the Global Academy of Technology (GAT), utilizing LLMs, Embeddings, RAG techniques and Prompt Engineering to provide accurate and context-aware responses to user queries about the college.

Language: Jupyter Notebook - Size: 1010 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 1 - Forks: 0

ksmin23/semantic-vector-search-with-sagemaker-pgvector

A search application using Aurora Postgresql and pgvector for an online retail store product catalog

Language: Jupyter Notebook - Size: 872 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

PappaPaj/qdrant-cookbook

A collection of scripts and utilities for working with Qdrant, OpenAI, and embeddings.

Language: Python - Size: 1 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

turian/embeddingcache

Retrieve text embeddings, but cache them locally if we have already computed them.

Language: Python - Size: 36.1 KB - Last synced at: 3 days ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

Owaiskhan9654/Clinical-Trial-Article-Search

Search using Attention based Sentence Transformers

Language: HTML - Size: 41.3 MB - Last synced at: about 1 year ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 0

Aditya1001001/similarity-and-embedding-app

Learn about text similarity measures & text embedding methods.

Language: Python - Size: 6.75 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 1

wooyakob/restaurant-vector-search

This web app demos vector search of restaurants in California stored in Couchbase Capella.

Language: Python - Size: 4.37 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

shreyakolluru/Youtube-Chatbot-using-Langchain

An AI YouTube Assistant that lets you talk to videos! Automatically fetch transcripts from YouTube, store them in a FAISS vector database, and use Cohere's powerful models to chat or summarize video content. Powered by LangChain, FAISS, and Cohere โ€” bringing conversational AI to video learning!

Language: Jupyter Notebook - Size: 198 KB - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 0 - Forks: 0

aankitkumargupta/langchain_model

LangChain-based exploration of chat models, embeddings, and document similarity using OpenAI, Anthropic, Google Gemini, and Hugging Face models.

Language: Python - Size: 14.6 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

rimonim/embedplyr

Tools for Working With Text Embeddings in R

Language: R - Size: 69.1 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

CompNet/AlertEmbeddings

Abuse detection in online conversations with text and graph embeddings

Language: Python - Size: 28.7 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

damoncrockett/embeddingworld

Fully client side web app for visualizing text embeddings

Language: JavaScript - Size: 12.5 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

adityapathak-cubastion/cubastion-hr-chatbot

Presenting, Cubastion's HR chatbot - it can answer queries based on all the latest HR documents published by Cubastion's HR team. This conveniently saves time, allowing a Cubastion employee to resolve their query without having to comb through the actual documents. <<Developed with Python, sentence-transformers, Pinecone, llama3.2, and Streamlit>>

Language: Python - Size: 33.4 MB - Last synced at: about 2 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 1

anmol52490/RAG

RAG-Powered Chatbot: An intelligent chatbot that uses RAG (Retrieval-Augmented Generation) to provide responses based on information retrieved from a document database. Integrates Groq for response generation, Chroma for document management, and HuggingFace for embeddings.

Language: Python - Size: 5.46 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

chaitanya-basava/Image-Search-Engine

end-to-end image search app

Language: TypeScript - Size: 14.1 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

pratheeshkumar99/Document-based-Question-Answering-System

This project demonstrates a Retrieval-Augmented Generation (RAG) system for question answering. It integrates OpenAIโ€™s GPT-4 model with FAISS for vector similarity search, enabling the system to provide accurate and contextually relevant answers based on a given document or dataset.

Language: Python - Size: 13.7 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

heymeowcat/VectorSearchShop

This app allows users to search for products by either entering text or uploading an image, and retrieves relevant products from a database

Language: Python - Size: 3.77 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

ZikunFu/Embedding-Model-with-Instructions

Language: Jupyter Notebook - Size: 5.86 KB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

yoboBUETGenesis/vectordatabase

This repository deals with vector database preparation.

Language: Jupyter Notebook - Size: 3.72 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

LLeon360/aiprojects-nlp-quora-questions

Uses NLP & LSTM to detect insincere Quora Questions

Language: Python - Size: 2.86 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 1

abibatoki/Large-Language-Models

Large language models offer new opportunities for processing and generating text. I used text embeddings, clustering, and the ChatGPT API to examine the reasons for startup failure.

Language: Python - Size: 3.66 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

LazerLambda/THU-ML-RAG

Homework 3 for the machine learning class at Tsinghua University (fall term 23/24)

Language: Python - Size: 3.92 MB - Last synced at: 6 days ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

aws-samples/text-embeddings-pipeline-for-rag

A pipeline to convert contextual knowledge stored in documents and databases into text embeddings, and store them in a vector store

Language: TypeScript - Size: 215 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

vilsonrodrigues/text-embeddings-server

A simple and scalable open-source solution to text embeddings โ˜„๏ธ๐Ÿ“„

Language: Python - Size: 1.03 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

zeno129/DYANE

DYnamic Attributed Node rolEs (DYANE) is an attributed dynamic-network generative model based on temporal motifs and attributed node behavior.

Language: Python - Size: 2.97 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

zarzouram/aics-project ๐Ÿ“ฆ

Language: TeX - Size: 14.6 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

theatina/Stress_Detection

M.Sc. mini project for NLP class (M908)

Language: Python - Size: 79.5 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

Extremesarova/nlp

Investigation of NLP techniques based on Stepik NLP course and my developments.

Language: Jupyter Notebook - Size: 15.2 MB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

Related Topics
embeddings 12 nlp 10 rag 9 langchain 9 natural-language-processing 9 python 8 huggingface 7 sentence-transformers 7 openai 7 vector-database 7 transformers 6 text-embedding 6 image-embeddings 5 fastapi 5 large-language-models 5 machine-learning 5 retrieval-augmented-generation 4 text-generation 4 clustering 4 llm 4 vector-search 4 semantic-search 3 qdrant 3 word-embeddings 3 chatbot 3 sentiment-analysis 3 ai 3 deep-learning 3 nlp-machine-learning 3 streamlit 3 llms 2 nextjs 2 hierarchical-clustering 2 deep-neural-networks 2 aws-lambda 2 question-answering 2 clip 2 pytorch 2 gpt-4 2 kafka 2 classification 2 vector 2 reactjs 2 semantic-similarity 2 sentence-embeddings 2 chatgpt 2 embedding-vectors 2 embedding 2 vertex-ai 2 aws 2 bedrock 2 generative-ai 2 pgvector 2 python3 2 langchain-python 2 prompt-engineering 2 ollama 2 cosine-similarity 2 faiss 2 information-retrieval 2 web-scraping 2 bert-embeddings 2 vector-embeddings 2 llama-index 2 autogen 2 visualization 2 chroma-database 1 embedding-models 1 t5 1 langchain-framework 1 topic 1 topic-modeling 1 topic-models 1 azure 1 azure-openai 1 azure-openai-api 1 product-recommendation 1 recommender-system 1 word-embedding 1 ollama-api 1 website-categorization 1 chat-models 1 d3js 1 onnx 1 cache-storage 1 api-server 1 transformersjs 1 couchbase-capella 1 detext-framework 1 ranking 1 document-management 1 groq-api 1 huggingface-transformers 1 interactive-chatbot 1 similarity-measurement 1 golang 1 similarity-search 1 go 1 document-processing 1 cohere 1