Topic: "text-embeddings"
michaelfeil/infinity
Infinity is a high-throughput, low-latency serving engine for text embeddings, reranking models, CLIP, CLAP, and ColPali
Language: Python - Size: 12.1 MB - Last synced at: 4 days ago - Pushed at: 30 days ago - Stars: 2,153 - Forks: 145

linkedin/detext
DeText: A Deep Neural Text Understanding Framework for Ranking and Classification Tasks
Language: Python - Size: 10.7 MB - Last synced at: 2 days ago - Pushed at: about 2 years ago - Stars: 1,264 - Forks: 135

nomic-ai/contrastors
Train Models Contrastively in PyTorch
Language: Python - Size: 3.75 MB - Last synced at: 2 days ago - Pushed at: about 2 months ago - Stars: 706 - Forks: 56

ZachNagengast/similarity-search-kit
SimilaritySearchKit is a Swift package providing on-device text embeddings and semantic search functionality for iOS and macOS applications.
Language: Swift - Size: 175 MB - Last synced at: 8 days ago - Pushed at: 12 months ago - Stars: 454 - Forks: 43

yusufhilmi/client-vector-search
A client side vector search library that can embed, store, search, and cache vectors. Works on the browser and node. It outperforms OpenAI's text-embedding-ada-002 and is way faster than Pinecone and other VectorDBs.
Language: TypeScript - Size: 314 KB - Last synced at: 5 days ago - Pushed at: 12 months ago - Stars: 202 - Forks: 14

limcheekin/open-text-embeddings
Open Source Text Embedding Models with OpenAI Compatible API
Language: Python - Size: 224 KB - Last synced at: about 1 month ago - Pushed at: 10 months ago - Stars: 151 - Forks: 20

milosgajdos/go-embeddings
Go module for fetching embeddings from embeddings providers
Language: Go - Size: 4.35 MB - Last synced at: 26 days ago - Pushed at: 26 days ago - Stars: 53 - Forks: 0

md-experiments/picture_text
Interactive tree-maps with SBERT & Hierarchical Clustering (HAC)
Language: Python - Size: 39.2 MB - Last synced at: 8 days ago - Pushed at: 5 months ago - Stars: 30 - Forks: 9

Sid2697/Word-recognition-EmbedNet-CAB
Code implementation for our ICPR 2020 paper titled "Improving Word Recognition using Multiple Hypotheses and Deep Embeddings"
Language: Python - Size: 172 MB - Last synced at: 12 months ago - Pushed at: almost 4 years ago - Stars: 21 - Forks: 5

lakeraai/canica
A text embedding viewer for the Jupyter environment
Language: TypeScript - Size: 1.68 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 19 - Forks: 1

deadbits/vector-embedding-api
Flask API for generating text embeddings using OpenAI or sentence_transformers
Language: Python - Size: 36.1 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 14 - Forks: 1

amazon-science/text_generation_diffusion_llm_topic
Topic Embedding, Text Generation and Modeling using diffusion
Language: Python - Size: 154 KB - Last synced at: 14 days ago - Pushed at: 29 days ago - Stars: 12 - Forks: 3

easonlai/product_recommendations_with_gpt
I have improved the demo by using Azure OpenAI's Embedding model (text-embedding-ada-002), which has a powerful word embedding capability. This model can also vectorize product key phrases and recommend products based on cosine similarity, but with better results. You can find the updated repo here.
Language: Jupyter Notebook - Size: 72.3 KB - Last synced at: 21 days ago - Pushed at: about 2 years ago - Stars: 9 - Forks: 6

tlack/semantics
Semantic similarity via text embeddings in Elixir - powered by SentenceTransformers by SBert.net
Language: Elixir - Size: 26.4 KB - Last synced at: about 1 month ago - Pushed at: over 3 years ago - Stars: 8 - Forks: 1

zer0int/CLIP-gradient-ascent-embeddings
Use CLIP to create matching texts + embeddings for given images; useful for XAI, adversarial training
Language: Python - Size: 5.64 MB - Last synced at: 17 days ago - Pushed at: 5 months ago - Stars: 6 - Forks: 0

Navy10021/KRLawGPT
KRLawGPT : Generative Pre-trained Transformer for producing Korean Legal Text
Language: Python - Size: 111 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 6 - Forks: 1

lh0x00/docsifer
Docsifer is a powerful tool for converting various data formats into Markdown for applications such as indexing, text analysis, and more. It supports PDF, PowerPoint, Word, Excel, Images, Audio, HTML, and other text-based formats, and leverages LLMs to enhance performance.
Language: Python - Size: 150 KB - Last synced at: 24 days ago - Pushed at: 3 months ago - Stars: 5 - Forks: 0

themaximalist/embedding.js
Easy embeddings for LLMs like gpt-3.5-turbo and gpt-4 using text-embedding-ada-002
Language: JavaScript - Size: 11.7 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 0

BjornMelin/stardex
Stardex: Explore GitHub Stars Intelligently. Stardex is a powerful web app that lets you search, filter, and cluster any GitHub user's starred repositories. Discover hidden patterns and find your next favorite project with intelligent, AI-powered exploration.
Language: TypeScript - Size: 549 KB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 4 - Forks: 0

amirmasoudaz/chatgpt-history-search
A Python-based search engine for OpenAI's ChatGPT conversation history, enabling efficient semantic search and interactive engagement with archived chats using text embeddings
Language: Python - Size: 65.4 KB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 4 - Forks: 1

lh0x00/lightweight-embeddings
LightweightEmbeddings is a fast, free, and unlimited API service for multilingual embeddings and reranking, with support for both text and images and guaranteed uptime.
Language: Python - Size: 107 KB - Last synced at: 19 days ago - Pushed at: 3 months ago - Stars: 4 - Forks: 2

ksm26/Understanding-and-Applying-Text-Embeddings
Dive into the world of text embeddings. This course will guide you through leveraging text embeddings to enhance various natural language processing (NLP) tasks.
Language: Jupyter Notebook - Size: 4.58 MB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 6

cjboy76/askpdf
Read PDF with AI.
Language: Vue - Size: 3.35 MB - Last synced at: about 21 hours ago - Pushed at: 3 months ago - Stars: 3 - Forks: 0

salgadev/dolly-expert-lite
A lightweight Dolly-v2 powered assistant that can answer domain-specific questions and keep a conversation. It's expert systems in the era of LLMs.
Language: Jupyter Notebook - Size: 46.3 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 3 - Forks: 2

vinaykanigicherla/amazon_reviews_sentiment
Sentiment Analysis on the Amazon Reviews Dataset using BERT-based transfer learning approach.
Language: Jupyter Notebook - Size: 49.8 KB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 3 - Forks: 0

joshle298/Debrief
A news insight synthesizer designed to cut the noise out of media consumption (using LLMs & text-embeddings)
Language: JavaScript - Size: 24.1 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 1

seonglae/tei
Unofficial Python wrapper library for Text Embeddings Inference (TEI), for batch processing with asyncio
Language: Python - Size: 9.77 KB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

dice-group/GATES
Graph Attention Networks for Entity Summarization (GATES) is a model that applies deep learning on graphs and ensemble learning to entity summarization tasks.
Language: Python - Size: 137 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 2 - Forks: 0

Rana-Shankani/Customer-Support-Bot
A customer support chatbot using Retrieval Augmented Generation (RAG) to answer questions from documentation. Upload PDFs or text files, and let the system handle document processing, embedding generation, and semantic search. Built with LangChain, FAISS vector database, and HuggingFace models with a simple Flask web interface.
Language: Python - Size: 7.81 KB - Last synced at: 18 days ago - Pushed at: 18 days ago - Stars: 1 - Forks: 0

masaad01/website-categorizer
The Website Categorizer is a service that classifies websites by extracting metadata and content, generating embeddings, and matching them to predefined tags using cosine similarity.
Language: Python - Size: 33.2 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

gurbaj5124871/rag-app-deepseek
A RAG (Retrieval-Augmented Generation) application which combines retrieval-based and generative approaches to improve the accuracy and relevance of AI-generated responses.
Language: Python - Size: 1.16 MB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

Gupta-Aryaman/MediMate
XAI Medical Chatbot for Prescribing Medications and Treatment Plans
Language: Jupyter Notebook - Size: 7.04 MB - Last synced at: about 1 month ago - Pushed at: 7 months ago - Stars: 1 - Forks: 3

mahadev0811/CollegeChatbot
This project is a Q&A chatbot designed for the Global Academy of Technology (GAT), utilizing LLMs, Embeddings, RAG techniques and Prompt Engineering to provide accurate and context-aware responses to user queries about the college.
Language: Jupyter Notebook - Size: 1010 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 1 - Forks: 0

ksmin23/semantic-vector-search-with-sagemaker-pgvector
A search application using Aurora PostgreSQL and pgvector for an online retail store product catalog
Language: Jupyter Notebook - Size: 872 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

PappaPaj/qdrant-cookbook
A collection of scripts and utilities for working with Qdrant, OpenAI, and embeddings.
Language: Python - Size: 1 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

turian/embeddingcache
Retrieve text embeddings, but cache them locally if we have already computed them.
Language: Python - Size: 36.1 KB - Last synced at: 3 days ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

Owaiskhan9654/Clinical-Trial-Article-Search
Search using Attention based Sentence Transformers
Language: HTML - Size: 41.3 MB - Last synced at: about 1 year ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 0

Aditya1001001/similarity-and-embedding-app
Learn about text similarity measures & text embedding methods.
Language: Python - Size: 6.75 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 1

wooyakob/restaurant-vector-search
This web app demos vector search of restaurants in California stored in Couchbase Capella.
Language: Python - Size: 4.37 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

shreyakolluru/Youtube-Chatbot-using-Langchain
An AI YouTube Assistant that lets you talk to videos! Automatically fetch transcripts from YouTube, store them in a FAISS vector database, and use Cohere's powerful models to chat or summarize video content. Powered by LangChain, FAISS, and Cohere, bringing conversational AI to video learning!
Language: Jupyter Notebook - Size: 198 KB - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 0 - Forks: 0

aankitkumargupta/langchain_model
LangChain-based exploration of chat models, embeddings, and document similarity using OpenAI, Anthropic, Google Gemini, and Hugging Face models.
Language: Python - Size: 14.6 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

rimonim/embedplyr
Tools for Working With Text Embeddings in R
Language: R - Size: 69.1 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

CompNet/AlertEmbeddings
Abuse detection in online conversations with text and graph embeddings
Language: Python - Size: 28.7 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

damoncrockett/embeddingworld
Fully client side web app for visualizing text embeddings
Language: JavaScript - Size: 12.5 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

adityapathak-cubastion/cubastion-hr-chatbot
Presenting Cubastion's HR chatbot: it can answer queries based on all the latest HR documents published by Cubastion's HR team. This saves time, allowing a Cubastion employee to resolve their query without having to comb through the actual documents. Developed with Python, sentence-transformers, Pinecone, llama3.2, and Streamlit.
Language: Python - Size: 33.4 MB - Last synced at: about 2 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 1

anmol52490/RAG
RAG-Powered Chatbot: An intelligent chatbot that uses RAG (Retrieval-Augmented Generation) to provide responses based on information retrieved from a document database. Integrates Groq for response generation, Chroma for document management, and HuggingFace for embeddings.
Language: Python - Size: 5.46 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

chaitanya-basava/Image-Search-Engine
end-to-end image search app
Language: TypeScript - Size: 14.1 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

pratheeshkumar99/Document-based-Question-Answering-System
This project demonstrates a Retrieval-Augmented Generation (RAG) system for question answering. It integrates OpenAI's GPT-4 model with FAISS for vector similarity search, enabling the system to provide accurate and contextually relevant answers based on a given document or dataset.
Language: Python - Size: 13.7 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

heymeowcat/VectorSearchShop
This app allows users to search for products by either entering text or uploading an image, and retrieves relevant products from a database
Language: Python - Size: 3.77 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

ZikunFu/Embedding-Model-with-Instructions
Language: Jupyter Notebook - Size: 5.86 KB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

yoboBUETGenesis/vectordatabase
This repository deals with vector database preparation.
Language: Jupyter Notebook - Size: 3.72 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

LLeon360/aiprojects-nlp-quora-questions
Uses NLP & LSTM to detect insincere Quora Questions
Language: Python - Size: 2.86 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 1

abibatoki/Large-Language-Models
Large language models offer new opportunities for processing and generating text. I used text embeddings, clustering, and the ChatGPT API to examine the reasons for startup failure.
Language: Python - Size: 3.66 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

LazerLambda/THU-ML-RAG
Homework 3 for the machine learning class at Tsinghua University (fall term 23/24)
Language: Python - Size: 3.92 MB - Last synced at: 6 days ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

aws-samples/text-embeddings-pipeline-for-rag
A pipeline to convert contextual knowledge stored in documents and databases into text embeddings, and store them in a vector store
Language: TypeScript - Size: 215 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

vilsonrodrigues/text-embeddings-server
A simple and scalable open-source solution to text embeddings
Language: Python - Size: 1.03 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

zeno129/DYANE
DYnamic Attributed Node rolEs (DYANE) is an attributed dynamic-network generative model based on temporal motifs and attributed node behavior.
Language: Python - Size: 2.97 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

zarzouram/aics-project
Language: TeX - Size: 14.6 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

theatina/Stress_Detection
M.Sc. mini project for NLP class (M908)
Language: Python - Size: 79.5 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

Extremesarova/nlp
Investigation of NLP techniques based on Stepik NLP course and my developments.
Language: Jupyter Notebook - Size: 15.2 MB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0
