An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: embedding-vectors

harehimself/pinecone-lab

Experimenting with Pinecone as vector data continues to take center stage in AI-native systems. The purpose of this project is to explore the core capabilities, benchmark performance across different embedding models, and better understand what is possible with vector search in production environments.

Language: Python - Size: 301 KB - Last synced at: 35 minutes ago - Pushed at: about 2 hours ago - Stars: 1 - Forks: 0

yavuzsyl/Aspire.EShop.GenAI

Aspire, Distributed Apps, GenAI, Ollama, Vector DB, Minimal APIs, YARP - API Gateway, Keycloak, Azure Container Apps

Language: C# - Size: 749 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 1 - Forks: 0

rafay123321/embedding-hallucinations

This repo shows how foundational models hallucinate and how such hallucinations can be fixed by fine-tuning them

Language: Python - Size: 476 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 0 - Forks: 0

lab-rasool/HoneyBee

🐝 | From Data to Prognosis: Embedding Multimodal Oncology Data for Precision Medicine

Language: Jupyter Notebook - Size: 525 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 8 - Forks: 0

crate/langchain-cratedb

CrateDB provider for LangChain.

Language: Python - Size: 232 KB - Last synced at: 3 days ago - Pushed at: 4 days ago - Stars: 2 - Forks: 0

BBC-Esq/VectorDB-Plugin

Plugin that lets you ask questions about your documents including audio and video files.

Language: Python - Size: 34.4 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 340 - Forks: 44

anwitac246/test-generator-web

A test series generator for JEE-Mains using RAG and LLM

Language: JavaScript - Size: 1.62 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

rajadilipkolli/ai-playground

AI implementation using langchain4j and springAI frameworks with Java

Language: Java - Size: 1.08 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 14 - Forks: 4

aicubetechnology/aicube-embedding2embedding

AICUBE Embedding2Embedding - Unlock advanced embedding translation between distinct vector spaces with the AICUBE Embedding2Embedding. Seamlessly transform embeddings across various domains to enhance the flexibility and precision of your AI models, enabling smarter integrations.

Language: Python - Size: 95.7 KB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

yusufhilmi/client-vector-search

A client-side vector search library that can embed, store, search, and cache vectors. Works in the browser and Node. It outperforms OpenAI's text-embedding-ada-002 and is way faster than Pinecone and other VectorDBs.

Language: TypeScript - Size: 314 KB - Last synced at: 3 days ago - Pushed at: about 1 year ago - Stars: 210 - Forks: 14

towhee-io/towhee

Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.

Language: Python - Size: 37.2 MB - Last synced at: 9 days ago - Pushed at: 9 months ago - Stars: 3,382 - Forks: 260

Dicklesworthstone/swiss_army_llama

A FastAPI service for semantic text search using precomputed embeddings and advanced similarity measures, with built-in support for various file types through textract.

Language: Python - Size: 7.25 MB - Last synced at: 5 days ago - Pushed at: 4 months ago - Stars: 1,017 - Forks: 61

Dicklesworthstone/fast_vector_similarity

The Fast Vector Similarity Library is designed to provide efficient computation of various similarity measures between vectors.

Language: Rust - Size: 3.42 MB - Last synced at: 1 day ago - Pushed at: 4 months ago - Stars: 393 - Forks: 20

Dadmatech/DadmaTools

DadmaTools is a Persian NLP toolkit developed by Dadmatech Co.

Language: Python - Size: 92.6 MB - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 198 - Forks: 44

monirsayah/RAG-Model-with-LangChain

A Retrieval-Augmented Generation (RAG) chatbot built with Streamlit that allows users to upload text documents and ask questions about their content. The application uses LangChain, ChromaDB, and Groq's language model to provide intelligent responses based on the uploaded documents.

Language: Python - Size: 8.79 KB - Last synced at: 24 days ago - Pushed at: 24 days ago - Stars: 0 - Forks: 0

deatos/HyperVectorDB

Local vector database written in C#. Supports cosine similarity and Jaccard dissimilarity, as well as Euclidean, Manhattan, Chebyshev, and Canberra distances

Language: C# - Size: 67.4 KB - Last synced at: 7 days ago - Pushed at: about 1 month ago - Stars: 19 - Forks: 4

patterns-ai-core/qdrant-ruby

Ruby wrapper for the Qdrant vector search database API

Language: Ruby - Size: 85 KB - Last synced at: 5 days ago - Pushed at: 3 months ago - Stars: 48 - Forks: 9

aws-samples/rss-aggregator-using-cohere-embeddings-bedrock

A sample RSS aggregator application demonstrating the use of Cohere Embeddings

Language: TypeScript - Size: 3.41 MB - Last synced at: 25 days ago - Pushed at: 3 months ago - Stars: 16 - Forks: 1

patterns-ai-core/weaviate-ruby

Ruby wrapper for the Weaviate vector search database API

Language: Ruby - Size: 166 KB - Last synced at: 12 days ago - Pushed at: 7 months ago - Stars: 55 - Forks: 19

nitaiaharoni1/vector-storage

Vector Storage is a vector database that enables semantic similarity searches on text documents in the browser's local storage. It uses OpenAI embeddings to convert documents into vectors and allows searching for similar documents based on cosine similarity.

Language: TypeScript - Size: 175 KB - Last synced at: 6 days ago - Pushed at: 7 months ago - Stars: 226 - Forks: 38

7-4-7/SoilViTv1_annam

This is the repository for a Kaggle competition conducted by the Annam Hackathon. Contains a set of 2 challenges.

Language: Jupyter Notebook - Size: 17 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

Khalidparvaiz/pdf-ai-project

An intelligent PDF question-answering app using Retrieval-Augmented Generation (RAG), built with LangChain, Ollama (Gemma), and Chroma, via Streamlit

Language: Python - Size: 4.88 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

geeks-of-data/knowledge-gpt

Extract knowledge from all information sources using GPT and other language models. Index information sources and hold Q&A sessions with them.

Language: Python - Size: 3.36 MB - Last synced at: 6 days ago - Pushed at: about 2 years ago - Stars: 281 - Forks: 54

mellivora24/JobFIT 📦

An AI-powered web tool that evaluates the compatibility between a CV and a JD

Language: HTML - Size: 495 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

fredsiika/huxley-pdf

Upload personal docs and Chat with your PDF files with this GPT4-powered app. Built with LangChain, Pinecone Vector Database, deployed on Streamlit

Language: Python - Size: 1.62 MB - Last synced at: 29 days ago - Pushed at: 7 months ago - Stars: 37 - Forks: 10

Gurubase/gurubase

Gurubase lets you add an "Ask AI" button to your technical docs, turning your content into an AI assistant. It uses web pages, PDFs, YouTube videos, and GitHub repos as sources to generate instant, accurate answers with references. Deploy it via Slack, Discord, GitHub or a web widget.

Language: Shell - Size: 22.8 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 686 - Forks: 53

hi-tech-AI/help-scout-assistant-using-Pinecone-vector-database

Help Scout Assistant is a document processing and query-response system that leverages Pinecone for vector storage and retrieval. The tool allows you to load PDF documents into a vector store, where they can be queried using OpenAI's language models.

Language: Python - Size: 3.91 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

hpi-swa-lab/Squeak-SemanticText

ChatGPT, embedding search, and retrieval-augmented generation for Squeak/Smalltalk

Language: Smalltalk - Size: 1.22 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 12 - Forks: 1

samadpls/BestRAG

BestRAG: A library for hybrid RAG, combining dense, sparse, and late interaction methods for efficient document storage and search.

Language: Python - Size: 36.1 KB - Last synced at: 14 days ago - Pushed at: 6 months ago - Stars: 15 - Forks: 0

acantarero/embedding_service

FastAPI service to generate text embeddings. Currently supports instructor models and has GPU support.

Language: Python - Size: 37.1 KB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

leogomesdev/moviesflix

This project enables semantic search of movies using natural language queries. It leverages the OpenAI Embeddings API to generate vector representations of movie descriptions and MongoDB Atlas Vector Search to perform efficient similarity searches based on user input.

Language: TypeScript - Size: 7.2 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 4 - Forks: 0

aws-samples/rag-using-langchain-amazon-bedrock-and-opensearch

RAG with langchain using Amazon Bedrock and Amazon OpenSearch

Language: Python - Size: 49.8 KB - Last synced at: 25 days ago - Pushed at: 6 months ago - Stars: 214 - Forks: 42

hkproj/retrieval-augmented-generation-notes

Slides for "Retrieval Augmented Generation" video

Language: Jupyter Notebook - Size: 4.58 MB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 19 - Forks: 1

hummusonrails/couchbase-azure-blog-vector-search-cli

CLI tool for scraping dynamic iframe-based blog content, generating vector embeddings with Azure OpenAI, and enabling semantic search with Couchbase.

Language: Python - Size: 1.01 MB - Last synced at: 4 days ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

ritesh-modi/embedding-hallucinations

This repo shows how foundational models hallucinate and how such hallucinations can be fixed by fine-tuning them

Language: Python - Size: 474 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

awa-ai/awadb

AI Native database for embedding vectors

Language: C++ - Size: 4.14 MB - Last synced at: 7 days ago - Pushed at: 8 months ago - Stars: 172 - Forks: 16

AmirLayegh/airbnb-semantic-search

A semantic search system for Airbnb listings in Stockholm, built with Superlinked and Qdrant. It leverages multi-attribute vector search and Retrieval-Augmented Generation (RAG) to enhance search accuracy, embedding different data types (e.g., price, description) with specialized models. Powered by FastAPI and Streamlit.

Language: Python - Size: 8.7 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 2 - Forks: 1

taherfattahi/recommendation-systems-by-llms

Enhancing Recommendation Systems with Large Language Models (RAG - LangChain - OpenAI)

Language: Jupyter Notebook - Size: 147 KB - Last synced at: 2 months ago - Pushed at: 6 months ago - Stars: 34 - Forks: 3

jager47X/VibeMap

This project visualizes high-dimensional tweet embeddings using t-SNE, 1-Nearest Neighbor clustering with 10 emotional levels, and interactive Plotly 3D scatter plots. It enables users to explore tweet data by username and time through dropdown filters and a time-range slider.

Language: Python - Size: 2.02 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

rgdavies92/tensorflow-spam

✉️ 🐖 Spam email identification using NLP and a RNN with TensorFlow

Language: Jupyter Notebook - Size: 18.4 MB - Last synced at: 17 days ago - Pushed at: over 3 years ago - Stars: 12 - Forks: 1

dcarpintero/llamaindexchat

LLM chatbot with Retrieval Augmented Generation using LlamaIndex. It demonstrates how to implement chunking, indexing, and source citation.

Language: Python - Size: 12.6 MB - Last synced at: 20 days ago - Pushed at: over 1 year ago - Stars: 44 - Forks: 6

DuyTran04/N47_HeThongGoiYSanPham

Research and development of a product recommendation system using the Deep Matrix Factorization algorithm

Language: Jupyter Notebook - Size: 80 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

pentoai/vectory

Vectory provides a collection of tools to track and compare embedding versions.

Language: Python - Size: 1.92 MB - Last synced at: 14 days ago - Pushed at: over 2 years ago - Stars: 71 - Forks: 0

ML-KULeuven/PaTSEmb

Transform time series to a pattern-based embedding

Language: Jupyter Notebook - Size: 14.7 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 1

patterns-ai-core/milvus

Ruby wrapper for the Milvus vector search database API

Language: Ruby - Size: 76.2 KB - Last synced at: 12 days ago - Pushed at: 3 months ago - Stars: 29 - Forks: 6

suncloudsmoon/HyperVectorDB-APIFixes (fork of deatos/HyperVectorDB)

HyperVectorDB – simple. powerful. A local vector database built in C#, engineered for effortless precision. Explore your data using Cosine, Jaccard, Euclidean, Manhattan, Chebyshev, and Canberra distances.

Language: C# - Size: 180 KB - Last synced at: 15 days ago - Pushed at: 5 months ago - Stars: 2 - Forks: 0

geekmaxi/developer_assistant

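Developer Assistant, an open-source technical Q&A assistant for developers! Built on advanced large-model technology, it answers programming questions, with a knowledge base covering mainstream development languages such as Python and Java, helping you develop efficiently!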

Language: TypeScript - Size: 979 KB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 3 - Forks: 0

IngestAI/embedditor

⚡ GUI for editing LLM vector embeddings. No more blind chunking. Upload content in any file extension, join and split chunks, edit metadata and embedding tokens + remove stop-words and punctuation with one click, add images, and download in .veml to share it with your team.

Language: PHP - Size: 1.74 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 224 - Forks: 15

tegridydev/Face-Based-Attention-Circuits

Face-Based Attention Circuits (FBAC): A Theoretical Framework for Context-Aware Embeddings

Size: 135 KB - Last synced at: 3 months ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

chriskillpack/henri

LLM powered image search

Language: Go - Size: 531 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

rajvirtual/ChatDocs

A .NET-based AI project leveraging Retrieval-Augmented Generation (RAG) and OpenAI to provide efficient, intelligent search capabilities for team documentation.

Language: C# - Size: 51.8 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

pwyp/embeddings

An introduction to vector embeddings, a fundamental concept widely used in machine learning. The Jupyter Notebook was prepared as part of an internal presentation for colleagues.

Language: Jupyter Notebook - Size: 0 Bytes - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

0xIbra/linux-tower-gpt-embeddings-experiment

This project is a work-in-progress and serves as an experiment for context injection with GPT and code embeddings. The goal is to use GPT to develop the remaining features of the project.

Language: Python - Size: 3.06 MB - Last synced at: 4 months ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

pngo1997/Retrieval-Augmented-Retrieval-RAG-for-Cleantech-Media

Implements a Retrieval-Augmented Generation (RAG) system.

Language: Jupyter Notebook - Size: 21.7 MB - Last synced at: 12 days ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

namuan/notes-mind

Private / Local secure setup to chat with Apple Notes

Language: Python - Size: 814 KB - Last synced at: about 2 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

38832/Natural-Language-Processing

This repository features NLP and deep learning projects using LSTM, Bidirectional LSTM, Word2Vec, and TF-IDF, implemented with TensorFlow, Keras, and Scikit-Learn.

Language: Jupyter Notebook - Size: 225 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

attarmau/Multimodal-Misinformation-Detection

Multimodal deep learning model for fake news classification.

Size: 9.77 KB - Last synced at: 24 days ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

Blacknahil/semantic_search

A semantic search system for Wikipedia articles using Weaviate and Cohere. It indexes articles with custom embeddings and provides a query interface to retrieve the most relevant matches. The system demonstrates the power of vector-based search for natural language queries.

Language: Jupyter Notebook - Size: 7.47 MB - Last synced at: 4 months ago - Pushed at: 7 months ago - Stars: 1 - Forks: 0

dcarpintero/athena

Scientific Research Assistant built with LLMs, Retrieval Augmented Generation, and Semantic Search.

Language: Python - Size: 3.71 MB - Last synced at: 3 months ago - Pushed at: 12 months ago - Stars: 5 - Forks: 0

dcarpintero/wikisearch

Multilingual Semantic Search with Reranking on a prepared large vectorized dataset comprising 10 million Wikipedia documents. It supports dense retrieval, keyword search, and hybrid search.

Language: Python - Size: 625 KB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 13 - Forks: 1

shamspias/langchain-chat

langchain-chat is an AI-driven Q&A system that leverages OpenAI's GPT-4 model and FAISS for efficient document indexing. It loads and splits documents from websites or PDFs, remembers conversations, and provides accurate, context-aware answers based on the indexed data. Easy to set up and extend.

Language: Python - Size: 1.34 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 86 - Forks: 17

Frank40790/SemanticSpotlight

A tool for identifying related text in a large chunk of text

Language: Python - Size: 1.01 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

get-convex/dryad

Dryad talks to your tree! Easy semantic code search on any repository

Language: TypeScript - Size: 1.47 MB - Last synced at: 3 months ago - Pushed at: almost 2 years ago - Stars: 29 - Forks: 5

rsvinicius/spring-ai-demo

This project is a Spring Boot application that demonstrates a REST API using Ollama AI. It features embedding vectors, function calling, and streaming capabilities.

Language: Kotlin - Size: 47.9 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

ekantchandrakar/vectordb

A simple vector database for RAG applications

Language: Java - Size: 108 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0

Gabriellgpc/computer-vision-dataset-maker

The Power of Florence-2 with OpenVINO & FiftyOne: Real-World Applications in Image Analysis

Language: Python - Size: 11.7 KB - Last synced at: 5 days ago - Pushed at: 10 months ago - Stars: 2 - Forks: 0

pranshurastogi29/Post-analysis-and-Suggestion-Engine

This project is based on text generation techniques used for predictive keyboards and post generation under constraints. It also provides sentiment and upvote prediction for a Reddit post title.

Language: Jupyter Notebook - Size: 8.63 MB - Last synced at: 2 months ago - Pushed at: about 4 years ago - Stars: 2 - Forks: 0

easonlai/chat_with_pdf_table

The contents of this repository showcase how to extract table data from a PDF file and preprocess it to facilitate word embedding. This preprocessing step enhances the readability of table data for language models and enables us to extract more contextual information from the tables.

Language: Jupyter Notebook - Size: 85.9 KB - Last synced at: 5 days ago - Pushed at: over 1 year ago - Stars: 9 - Forks: 4

ikergarcia1996/MetaVec

A monolingual and cross-lingual meta-embedding generation and evaluation framework

Language: Python - Size: 69.3 KB - Last synced at: 3 months ago - Pushed at: about 3 years ago - Stars: 80 - Forks: 5

Amir-Entezari/Text-Classification-Enhancements

Enhancing Text Classification in Information Retrieval: Evaluating the effectiveness of Naive Bayes classifiers with various word embeddings (Word2Vec, GloVe, FastText) for natural language processing tasks. This project explores performance differences and offers insights into embedding impacts on text classification.

Language: Jupyter Notebook - Size: 583 KB - Last synced at: 10 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

nssharmaofficial/review-sentiment-classifier

Review classification in pytorch using LSTM

Language: Python - Size: 30.1 MB - Last synced at: 2 months ago - Pushed at: 10 months ago - Stars: 3 - Forks: 0

DerartuDagne/The-Complete-LangChain-LLMs-Guide (fork of PacktPublishing/The-Complete-LangChain-LLMs-Guide)

This repository, forked from Packt Publishing, serves as a comprehensive guide to LangChain and LLMs, encompassing all the resources and knowledge gained from the on-demand course.

Language: Python - Size: 2.43 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

g-despot/social-network-analyzer

A simple program that can perform social network analysis tasks on graph data.

Language: Python - Size: 346 MB - Last synced at: 11 months ago - Pushed at: about 4 years ago - Stars: 4 - Forks: 0

norandom/log2ml

Master Thesis: Development and Evaluation of Software for Forensic Log-Analysis Using Machine Learning and Genetic Programming

Language: Jupyter Notebook - Size: 3.39 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

kozistr/triton-grpc-proxy-rs

Proxy server, written in Rust, for a Triton gRPC server that runs inference on an embedding model

Language: Rust - Size: 108 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 16 - Forks: 2

shahules786/Twitter-Sentiment

Sentiment analyzer for your tweets.

Language: Python - Size: 3.41 MB - Last synced at: 3 months ago - Pushed at: over 4 years ago - Stars: 63 - Forks: 11

France-Travail/embcompare

A simple python tool for embedding comparison

Language: Python - Size: 27.9 MB - Last synced at: 2 months ago - Pushed at: over 1 year ago - Stars: 7 - Forks: 0

coder-backend/Stanford-Sentiment-Treebank

Rate customer reviews

Language: Jupyter Notebook - Size: 2.72 MB - Last synced at: 6 months ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

coder-backend/Predict-Job-Title-and-skills

Job prediction given a job description and skills

Language: Jupyter Notebook - Size: 324 KB - Last synced at: 6 months ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

taherfattahi/embedding-optimizer

Two approaches to generating optimized embeddings in the Retrieval-Augmented Generation (RAG) Pattern

Language: Python - Size: 161 KB - Last synced at: 2 days ago - Pushed at: about 1 year ago - Stars: 5 - Forks: 0

FrankyKyaw/DeepMelodyLSTM

An LSTM based music generation model trained on midi data. The model takes in a sequence of a certain length and learns to predict the next note.

Language: Jupyter Notebook - Size: 2.66 MB - Last synced at: 4 months ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 0

egermano/poc-rag-ollama

Playing with Generative AI

Language: JavaScript - Size: 31.3 KB - Last synced at: 11 months ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

nabeel-ncz/document-ai-query

A web application that extracts, processes, and intelligently interacts with PDF content. Using natural language processing and vector embeddings, it transforms PDF text into high-dimensional vectors for efficient and accurate querying.

Language: TypeScript - Size: 188 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

olasunkanmi-SE/IntelliSearch

IntelliSearch is an advanced retrieval-based question-answering and recommendation system that leverages embeddings and a large language model (LLM) to provide accurate and relevant information to users.

Language: TypeScript - Size: 1010 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 3 - Forks: 0

danielkonecny/generating-faces

Generating faces with GANs and analyzing embedding space distribution for different classes as a Bachelor's Thesis at BUT FIT.

Language: Jupyter Notebook - Size: 97.5 MB - Last synced at: about 1 year ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

langfield/embedding-encoder

Autoencoder to compress distance matrices of pretrained embedding files.

Language: Python - Size: 2.7 MB - Last synced at: about 1 year ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

sjy-dv/mind-x

Mind-X is my intelligent alter ego that understands me the best. It assists with and resolves my bothersome tasks, growing in real-time as a next-generation PersonAI system.

Language: Go - Size: 39.6 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

ziozzang/embedding-server

Test embedding server (OpenAI API compatible). Models from LLaMA/Mistral

Language: Python - Size: 6.84 KB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

labrijisaad/LLM-RAG

A Streamlit app leveraging a RAG LLM with FAISS to offer answers from uploaded files.

Language: Jupyter Notebook - Size: 1.31 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 3 - Forks: 0

JadenGeller/similarity-topology

Efficient nearest neighbor search in Swift

Language: Swift - Size: 95.7 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 0

enockjamin01/autocode

NLP LSTM model to predict Python code (text prediction with tokenized special characters)

Language: Jupyter Notebook - Size: 3.56 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

crclark/graph-anns

Efficient approximate nearest neighbor search data structure

Language: Rust - Size: 2.42 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 0

webobite/Fact-Chatbot

Fact Chatbot is a project that reads a text file containing a set of facts ahead of time and answers the user's questions with useful information based on the facts provided in that file.

Language: Jupyter Notebook - Size: 85.9 KB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

venkat-a/Text_Processing_RNN_LSTM

Text Processing RNN leverages RNN and LSTM models for advanced text processing. It features deep learning techniques for NLP tasks, utilizing GloVe for word embeddings, aimed at both educational and practical applications.

Language: Jupyter Notebook - Size: 15.6 KB - Last synced at: 4 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

UBOS-tech/node-red-contrib-chromadb

Chroma is the open-source embedding database

Language: HTML - Size: 27.3 KB - Last synced at: 7 days ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 0

sdsc-innovation/itembed

Python library to train shallow embeddings on unordered sequences

Language: Python - Size: 21.2 MB - Last synced at: 13 days ago - Pushed at: 11 months ago - Stars: 3 - Forks: 0

mickymultani/Streaming-LLM-Chat

Interactive chat application leveraging OpenAI's GPT-4 for real-time conversation simulations. Built with Flask, this project showcases streaming LLM responses in a user-friendly web interface.

Language: Python - Size: 2.23 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 8 - Forks: 2

struct-chat/embedding

Vector Embedding Server in under 100 lines of code

Language: Python - Size: 15.6 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 19 - Forks: 2

DrRuin/Personalized-Real-Estate-Agent

In an industry where personalization is key to customer satisfaction, your company wants to revolutionize how clients interact with real estate listings. The goal is to create a personalized experience for each buyer, making the property search process more engaging and tailored to individual preferences.

Language: Jupyter Notebook - Size: 386 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

StepanTita/cam-bert

We propose a novel method of fine-tuning the model for a particular downstream task, which proves to be more efficient and generalizable. We demonstrate this on a fake news detection task using three distinct datasets, outperforming the baseline model in both same-dataset and cross-dataset zero-shot tests.

Language: Jupyter Notebook - Size: 119 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Related Keywords
embedding-vectors (136), embeddings (24), llm (22), python (20), openai (19), rag (17), langchain (17), machine-learning (16), nlp (16), vector-database (16), vector-search (14), natural-language-processing (13), retrieval-augmented-generation (12), deep-learning (12), embedding-models (11), vector (10), semantic-search (10), large-language-models (9), chatgpt (9), ai (8), embedding (7), streamlit (7), llms (6), openai-api (6), pinecone (6), classification (6), chatbot (6), langchain-python (6), nextjs (5), generative-ai (5), nlp-machine-learning (5), weaviate (5), chromadb (5), approximate-nearest-neighbor-search (4), cosine-similarity (4), genai (4), ml (4), docker (4), lstm (4), word2vec (4), tensorflow (4), vectordb (4), recommender-system (4), sentence-embeddings (4), pytorch (3), embedding-similarity (3), transformers (3), text-classification (3), pgvector (3), gpt-4 (3), faiss (3), cohere (3), qdrant (3), api-client (3), rnn-tensorflow (3), ruby (3), rubyml (3), typescript (3), prompt-engineering (3), gpt-3 (3), vector-db (3), vector-similarity (3), neural-networks (3), embedding-python (3), huggingface (3), language-model (3), artificial-intelligence (3), lstm-neural-networks (3), keras (3), data-science (3), ollama (3), embeddings-similarity (3), llama2 (3), nodejs (2), sigmoid-function (2), cohere-ai (2), bedrock (2), data-visualization (2), vector-search-engine (2), matplotlib-pyplot (2), bm25 (2), dropout-keras (2), similarity-search (2), retrival-augmented-generation (2), adam-optimizer (2), python3 (2), langchain-expression-language (2), gpt4all (2), pypi-package (2), aws (2), tokenizer (2), hnsw (2), huggingface-transformers (2), text-generation (2), database (2), rag-chatbot (2), reactjs (2), transfer-learning (2), kaggle-dataset (2), research-project (2)