GitHub topics: embedding-vectors
harehimself/pinecone-lab
Experimenting with Pinecone as vector data continues to take center stage in AI-native systems. The purpose of this project is to explore the core capabilities, benchmark performance across different embedding models, and better understand what is possible with vector search in production environments.
Language: Python - Size: 301 KB - Last synced at: 35 minutes ago - Pushed at: about 2 hours ago - Stars: 1 - Forks: 0

yavuzsyl/Aspire.EShop.GenAI
Aspire, Distributed Apps, GenAI, Ollama, Vector DB, Minimal APIs, YARP (API Gateway), Keycloak, Azure Container Apps
Language: C# - Size: 749 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 1 - Forks: 0

rafay123321/embedding-hallucinations
This repo shows how foundation models hallucinate and how such hallucinations can be fixed by fine-tuning them
Language: Python - Size: 476 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 0 - Forks: 0

lab-rasool/HoneyBee
🐝 | From Data to Prognosis: Embedding Multimodal Oncology Data for Precision Medicine
Language: Jupyter Notebook - Size: 525 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 8 - Forks: 0

crate/langchain-cratedb
CrateDB provider for LangChain.
Language: Python - Size: 232 KB - Last synced at: 3 days ago - Pushed at: 4 days ago - Stars: 2 - Forks: 0

BBC-Esq/VectorDB-Plugin
Plugin that lets you ask questions about your documents including audio and video files.
Language: Python - Size: 34.4 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 340 - Forks: 44

anwitac246/test-generator-web
A test series generator for JEE-Mains using RAG and LLM
Language: JavaScript - Size: 1.62 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

rajadilipkolli/ai-playground
AI implementation using the LangChain4j and Spring AI frameworks with Java
Language: Java - Size: 1.08 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 14 - Forks: 4

aicubetechnology/aicube-embedding2embedding
AICUBE Embedding2Embedding - Unlock advanced embedding translation between distinct vector spaces with the AICUBE Embedding2Embedding. Seamlessly transform embeddings across various domains to enhance the flexibility and precision of your AI models, enabling smarter integrations.
Language: Python - Size: 95.7 KB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

yusufhilmi/client-vector-search
A client side vector search library that can embed, store, search, and cache vectors. Works on the browser and node. It outperforms OpenAI's text-embedding-ada-002 and is way faster than Pinecone and other VectorDBs.
Language: TypeScript - Size: 314 KB - Last synced at: 3 days ago - Pushed at: about 1 year ago - Stars: 210 - Forks: 14

towhee-io/towhee
Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.
Language: Python - Size: 37.2 MB - Last synced at: 9 days ago - Pushed at: 9 months ago - Stars: 3,382 - Forks: 260

Dicklesworthstone/swiss_army_llama
A FastAPI service for semantic text search using precomputed embeddings and advanced similarity measures, with built-in support for various file types through textract.
Language: Python - Size: 7.25 MB - Last synced at: 5 days ago - Pushed at: 4 months ago - Stars: 1,017 - Forks: 61

Dicklesworthstone/fast_vector_similarity
The Fast Vector Similarity Library is designed to provide efficient computation of various similarity measures between vectors.
Language: Rust - Size: 3.42 MB - Last synced at: 1 day ago - Pushed at: 4 months ago - Stars: 393 - Forks: 20

Dadmatech/DadmaTools
DadmaTools is a Persian NLP toolkit developed by Dadmatech Co.
Language: Python - Size: 92.6 MB - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 198 - Forks: 44

monirsayah/RAG-Model-with-LangChain
A Retrieval-Augmented Generation (RAG) chatbot built with Streamlit that allows users to upload text documents and ask questions about their content. The application uses LangChain, ChromaDB, and Groq's language model to provide intelligent responses based on the uploaded documents.
Language: Python - Size: 8.79 KB - Last synced at: 24 days ago - Pushed at: 24 days ago - Stars: 0 - Forks: 0

deatos/HyperVectorDB
Local vector database written in C# that supports cosine similarity and Jaccard dissimilarity, as well as Euclidean, Manhattan, Chebyshev, and Canberra distances
Language: C# - Size: 67.4 KB - Last synced at: 7 days ago - Pushed at: about 1 month ago - Stars: 19 - Forks: 4

patterns-ai-core/qdrant-ruby
Ruby wrapper for the Qdrant vector search database API
Language: Ruby - Size: 85 KB - Last synced at: 5 days ago - Pushed at: 3 months ago - Stars: 48 - Forks: 9

aws-samples/rss-aggregator-using-cohere-embeddings-bedrock
A sample RSS aggregator application demonstrating the use of Cohere Embeddings
Language: TypeScript - Size: 3.41 MB - Last synced at: 25 days ago - Pushed at: 3 months ago - Stars: 16 - Forks: 1

patterns-ai-core/weaviate-ruby
Ruby wrapper for the Weaviate vector search database API
Language: Ruby - Size: 166 KB - Last synced at: 12 days ago - Pushed at: 7 months ago - Stars: 55 - Forks: 19

nitaiaharoni1/vector-storage
Vector Storage is a vector database that enables semantic similarity searches on text documents in the browser's local storage. It uses OpenAI embeddings to convert documents into vectors and allows searching for similar documents based on cosine similarity.
Language: TypeScript - Size: 175 KB - Last synced at: 6 days ago - Pushed at: 7 months ago - Stars: 226 - Forks: 38

7-4-7/SoilViTv1_annam
This is the repository for the Kaggle competition conducted by the Annam Hackathon. Contains a set of two challenges.
Language: Jupyter Notebook - Size: 17 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

Khalidparvaiz/pdf-ai-project
An intelligent PDF question-answering app using Retrieval-Augmented Generation (RAG), built with LangChain, Ollama (Gemma), and Chroma, via Streamlit
Language: Python - Size: 4.88 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

geeks-of-data/knowledge-gpt
Extract knowledge from all information sources using GPT and other language models. Index information sources and run Q&A sessions over them.
Language: Python - Size: 3.36 MB - Last synced at: 6 days ago - Pushed at: about 2 years ago - Stars: 281 - Forks: 54

mellivora24/JobFIT 📦
An AI-powered web tool that evaluates the compatibility between a CV and a JD
Language: HTML - Size: 495 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

fredsiika/huxley-pdf
Upload personal docs and chat with your PDF files using this GPT-4-powered app. Built with LangChain and the Pinecone vector database, deployed on Streamlit
Language: Python - Size: 1.62 MB - Last synced at: 29 days ago - Pushed at: 7 months ago - Stars: 37 - Forks: 10

Gurubase/gurubase
Gurubase lets you add an "Ask AI" button to your technical docs, turning your content into an AI assistant. It uses web pages, PDFs, YouTube videos, and GitHub repos as sources to generate instant, accurate answers with references. Deploy it via Slack, Discord, GitHub or a web widget.
Language: Shell - Size: 22.8 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 686 - Forks: 53

hi-tech-AI/help-scout-assistant-using-Pinecone-vector-database
Help Scout Assistant is a document processing and query-response system that leverages Pinecone for vector storage and retrieval. The tool allows you to load PDF documents into a vector store, where they can be queried using OpenAI's language models.
Language: Python - Size: 3.91 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

hpi-swa-lab/Squeak-SemanticText
ChatGPT, embedding search, and retrieval-augmented generation for Squeak/Smalltalk
Language: Smalltalk - Size: 1.22 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 12 - Forks: 1

samadpls/BestRAG
BestRAG: A library for hybrid RAG, combining dense, sparse, and late interaction methods for efficient document storage and search.
Language: Python - Size: 36.1 KB - Last synced at: 14 days ago - Pushed at: 6 months ago - Stars: 15 - Forks: 0

acantarero/embedding_service
FastAPI service to generate text embeddings. Currently supports instructor models and has GPU support.
Language: Python - Size: 37.1 KB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

leogomesdev/moviesflix
This project enables semantic search of movies using natural language queries. It leverages the OpenAI Embeddings API to generate vector representations of movie descriptions and MongoDB Atlas Vector Search to perform efficient similarity searches based on user input.
Language: TypeScript - Size: 7.2 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 4 - Forks: 0

aws-samples/rag-using-langchain-amazon-bedrock-and-opensearch
RAG with LangChain using Amazon Bedrock and Amazon OpenSearch
Language: Python - Size: 49.8 KB - Last synced at: 25 days ago - Pushed at: 6 months ago - Stars: 214 - Forks: 42

hkproj/retrieval-augmented-generation-notes
Slides for "Retrieval Augmented Generation" video
Language: Jupyter Notebook - Size: 4.58 MB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 19 - Forks: 1

hummusonrails/couchbase-azure-blog-vector-search-cli
CLI tool for scraping dynamic iframe-based blog content, generating vector embeddings with Azure OpenAI, and enabling semantic search with Couchbase.
Language: Python - Size: 1.01 MB - Last synced at: 4 days ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

ritesh-modi/embedding-hallucinations
This repo shows how foundation models hallucinate and how such hallucinations can be fixed by fine-tuning them
Language: Python - Size: 474 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

awa-ai/awadb
AI Native database for embedding vectors
Language: C++ - Size: 4.14 MB - Last synced at: 7 days ago - Pushed at: 8 months ago - Stars: 172 - Forks: 16

AmirLayegh/airbnb-semantic-search
A semantic search system for Airbnb listings in Stockholm, built with Superlinked and Qdrant. It leverages multi-attribute vector search and Retrieval-Augmented Generation (RAG) to enhance search accuracy, embedding different data types (e.g., price, description) with specialized models. Powered by FastAPI and Streamlit.
Language: Python - Size: 8.7 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 2 - Forks: 1

taherfattahi/recommendation-systems-by-llms
Enhancing Recommendation Systems with Large Language Models (RAG - LangChain - OpenAI)
Language: Jupyter Notebook - Size: 147 KB - Last synced at: 2 months ago - Pushed at: 6 months ago - Stars: 34 - Forks: 3

jager47X/VibeMap
This project visualizes high-dimensional tweet embeddings using t-SNE, 1-Nearest Neighbor clustering with 10 emotional levels, and interactive Plotly 3D scatter plots. It enables users to explore tweet data by username and time through dropdown filters and a time-range slider.
Language: Python - Size: 2.02 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

rgdavies92/tensorflow-spam
✉️ 🐖 Spam email identification using NLP and a RNN with TensorFlow
Language: Jupyter Notebook - Size: 18.4 MB - Last synced at: 17 days ago - Pushed at: over 3 years ago - Stars: 12 - Forks: 1

dcarpintero/llamaindexchat
LLM chatbot with Retrieval Augmented Generation using LlamaIndex. It demonstrates how to implement chunking, indexing, and source citation.
Language: Python - Size: 12.6 MB - Last synced at: 20 days ago - Pushed at: over 1 year ago - Stars: 44 - Forks: 6

DuyTran04/N47_HeThongGoiYSanPham
Research and development of a product recommendation system using the Deep Matrix Factorization algorithm
Language: Jupyter Notebook - Size: 80 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

pentoai/vectory
Vectory provides a collection of tools to track and compare embedding versions.
Language: Python - Size: 1.92 MB - Last synced at: 14 days ago - Pushed at: over 2 years ago - Stars: 71 - Forks: 0

ML-KULeuven/PaTSEmb
Transform time series to a pattern-based embedding
Language: Jupyter Notebook - Size: 14.7 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 1

patterns-ai-core/milvus
Ruby wrapper for the Milvus vector search database API
Language: Ruby - Size: 76.2 KB - Last synced at: 12 days ago - Pushed at: 3 months ago - Stars: 29 - Forks: 6

suncloudsmoon/HyperVectorDB-APIFixes Fork of deatos/HyperVectorDB
HyperVectorDB – simple. powerful. A local vector database built in C#, engineered for effortless precision. Explore your data using Cosine, Jaccard, Euclidean, Manhattan, Chebyshev, and Canberra distances.
Language: C# - Size: 180 KB - Last synced at: 15 days ago - Pushed at: 5 months ago - Stars: 2 - Forks: 0

geekmaxi/developer_assistant
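Developer Assistant, an open-source technical Q&A assistant for developers. Built on advanced large-model technology, it answers programming questions, with a knowledge base covering Python, Java, and other mainstream development languages to help you develop efficiently.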
Language: TypeScript - Size: 979 KB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 3 - Forks: 0

IngestAI/embedditor
⚡ GUI for editing LLM vector embeddings. No more blind chunking. Upload content in any file extension, join and split chunks, edit metadata and embedding tokens + remove stop-words and punctuation with one click, add images, and download in .veml to share it with your team.
Language: PHP - Size: 1.74 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 224 - Forks: 15

tegridydev/Face-Based-Attention-Circuits
Face-Based Attention Circuits (FBAC): A Theoretical Framework for Context-Aware Embeddings
Size: 135 KB - Last synced at: 3 months ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

chriskillpack/henri
LLM powered image search
Language: Go - Size: 531 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

rajvirtual/ChatDocs
A .NET-based AI project leveraging Retrieval-Augmented Generation (RAG) and OpenAI to provide efficient, intelligent search capabilities for team documentation.
Language: C# - Size: 51.8 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

pwyp/embeddings
An introduction to vector embeddings, a fundamental concept widely used in machine learning. The Jupyter Notebook was prepared as part of an internal presentation for workmates.
Language: Jupyter Notebook - Size: 0 Bytes - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

0xIbra/linux-tower-gpt-embeddings-experiment
This project is a work-in-progress and serves as an experiment for context injection with GPT and code embeddings. The goal is to use GPT to develop the remaining features of the project.
Language: Python - Size: 3.06 MB - Last synced at: 4 months ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

pngo1997/Retrieval-Augmented-Retrieval-RAG-for-Cleantech-Media
Implements a Retrieval-Augmented Generation (RAG) system.
Language: Jupyter Notebook - Size: 21.7 MB - Last synced at: 12 days ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

namuan/notes-mind
Private / Local secure setup to chat with Apple Notes
Language: Python - Size: 814 KB - Last synced at: about 2 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

38832/Natural-Language-Processing
This repository features NLP and deep learning projects using LSTM, Bidirectional LSTM, Word2Vec, and TF-IDF, implemented with TensorFlow, Keras, and Scikit-Learn.
Language: Jupyter Notebook - Size: 225 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

attarmau/Multimodal-Misinformation-Detection
Multimodal deep learning model for fake news classification.
Size: 9.77 KB - Last synced at: 24 days ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

Blacknahil/semantic_search
A semantic search system for Wikipedia articles using Weaviate and Cohere. It indexes articles with custom embeddings and provides a query interface to retrieve the most relevant matches. The system demonstrates the power of vector-based search for natural language queries.
Language: Jupyter Notebook - Size: 7.47 MB - Last synced at: 4 months ago - Pushed at: 7 months ago - Stars: 1 - Forks: 0

dcarpintero/athena
Scientific Research Assistant built with LLMs, Retrieval Augmented Generation, and Semantic Search.
Language: Python - Size: 3.71 MB - Last synced at: 3 months ago - Pushed at: 12 months ago - Stars: 5 - Forks: 0

dcarpintero/wikisearch
Multilingual Semantic Search with Reranking on a prepared large vectorized dataset comprising 10 million Wikipedia documents. It supports dense retrieval, keyword search, and hybrid search.
Language: Python - Size: 625 KB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 13 - Forks: 1

shamspias/langchain-chat
langchain-chat is an AI-driven Q&A system that leverages OpenAI's GPT-4 model and FAISS for efficient document indexing. It loads and splits documents from websites or PDFs, remembers conversations, and provides accurate, context-aware answers based on the indexed data. Easy to set up and extend.
Language: Python - Size: 1.34 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 86 - Forks: 17

Frank40790/SemanticSpotlight
A tool for identifying related text in a large chunk of text
Language: Python - Size: 1.01 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

get-convex/dryad
Dryad talks to your tree! Easy semantic code search on any repository
Language: TypeScript - Size: 1.47 MB - Last synced at: 3 months ago - Pushed at: almost 2 years ago - Stars: 29 - Forks: 5

rsvinicius/spring-ai-demo
This project is a Spring Boot application that demonstrates a REST API using Ollama AI. It features embedding vectors, function calling, and streaming capabilities.
Language: Kotlin - Size: 47.9 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

ekantchandrakar/vectordb
A simple vector database for RAG applications
Language: Java - Size: 108 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0

Gabriellgpc/computer-vision-dataset-maker
The Power of Florence-2 with OpenVINO & FiftyOne: Real-World Applications in Image Analysis
Language: Python - Size: 11.7 KB - Last synced at: 5 days ago - Pushed at: 10 months ago - Stars: 2 - Forks: 0

pranshurastogi29/Post-analysis-and-Suggestion-Engine
This project applies text generation techniques to predictive keyboards and constrained post generation. It also provides sentiment and upvote prediction for Reddit post titles.
Language: Jupyter Notebook - Size: 8.63 MB - Last synced at: 2 months ago - Pushed at: about 4 years ago - Stars: 2 - Forks: 0

easonlai/chat_with_pdf_table
The contents of this repository showcase how to extract table data from a PDF file and preprocess it to facilitate word embedding. This preprocessing step enhances the readability of table data for language models and enables us to extract more contextual information from the tables.
Language: Jupyter Notebook - Size: 85.9 KB - Last synced at: 5 days ago - Pushed at: over 1 year ago - Stars: 9 - Forks: 4

ikergarcia1996/MetaVec
A monolingual and cross-lingual meta-embedding generation and evaluation framework
Language: Python - Size: 69.3 KB - Last synced at: 3 months ago - Pushed at: about 3 years ago - Stars: 80 - Forks: 5

Amir-Entezari/Text-Classification-Enhancements
Enhancing Text Classification in Information Retrieval: Evaluating the effectiveness of Naive Bayes classifiers with various word embeddings (Word2Vec, GloVe, FastText) for natural language processing tasks. This project explores performance differences and offers insights into embedding impacts on text classification.
Language: Jupyter Notebook - Size: 583 KB - Last synced at: 10 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

nssharmaofficial/review-sentiment-classifier
Review classification in PyTorch using LSTM
Language: Python - Size: 30.1 MB - Last synced at: 2 months ago - Pushed at: 10 months ago - Stars: 3 - Forks: 0

DerartuDagne/The-Complete-LangChain-LLMs-Guide Fork of PacktPublishing/The-Complete-LangChain-LLMs-Guide
This repository, forked from Packt Publishing, serves as a comprehensive guide to LangChain and LLMs, encompassing all the resources and knowledge gained from the on-demand course.
Language: Python - Size: 2.43 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

g-despot/social-network-analyzer
A simple program that can perform social network analysis tasks on graph data.
Language: Python - Size: 346 MB - Last synced at: 11 months ago - Pushed at: about 4 years ago - Stars: 4 - Forks: 0

norandom/log2ml
Master Thesis: Development and Evaluation of Software for Forensic Log-Analysis Using Machine Learning and Genetic Programming
Language: Jupyter Notebook - Size: 3.39 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

kozistr/triton-grpc-proxy-rs
Proxy server, written in Rust, for a Triton gRPC server that runs inference on an embedding model
Language: Rust - Size: 108 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 16 - Forks: 2

shahules786/Twitter-Sentiment
Sentiment analyzer for your tweets.
Language: Python - Size: 3.41 MB - Last synced at: 3 months ago - Pushed at: over 4 years ago - Stars: 63 - Forks: 11

France-Travail/embcompare
A simple Python tool for embedding comparison
Language: Python - Size: 27.9 MB - Last synced at: 2 months ago - Pushed at: over 1 year ago - Stars: 7 - Forks: 0

coder-backend/Stanford-Sentiment-Treebank
Rate customer reviews
Language: Jupyter Notebook - Size: 2.72 MB - Last synced at: 6 months ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

coder-backend/Predict-Job-Title-and-skills
Job Prediction given job description and skills
Language: Jupyter Notebook - Size: 324 KB - Last synced at: 6 months ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

taherfattahi/embedding-optimizer
Two approaches to generating optimized embeddings in the Retrieval-Augmented Generation (RAG) Pattern
Language: Python - Size: 161 KB - Last synced at: 2 days ago - Pushed at: about 1 year ago - Stars: 5 - Forks: 0

FrankyKyaw/DeepMelodyLSTM
An LSTM-based music generation model trained on MIDI data. The model takes in a sequence of a certain length and learns to predict the next note.
Language: Jupyter Notebook - Size: 2.66 MB - Last synced at: 4 months ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 0

egermano/poc-rag-ollama
Playing with Generative AI
Language: JavaScript - Size: 31.3 KB - Last synced at: 11 months ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

nabeel-ncz/document-ai-query
A web application that extracts, processes, and intelligently interacts with PDF content. Using natural language processing and vector embeddings, it transforms PDF text into high-dimensional vectors for efficient and accurate querying.
Language: TypeScript - Size: 188 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

olasunkanmi-SE/IntelliSearch
IntelliSearch is an advanced retrieval-based question-answering and recommendation system that leverages embeddings and a large language model (LLM) to provide accurate and relevant information to users.
Language: TypeScript - Size: 1010 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 3 - Forks: 0

danielkonecny/generating-faces
Generating faces with GANs and analyzing embedding space distribution for different classes as a Bachelor's Thesis at BUT FIT.
Language: Jupyter Notebook - Size: 97.5 MB - Last synced at: about 1 year ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

langfield/embedding-encoder
Autoencoder to compress distance matrices of pretrained embedding files.
Language: Python - Size: 2.7 MB - Last synced at: about 1 year ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

sjy-dv/mind-x
Mind-X is my intelligent alter ego that understands me the best. It assists with and resolves my bothersome tasks, growing in real-time as a next-generation PersonAI system.
Language: Go - Size: 39.6 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

ziozzang/embedding-server
Test embedding server (OpenAI API-compatible). Models from LLaMA/Mistral
Language: Python - Size: 6.84 KB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

labrijisaad/LLM-RAG
A Streamlit app leveraging a RAG LLM with FAISS to offer answers from uploaded files.
Language: Jupyter Notebook - Size: 1.31 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 3 - Forks: 0

JadenGeller/similarity-topology
Efficient nearest neighbor search in Swift
Language: Swift - Size: 95.7 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 0

enockjamin01/autocode
NLP LSTM model to predict Python code (text prediction, with tokenized special characters)
Language: Jupyter Notebook - Size: 3.56 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

crclark/graph-anns
Efficient approximate nearest neighbor search data structure
Language: Rust - Size: 2.42 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 0

webobite/Fact-Chatbot
Fact Chatbot is a project that reads a text file containing a set of facts ahead of time and answers user questions based on the facts provided in that file.
Language: Jupyter Notebook - Size: 85.9 KB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

venkat-a/Text_Processing_RNN_LSTM
Text Processing RNN leverages RNN and LSTM models for advanced text processing. It features deep learning techniques for NLP tasks, utilizing GloVe for word embeddings, aimed at both educational and practical applications.
Language: Jupyter Notebook - Size: 15.6 KB - Last synced at: 4 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

UBOS-tech/node-red-contrib-chromadb
Chroma is the open-source embedding database
Language: HTML - Size: 27.3 KB - Last synced at: 7 days ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 0

sdsc-innovation/itembed
Python library to train shallow embeddings on unordered sequences
Language: Python - Size: 21.2 MB - Last synced at: 13 days ago - Pushed at: 11 months ago - Stars: 3 - Forks: 0

mickymultani/Streaming-LLM-Chat
Interactive chat application leveraging OpenAI's GPT-4 for real-time conversation simulations. Built with Flask, this project showcases streaming LLM responses in a user-friendly web interface.
Language: Python - Size: 2.23 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 8 - Forks: 2

struct-chat/embedding
Vector Embedding Server in under 100 lines of code
Language: Python - Size: 15.6 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 19 - Forks: 2

DrRuin/Personalized-Real-Estate-Agent
In an industry where personalization is key to customer satisfaction, your company wants to revolutionize how clients interact with real estate listings. The goal is to create a personalized experience for each buyer, making the property search process more engaging and tailored to individual preferences.
Language: Jupyter Notebook - Size: 386 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

StepanTita/cam-bert
We propose a novel method of fine-tuning the model for a particular downstream task, which proves to be more efficient and generalizable. We demonstrate this on a fake news detection task using three distinct datasets, outperforming the baseline model in both same-dataset and cross-dataset zero-shot tests.
Language: Jupyter Notebook - Size: 119 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0
