Topic: "embedding-models"
rbroc/simcat
A Python package to simulate multi-agent cognitive association tasks 🤖 🧠 👥
Language: Jupyter Notebook - Size: 58.3 MB - Last synced at: 24 days ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

leokwsw/local-rag
A local rag demo
Language: Python - Size: 18.6 KB - Last synced at: 12 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

zlataSt/OntologyEmbeddings
Train and learn JOIE, TransO and ReasonKGE embedding models on DBpedia and YAGO datasets
Language: Python - Size: 19.3 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

Arenzell/openai-chatfriend
AI Chat using ChatGPT AI and embeddings
Language: Vue - Size: 283 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

Shr3yash/EmbedroW
t-SNE, UMAP & PCA Projector Tool for custom data projection. Checkout the source link for the real time demo!
Language: HTML - Size: 13.2 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

copev313/Chatbot-Using-Deep-Learning
We build a chatbot by implementing machine learning and natural language processing.
Language: Jupyter Notebook - Size: 368 KB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

Koziev/sent_embedders
Experiments with sentence embedding models
Language: Python - Size: 7.7 MB - Last synced at: over 1 year ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 0

anisyusof-sc/kg-cs6216
A study to find a suitable temporal-based embedding model in detecting IoT malware through network analysis
Size: 37.8 MB - Last synced at: about 1 year ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 0

sumitsidana/NERvE
This repository contains the code for "Representation Learning and Pairwise Ranking for Implicit Feedback in Recommendation Systems". Read the paper here: https://arxiv.org/abs/1705.00105
Language: Python - Size: 7.48 MB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

bhargav-joshi/Baby-Names-Predictor
Baby Names Prediction
Language: Jupyter Notebook - Size: 8.79 KB - Last synced at: 14 days ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

negiaditya/PROJECT-Tensorflow-Keras_models
Tensorflow/keras based various models in the field of deep learning.
Language: Jupyter Notebook - Size: 9.12 MB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 1 - Forks: 0

Ulrich777/EY-NEXT-WAVE-CHALLENGE-2019
Language: Python - Size: 9.17 MB - Last synced at: about 2 years ago - Pushed at: almost 6 years ago - Stars: 1 - Forks: 0

raj-oo8/ai-samples
AI Orchestration Samples
Language: C# - Size: 99.6 KB - Last synced at: about 1 hour ago - Pushed at: about 2 hours ago - Stars: 0 - Forks: 0

RideneFiras/KagglexGoogle
Language: Jupyter Notebook - Size: 176 KB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 0 - Forks: 0

leducanh95/topic-modeling
Topic modeling and document clustering
Language: Python - Size: 19.5 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 0 - Forks: 0

GALA-MDS/Gala-External-Resources
This repository compiles and data sources created for the CHIST ERA 2025 proposal GALA.
Language: Jupyter Notebook - Size: 70.9 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 0 - Forks: 0

7446Nguyen/COFFEE_RAG
Get personalized coffee recommendations using Retrieval-Augmented Generation (RAG) to match your preferences with expert insights.
Language: Python - Size: 16.4 MB - Last synced at: 20 days ago - Pushed at: 21 days ago - Stars: 0 - Forks: 0

yuniko-software/power-embeddings
PowerEmbeddings is a C# library that makes embedding generation easier in .NET applications. It is aimed at simplifying the implementation of semantic search, full-text search, RAG, and hybrid search solutions within the .NET ecosystem
Size: 9.77 KB - Last synced at: 12 days ago - Pushed at: 22 days ago - Stars: 0 - Forks: 0

emapco/chem-mrl
Chem-MRL: SMILES Matryoshka Representation Learning Embedding Model
Language: Python - Size: 31.4 MB - Last synced at: 25 days ago - Pushed at: 25 days ago - Stars: 0 - Forks: 0

DeepLearn1998/My_RAG
My first RAG
Language: Python - Size: 5.86 KB - Last synced at: 15 days ago - Pushed at: 28 days ago - Stars: 0 - Forks: 0

Jiaxi-Huang/HackerLLM
Simple Work
Language: Vue - Size: 75.6 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

sathyaseelancr/RAGImplementation
Retrieval Augmented Generation - Buying a car
Language: Python - Size: 18 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

pranav-kural/ledaa-load-data
AWS Lambda function handling data ingestion in RAG pipeline of LEDAA project.
Language: Python - Size: 14.6 KB - Last synced at: 28 days ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

04bhavyaa/langchain-models
This project explores various LLMs and embedding models using LangChain, integrating OpenAI, Hugging Face, Google Gemini, and Anthropic. It includes chat models, document similarity search, and embeddings with cosine similarity for retrieval. The setup is simple, making it easy to experiment with LLMs and vector search. 🚀 (Big Thankyou to CampusX)
Language: Python - Size: 7.81 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

Scicrop/javaSentenceBertEmbedding
Java ONNX Embedding & Retrieval-Augmented Generation (RAG) Engine
Language: Java - Size: 23.4 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

pngo1997/Retrieval-Augmented-Retrieval-RAG-for-Cleantech-Media
Implements a Retrieval-Augmented Generation (RAG) system.
Language: Jupyter Notebook - Size: 21.7 MB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

AspadaX/dim
Use LLMs for effective and refined vectorizations.
Language: Rust - Size: 81.1 KB - Last synced at: 8 days ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

itmo-mbss-lab/sr_lectures_book
The project is related to the development of Basics of Voice Biometrics lecture book for the ITMO Speaker Recognition Course.
Language: TeX - Size: 1.15 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

vstep-chatbot/benchmark
Benchmark Vietnamese Embedding models and Tokenizers for RAG
Language: Jupyter Notebook - Size: 23.2 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

davide-abbattista/SciQA
The Scientific Question Answering (SciQA) System is an end-to-end solution designed to provide accurate, contextually relevant, and citation-supported answers to user queries.
Language: Jupyter Notebook - Size: 19.5 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

huacenxu/Embedding-Models-for-AI-Retrieval
This project develops a domain-specific embedding model to enhance document retrieval in AI-powered search systems. It incorporates techniques like synthetic data generation, model fine-tuning, and vector search using FAISS, evaluated with MRR@5 for performance.
Language: Python - Size: 4.88 KB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

chatterjeesaurabh/Natural-Language-Processing
Text Preprocessing, Embedding Methods such as BoW, TF-IDF and Word2Vec, Text Classification using LSTM, Topic Modeling with LDA and BERTopic.
Language: Jupyter Notebook - Size: 222 KB - Last synced at: about 2 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

pengkenlim/CxNE_plants
Generation Co-expression Network Embeddings (CxNEs) for plant genes using Graph Attention Networks (GAT))
Language: Python - Size: 178 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

stackmodel/babyagi-autonomous-agents
Demonstrates how to implement BabyAGI by Yohei Nakajima.
Language: Python - Size: 31.3 KB - Last synced at: 26 days ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

TajaKuzman/pandachat-rag-benchmark
PandaChat-RAG benchmark for evaluation of RAG systems on a non-synthetic Slovenian test dataset.
Language: Python - Size: 842 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

celiason/museum-news
webapp to find out historic details about the museum
Language: Python - Size: 6.38 MB - Last synced at: about 2 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

hase3b/SCPRAG
This repository implements a Retrieval-Augmented Generation (RAG) system for the Supreme Court of Pakistan, utilizing different LLMs, embedding models, and retrieval and generation enhancement strategies. It processes SCP judgments, applies chunking, and generates legal summaries and answers based on relevant case data.
Language: Jupyter Notebook - Size: 57.4 MB - Last synced at: about 1 month ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

ai-lluminator/ai-training
This repository contains all of the AI training and data generation scripts for the AIlluminator project.
Language: Python - Size: 10.3 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

akthammomani/Casual_Conversation_Chatbot
Build a Multi-turn Conversations Chit-Chat Bot
Language: Jupyter Notebook - Size: 10.8 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

akshay-kamath/generative-ai
Language: Jupyter Notebook - Size: 4.53 MB - Last synced at: about 2 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

templateprotection/AimNet-Mouse-Dynamics
An open sourced approach to One-Shot Learning for Mouse Dynamics recognition in PyTorch. This includes tools for data preprocessing, training both classification and embedding models, and evaluating model performance on a Minecraft dataset.
Language: Python - Size: 168 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

hillaryke/contract-qa-high-precision-rag
A RAG system for Contract Q&A that enables chatting with a contract and asking questions about the contract. It has an interface build with React and FastAPI in backend integrating rag-pipeline with Autogen agents and websockets for communication. Evaluation of the RAG is done using RAGAS.
Language: Jupyter Notebook - Size: 804 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

MAMMAD1381/RAG
RAG system made using IR and LLMs
Language: Jupyter Notebook - Size: 15.1 MB - Last synced at: 3 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

hiteshkumar9211/Hindi-Text-summarization-major Fork of akshaykumar46/Hindi-Text-summarization-major
Hindi-Text-summarization-major project
Size: 11.7 KB - Last synced at: 9 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

tusharpandey003/Basics-of-machine-learning
Basics of machine learning is END-TO-END Repository which includes very Basic Machine Learning Models and Notebook
Language: Jupyter Notebook - Size: 26.6 MB - Last synced at: about 1 month ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

hwb96/M3E-Embedder
M3E-Embedder 是一个基于 Docker 的服务,旨在方便地部署和运行 m3e embedding嵌入模型,支持多种嵌入模型快速集成和高效计算。
Language: Python - Size: 23.4 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

afmika/yw2v
Yet another word2vec implementation from scratch
Language: Python - Size: 25.4 KB - Last synced at: 10 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

nux-ai/vectors
Toolkit designed for developers to evaluate, select, and deploy embedding models. It streamlines the lifecycle from model evaluation to data embedding and querying.
Language: Python - Size: 5.86 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

songye38/2024_embedding_study
한국어 임베딩 책을 바탕으로 임베딩 모델에 대한 공부
Size: 2.93 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

piyush-jaiswal/PDFConverse
Chat with your PDF!
Language: Python - Size: 340 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

julep-ai/dialog-fact-encoder
Dialog-Fact Encoder embeds conversational dialog turns and factual statements
Language: Jupyter Notebook - Size: 226 KB - Last synced at: 4 days ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

jasminebilir/cs224N-transformer-ensemble-network
Ensemble Network Including Transformer Models for NLP Patient Text and ED Visit Prediction
Size: 6.64 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

ethan-grinberg/misinformation-cluster-analysis
Clustering diffusion networks of low credibility sources by topology through unsupervised graph embeddings
Language: Python - Size: 208 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

krishnamohanathota/GenerativeAI
Generative AI concepts and POCs
Language: Jupyter Notebook - Size: 10.6 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

ishaanjav/prot2tex-protein-search
Semantic search tool for proteins based off natural-language functional description
Language: TypeScript - Size: 4.92 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

singkuangtan/BSautonet
BSnet Autoencoder
Language: Python - Size: 21.4 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

santhu932/Topic-Modeling-ETM
Attempted to replicate the research paper on the subject of topic modeling within embedded spaces.
Language: Jupyter Notebook - Size: 2.51 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

nomnomnonono/Paper-Search
Application to search for similar papers by title and abstract, keywords.
Language: Python - Size: 96.7 KB - Last synced at: 22 days ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

r3sist-uniq/BookWanderer
download any (almost) book you want
Language: Python - Size: 31.3 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 2

Ren-Mingyang/net-community-number-est
Consistent Estimation of the Number of Communities via Regularized Network Embedding
Language: R - Size: 181 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

huacenxu/COVID-Morality
This project builds a novel liberty dictionary to quantify liberty morality—a concept missing from the extended Moral Foundations Dictionary (eMFD)—and leverages it to study the relationship between audience engagement and COVID-related news.
Language: Jupyter Notebook - Size: 8.57 MB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

Nourelhouda-Yahi/Preprocessing-impact-on-Doc2vec
Morphosyntactic Preprocessing Impact on Document Embedding
Language: Python - Size: 1.33 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

gautamvr/SkimLit
An NLP (Natural Language Processing) model that analyzes a research paper and categorizing it into objective, methods, results, etc, providing researchers to skim through the research paper easily with brief details.
Language: PureBasic - Size: 18.6 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

TinsaeW/Word2Vec-from-Scratch
Word2vec algorithm from scratch
Language: Jupyter Notebook - Size: 6.76 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

VarunB-17/Personality-Inferencing-ML
Personality Inferencing ML
Language: Jupyter Notebook - Size: 166 KB - Last synced at: over 1 year ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

LiadzZ/Pairs-Relation-Machine-Learning-Neural-Network
5 Neural Networks architecture, 3 types of datasets, 3 pre-processing pipelines to use.
Language: Python - Size: 2.77 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

nathan-az/multi-trait-sgns
PyTorch implementation of skip-gram negative sampling for learning weighted item embeddings for items with side information.
Language: Python - Size: 26.4 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

Mya-Mya/Novel2VecWeb
Word2Vec の小説バージョン
Language: Python - Size: 3.1 MB - Last synced at: about 1 month ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

resuly/Traffic-Embedding
Codes for "Revealing the hidden features in traffic prediction via entity embedding"
Language: Jupyter Notebook - Size: 8.87 MB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

rajrohan/ramayanaocr
A Visual Narrative of Ramayana using Extractive Summarisation, Topic Modeling and NER tagging
Language: Jupyter Notebook - Size: 27.4 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 1

hossamhasanin/movies_recommender
Find new movies based on your last one our a story of movie you liked
Language: Python - Size: 10.7 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

OmarMohammed88/Categorize-Documents-
NLP TASKS using BERT , USE and other techniques
Language: Jupyter Notebook - Size: 33.2 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

rakibhhridoy/Natural-Language-Processing-Steps
Preprocess data in nlp text classification and text sequence in TensorFlow. There's different steps in both classification and sequence task, thus it need different steps. These steps in TensorFlow is so much easy if you get into it.
Language: Jupyter Notebook - Size: 2.88 MB - Last synced at: 2 months ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

RodolfoLSS/multi-label-text-classification
Multi-label Text Classification with Scikit-learn and Tensorflow
Language: Jupyter Notebook - Size: 932 KB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 0

JudePark96/nlp-embedding-tutorial
Embedding Model Study
Language: Jupyter Notebook - Size: 43 MB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

sontung/hci-intermodal-reasoning
Fachpraktikum project for Human-computer interaction course
Language: Jupyter Notebook - Size: 6.12 MB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 1

ChiragSaini/Textual-Similarity
This notebook provides textual similarity between given two paragraphs. Google universal sentence encoder is used to create embeddings for these words.
Language: Jupyter Notebook - Size: 33.6 MB - Last synced at: about 1 month ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

anantSinghCross/poems_categorisation_neural_networks
A neural networks project that categorizes different poems in the dataset according to their genre
Language: Python - Size: 595 KB - Last synced at: about 2 years ago - Pushed at: almost 6 years ago - Stars: 0 - Forks: 0

Craq/quora-kaggle
Quora kaggle competition research.
Language: Jupyter Notebook - Size: 10.7 KB - Last synced at: about 2 years ago - Pushed at: about 6 years ago - Stars: 0 - Forks: 0

Vvkmnn/touristAI
🇫🇷 English to French Translation via Python 3 and Keras RNNs.
Language: HTML - Size: 5.39 MB - Last synced at: about 2 years ago - Pushed at: about 7 years ago - Stars: 0 - Forks: 0

jerrygaoLondon/AdaGram.jl Fork of sbos/AdaGram.jl
Adaptive Skip-gram implementation in Julia
Language: Julia - Size: 9.89 MB - Last synced at: 5 months ago - Pushed at: over 8 years ago - Stars: 0 - Forks: 0

jerrygaoLondon/mutli-sense-embedding Fork of jiweil/mutli-sense-embedding
Language: Java - Size: 1.99 MB - Last synced at: 5 months ago - Pushed at: over 9 years ago - Stars: 0 - Forks: 0
