An open API service providing repository metadata for many open source software ecosystems.

Topic: "embedding-models"

rbroc/simcat

A Python package to simulate multi-agent cognitive association tasks 🤖 🧠 👥

Language: Jupyter Notebook - Size: 58.3 MB - Last synced at: 24 days ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

leokwsw/local-rag

A local rag demo

Language: Python - Size: 18.6 KB - Last synced at: 12 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

zlataSt/OntologyEmbeddings

Train and learn JOIE, TransO and ReasonKGE embedding models on DBpedia and YAGO datasets

Language: Python - Size: 19.3 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

Arenzell/openai-chatfriend

AI Chat using ChatGPT AI and embeddings

Language: Vue - Size: 283 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

Shr3yash/EmbedroW

t-SNE, UMAP & PCA Projector Tool for custom data projection. Checkout the source link for the real time demo!

Language: HTML - Size: 13.2 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

copev313/Chatbot-Using-Deep-Learning

We build a chatbot by implementing machine learning and natural language processing.

Language: Jupyter Notebook - Size: 368 KB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

Koziev/sent_embedders

Experiments with sentence embedding models

Language: Python - Size: 7.7 MB - Last synced at: over 1 year ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 0

anisyusof-sc/kg-cs6216

A study to find a suitable temporal-based embedding model in detecting IoT malware through network analysis

Size: 37.8 MB - Last synced at: about 1 year ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 0

sumitsidana/NERvE

This repository contains the code for "Representation Learning and Pairwise Ranking for Implicit Feedback in Recommendation Systems". Read the paper here: https://arxiv.org/abs/1705.00105

Language: Python - Size: 7.48 MB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

bhargav-joshi/Baby-Names-Predictor

Baby Names Prediction

Language: Jupyter Notebook - Size: 8.79 KB - Last synced at: 14 days ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

negiaditya/PROJECT-Tensorflow-Keras_models

Tensorflow/keras based various models in the field of deep learning.

Language: Jupyter Notebook - Size: 9.12 MB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 1 - Forks: 0

Ulrich777/EY-NEXT-WAVE-CHALLENGE-2019

Language: Python - Size: 9.17 MB - Last synced at: about 2 years ago - Pushed at: almost 6 years ago - Stars: 1 - Forks: 0

raj-oo8/ai-samples

AI Orchestration Samples

Language: C# - Size: 99.6 KB - Last synced at: about 1 hour ago - Pushed at: about 2 hours ago - Stars: 0 - Forks: 0

RideneFiras/KagglexGoogle

Language: Jupyter Notebook - Size: 176 KB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 0 - Forks: 0

leducanh95/topic-modeling

Topic modeling and document clustering

Language: Python - Size: 19.5 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 0 - Forks: 0

GALA-MDS/Gala-External-Resources

This repository compiles and data sources created for the CHIST ERA 2025 proposal GALA.

Language: Jupyter Notebook - Size: 70.9 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 0 - Forks: 0

7446Nguyen/COFFEE_RAG

Get personalized coffee recommendations using Retrieval-Augmented Generation (RAG) to match your preferences with expert insights.

Language: Python - Size: 16.4 MB - Last synced at: 20 days ago - Pushed at: 21 days ago - Stars: 0 - Forks: 0

yuniko-software/power-embeddings

PowerEmbeddings is a C# library that makes embedding generation easier in .NET applications. It is aimed at simplifying the implementation of semantic search, full-text search, RAG, and hybrid search solutions within the .NET ecosystem

Size: 9.77 KB - Last synced at: 12 days ago - Pushed at: 22 days ago - Stars: 0 - Forks: 0

emapco/chem-mrl

Chem-MRL: SMILES Matryoshka Representation Learning Embedding Model

Language: Python - Size: 31.4 MB - Last synced at: 25 days ago - Pushed at: 25 days ago - Stars: 0 - Forks: 0

DeepLearn1998/My_RAG

My first RAG

Language: Python - Size: 5.86 KB - Last synced at: 15 days ago - Pushed at: 28 days ago - Stars: 0 - Forks: 0

Jiaxi-Huang/HackerLLM

Simple Work

Language: Vue - Size: 75.6 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

sathyaseelancr/RAGImplementation

Retrieval Augmented Generation - Buying a car

Language: Python - Size: 18 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

pranav-kural/ledaa-load-data

AWS Lambda function handling data ingestion in RAG pipeline of LEDAA project.

Language: Python - Size: 14.6 KB - Last synced at: 28 days ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

04bhavyaa/langchain-models

This project explores various LLMs and embedding models using LangChain, integrating OpenAI, Hugging Face, Google Gemini, and Anthropic. It includes chat models, document similarity search, and embeddings with cosine similarity for retrieval. The setup is simple, making it easy to experiment with LLMs and vector search. 🚀 (Big Thankyou to CampusX)

Language: Python - Size: 7.81 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

Scicrop/javaSentenceBertEmbedding

Java ONNX Embedding & Retrieval-Augmented Generation (RAG) Engine

Language: Java - Size: 23.4 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

pngo1997/Retrieval-Augmented-Retrieval-RAG-for-Cleantech-Media

Implements a Retrieval-Augmented Generation (RAG) system.

Language: Jupyter Notebook - Size: 21.7 MB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

AspadaX/dim

Use LLMs for effective and refined vectorizations.

Language: Rust - Size: 81.1 KB - Last synced at: 8 days ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

itmo-mbss-lab/sr_lectures_book

The project is related to the development of Basics of Voice Biometrics lecture book for the ITMO Speaker Recognition Course.

Language: TeX - Size: 1.15 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

vstep-chatbot/benchmark

Benchmark Vietnamese Embedding models and Tokenizers for RAG

Language: Jupyter Notebook - Size: 23.2 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

davide-abbattista/SciQA

The Scientific Question Answering (SciQA) System is an end-to-end solution designed to provide accurate, contextually relevant, and citation-supported answers to user queries.

Language: Jupyter Notebook - Size: 19.5 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

huacenxu/Embedding-Models-for-AI-Retrieval

This project develops a domain-specific embedding model to enhance document retrieval in AI-powered search systems. It incorporates techniques like synthetic data generation, model fine-tuning, and vector search using FAISS, evaluated with MRR@5 for performance.

Language: Python - Size: 4.88 KB - Last synced at: about 1 month ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

chatterjeesaurabh/Natural-Language-Processing

Text Preprocessing, Embedding Methods such as BoW, TF-IDF and Word2Vec, Text Classification using LSTM, Topic Modeling with LDA and BERTopic.

Language: Jupyter Notebook - Size: 222 KB - Last synced at: about 2 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

pengkenlim/CxNE_plants

Generation Co-expression Network Embeddings (CxNEs) for plant genes using Graph Attention Networks (GAT))

Language: Python - Size: 178 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

stackmodel/babyagi-autonomous-agents

Demonstrates how to implement BabyAGI by Yohei Nakajima.

Language: Python - Size: 31.3 KB - Last synced at: 26 days ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

TajaKuzman/pandachat-rag-benchmark

PandaChat-RAG benchmark for evaluation of RAG systems on a non-synthetic Slovenian test dataset.

Language: Python - Size: 842 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

celiason/museum-news

webapp to find out historic details about the museum

Language: Python - Size: 6.38 MB - Last synced at: about 2 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

hase3b/SCPRAG

This repository implements a Retrieval-Augmented Generation (RAG) system for the Supreme Court of Pakistan, utilizing different LLMs, embedding models, and retrieval and generation enhancement strategies. It processes SCP judgments, applies chunking, and generates legal summaries and answers based on relevant case data.

Language: Jupyter Notebook - Size: 57.4 MB - Last synced at: about 1 month ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

ai-lluminator/ai-training

This repository contains all of the AI training and data generation scripts for the AIlluminator project.

Language: Python - Size: 10.3 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

akthammomani/Casual_Conversation_Chatbot

Build a Multi-turn Conversations Chit-Chat Bot

Language: Jupyter Notebook - Size: 10.8 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

akshay-kamath/generative-ai

Language: Jupyter Notebook - Size: 4.53 MB - Last synced at: about 2 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

templateprotection/AimNet-Mouse-Dynamics

An open sourced approach to One-Shot Learning for Mouse Dynamics recognition in PyTorch. This includes tools for data preprocessing, training both classification and embedding models, and evaluating model performance on a Minecraft dataset.

Language: Python - Size: 168 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

hillaryke/contract-qa-high-precision-rag

A RAG system for Contract Q&A that enables chatting with a contract and asking questions about the contract. It has an interface build with React and FastAPI in backend integrating rag-pipeline with Autogen agents and websockets for communication. Evaluation of the RAG is done using RAGAS.

Language: Jupyter Notebook - Size: 804 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

MAMMAD1381/RAG

RAG system made using IR and LLMs

Language: Jupyter Notebook - Size: 15.1 MB - Last synced at: 3 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

hiteshkumar9211/Hindi-Text-summarization-major Fork of akshaykumar46/Hindi-Text-summarization-major

Hindi-Text-summarization-major project

Size: 11.7 KB - Last synced at: 9 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

tusharpandey003/Basics-of-machine-learning

Basics of machine learning is END-TO-END Repository which includes very Basic Machine Learning Models and Notebook

Language: Jupyter Notebook - Size: 26.6 MB - Last synced at: about 1 month ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

hwb96/M3E-Embedder

M3E-Embedder 是一个基于 Docker 的服务,旨在方便地部署和运行 m3e embedding嵌入模型,支持多种嵌入模型快速集成和高效计算。

Language: Python - Size: 23.4 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

afmika/yw2v

Yet another word2vec implementation from scratch

Language: Python - Size: 25.4 KB - Last synced at: 10 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

nux-ai/vectors

Toolkit designed for developers to evaluate, select, and deploy embedding models. It streamlines the lifecycle from model evaluation to data embedding and querying.

Language: Python - Size: 5.86 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

songye38/2024_embedding_study

한국어 임베딩 책을 바탕으로 임베딩 모델에 대한 공부

Size: 2.93 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

piyush-jaiswal/PDFConverse

Chat with your PDF!

Language: Python - Size: 340 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

julep-ai/dialog-fact-encoder

Dialog-Fact Encoder embeds conversational dialog turns and factual statements

Language: Jupyter Notebook - Size: 226 KB - Last synced at: 4 days ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

jasminebilir/cs224N-transformer-ensemble-network

Ensemble Network Including Transformer Models for NLP Patient Text and ED Visit Prediction

Size: 6.64 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

ethan-grinberg/misinformation-cluster-analysis

Clustering diffusion networks of low credibility sources by topology through unsupervised graph embeddings

Language: Python - Size: 208 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

krishnamohanathota/GenerativeAI

Generative AI concepts and POCs

Language: Jupyter Notebook - Size: 10.6 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

ishaanjav/prot2tex-protein-search

Semantic search tool for proteins based off natural-language functional description

Language: TypeScript - Size: 4.92 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

singkuangtan/BSautonet

BSnet Autoencoder

Language: Python - Size: 21.4 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

santhu932/Topic-Modeling-ETM

Attempted to replicate the research paper on the subject of topic modeling within embedded spaces.

Language: Jupyter Notebook - Size: 2.51 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

nomnomnonono/Paper-Search

Application to search for similar papers by title and abstract, keywords.

Language: Python - Size: 96.7 KB - Last synced at: 22 days ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

r3sist-uniq/BookWanderer

download any (almost) book you want

Language: Python - Size: 31.3 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 2

Ren-Mingyang/net-community-number-est

Consistent Estimation of the Number of Communities via Regularized Network Embedding

Language: R - Size: 181 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

huacenxu/COVID-Morality

This project builds a novel liberty dictionary to quantify liberty morality—a concept missing from the extended Moral Foundations Dictionary (eMFD)—and leverages it to study the relationship between audience engagement and COVID-related news.

Language: Jupyter Notebook - Size: 8.57 MB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

Nourelhouda-Yahi/Preprocessing-impact-on-Doc2vec

Morphosyntactic Preprocessing Impact on Document Embedding

Language: Python - Size: 1.33 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

gautamvr/SkimLit

An NLP (Natural Language Processing) model that analyzes a research paper and categorizing it into objective, methods, results, etc, providing researchers to skim through the research paper easily with brief details.

Language: PureBasic - Size: 18.6 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

TinsaeW/Word2Vec-from-Scratch

Word2vec algorithm from scratch

Language: Jupyter Notebook - Size: 6.76 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

VarunB-17/Personality-Inferencing-ML

Personality Inferencing ML

Language: Jupyter Notebook - Size: 166 KB - Last synced at: over 1 year ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

LiadzZ/Pairs-Relation-Machine-Learning-Neural-Network

5 Neural Networks architecture, 3 types of datasets, 3 pre-processing pipelines to use.

Language: Python - Size: 2.77 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

nathan-az/multi-trait-sgns

PyTorch implementation of skip-gram negative sampling for learning weighted item embeddings for items with side information.

Language: Python - Size: 26.4 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

Mya-Mya/Novel2VecWeb

Word2Vec の小説バージョン

Language: Python - Size: 3.1 MB - Last synced at: about 1 month ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

resuly/Traffic-Embedding

Codes for "Revealing the hidden features in traffic prediction via entity embedding"

Language: Jupyter Notebook - Size: 8.87 MB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

rajrohan/ramayanaocr

A Visual Narrative of Ramayana using Extractive Summarisation, Topic Modeling and NER tagging

Language: Jupyter Notebook - Size: 27.4 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 1

hossamhasanin/movies_recommender

Find new movies based on your last one our a story of movie you liked

Language: Python - Size: 10.7 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

OmarMohammed88/Categorize-Documents-

NLP TASKS using BERT , USE and other techniques

Language: Jupyter Notebook - Size: 33.2 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

rakibhhridoy/Natural-Language-Processing-Steps

Preprocess data in nlp text classification and text sequence in TensorFlow. There's different steps in both classification and sequence task, thus it need different steps. These steps in TensorFlow is so much easy if you get into it.

Language: Jupyter Notebook - Size: 2.88 MB - Last synced at: 2 months ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

RodolfoLSS/multi-label-text-classification

Multi-label Text Classification with Scikit-learn and Tensorflow

Language: Jupyter Notebook - Size: 932 KB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 0

JudePark96/nlp-embedding-tutorial

Embedding Model Study

Language: Jupyter Notebook - Size: 43 MB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

sontung/hci-intermodal-reasoning

Fachpraktikum project for Human-computer interaction course

Language: Jupyter Notebook - Size: 6.12 MB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 1

ChiragSaini/Textual-Similarity

This notebook provides textual similarity between given two paragraphs. Google universal sentence encoder is used to create embeddings for these words.

Language: Jupyter Notebook - Size: 33.6 MB - Last synced at: about 1 month ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

anantSinghCross/poems_categorisation_neural_networks

A neural networks project that categorizes different poems in the dataset according to their genre

Language: Python - Size: 595 KB - Last synced at: about 2 years ago - Pushed at: almost 6 years ago - Stars: 0 - Forks: 0

Craq/quora-kaggle

Quora kaggle competition research.

Language: Jupyter Notebook - Size: 10.7 KB - Last synced at: about 2 years ago - Pushed at: about 6 years ago - Stars: 0 - Forks: 0

Vvkmnn/touristAI

🇫🇷 English to French Translation via Python 3 and Keras RNNs.

Language: HTML - Size: 5.39 MB - Last synced at: about 2 years ago - Pushed at: about 7 years ago - Stars: 0 - Forks: 0

jerrygaoLondon/AdaGram.jl Fork of sbos/AdaGram.jl

Adaptive Skip-gram implementation in Julia

Language: Julia - Size: 9.89 MB - Last synced at: 5 months ago - Pushed at: over 8 years ago - Stars: 0 - Forks: 0

jerrygaoLondon/mutli-sense-embedding Fork of jiweil/mutli-sense-embedding

Language: Java - Size: 1.99 MB - Last synced at: 5 months ago - Pushed at: over 9 years ago - Stars: 0 - Forks: 0

Related Topics
machine-learning 31 embeddings 29 nlp 24 python 23 deep-learning 21 rag 20 llm 17 openai 16 langchain 16 retrieval-augmented-generation 14 vector-database 14 vector-search 13 word2vec 12 natural-language-processing 12 tensorflow 10 pytorch 9 embedding-vectors 9 neural-networks 9 sentence-embeddings 9 pinecone 9 faiss 9 fine-tuning 9 semantic-search 8 keras 8 huggingface 8 embedding 8 sentence-transformers 7 ai 7 word-embeddings 6 nlp-machine-learning 6 generative-ai 6 chatbot 6 recommender-system 6 deep-neural-networks 6 neural-network 6 retrieval 6 text-classification 5 knowledge-graph 5 bert 5 large-language-models 5 recommendation-system 5 openai-api 5 information-retrieval 5 rnn 5 lstm 5 llms 5 artificial-intelligence 4 natural-language 4 unsupervised-learning 4 wordembedding 4 glove-embeddings 4 knowledge-graph-embeddings 4 prompt-engineering 4 embedding-python 4 flask 4 gpt-4 4 function-calling 3 onnx 3 embeddings-word2vec 3 data-science 3 bert-model 3 text-mining 3 transformer 3 azure-openai 3 clustering 3 topic-modeling 3 classification 3 gru 3 computer-vision 3 evaluation 3 lstm-neural-networks 3 gpt-3 3 text-analysis 3 langchain-python 3 mteb 3 chromadb 3 preprocessing 3 semantic-similarity 3 knowledge-graph-completion 3 chroma 3 network-analysis 3 streamlit-webapp 3 awesome 3 vector 3 embedded 2 cnn 2 qdrant 2 vision-language-model 2 music-information-retrieval 2 awesome-list 2 cross-lingual 2 conversational-ai 2 pretrained-models 2 language-model 2 visualization 2 tensorflow-projector 2 autoencoder 2 fasttext-embeddings 2 fasttext 2 bag-of-words 2