Topic: "document-embedding"
ddangelov/Top2Vec
Top2Vec learns jointly embedded topic, document and word vectors.
Language: Python - Size: 83.4 MB - Last synced at: 6 days ago - Pushed at: 6 months ago - Stars: 3,028 - Forks: 375

dissorial/doc-chatbot
Document chatbot — multiple files, topics, chat windows and chat history. Powered by GPT.
Language: TypeScript - Size: 2.54 MB - Last synced at: about 5 hours ago - Pushed at: almost 2 years ago - Stars: 852 - Forks: 146

BobXWu/FASTopic
A Fast, Adaptive, Stable, and Transferable Topic Model (NeurIPS 2024)
Language: Python - Size: 1.67 MB - Last synced at: 1 day ago - Pushed at: 2 months ago - Stars: 94 - Forks: 6

ddangelov/RESTful-Top2Vec
Expose a Top2Vec model with a REST API.
Language: Python - Size: 243 KB - Last synced at: 23 days ago - Pushed at: over 2 years ago - Stars: 89 - Forks: 20

EQTPartners/pause
🍊 PAUSE (Positive and Annealed Unlabeled Sentence Embedding), accepted by EMNLP'2021 🌴
Language: Python - Size: 83 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 25 - Forks: 1

samhavens/flair-as-service
Container-first, JSON-configurable, NLP REST service based on Flair
Language: Python - Size: 23.4 KB - Last synced at: about 1 year ago - Pushed at: over 5 years ago - Stars: 10 - Forks: 0

chen0040/java-text-embedding
Word embedding in Java
Language: Java - Size: 55.7 KB - Last synced at: 26 days ago - Pushed at: about 4 years ago - Stars: 7 - Forks: 1

marcomoldovan/hierarchical-language-modeling
We address the task of learning contextualized word, sentence and document representations with a hierarchical language model, stacking Transformer-based encoders first at the sentence level and then at the document level, and performing masked token prediction.
Language: Jupyter Notebook - Size: 6.83 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 6 - Forks: 0

maxoodf/tgnews
Telegram Data Clustering Contest (Bossy Gnu's submission)
Language: C++ - Size: 41 KB - Last synced at: 26 days ago - Pushed at: about 4 years ago - Stars: 4 - Forks: 2

ehtisham-sadiq/Exploring-Word2Vec-and-Doc2Vec
Dive into the world of Word2Vec and Doc2Vec models to uncover insights and applications.
Language: Jupyter Notebook - Size: 2.93 KB - Last synced at: 2 months ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 0

mathiasbruun/politician2vec
Utilities for learning, manipulating, and visualising politician embeddings in semantic space and inferring party positions.
Language: Python - Size: 181 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

eriknovak/python-text-embedding-microservice
Service for producing text representations via word embeddings
Language: Python - Size: 248 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 2

Tobsky/DocuQuery
This Streamlit application demonstrates the integration of ChatGroq (Llama3 model), OpenAIEmbeddings, and FAISS for document embedding and retrieval.
Language: Python - Size: 1.58 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0

jdenes/TopicEmbeddings
An open-source framework to create and test document embeddings using topic models.
Language: Python - Size: 208 MB - Last synced at: about 1 year ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 0

cnuahs/semantic-history-search
A Chrome extension to provide semantic search over your browsing history.
Language: TypeScript - Size: 521 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

leyresv/Book_Recommendation_System
Content-based book recommendation system
Language: Python - Size: 24.4 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

stko-lab/LD-Connect
LD Connect: A Linked Data Portal for IOS Press Scientometrics
Language: JavaScript - Size: 2.96 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

ChiaraDiBonaventura/covid_opinion
Applying NLP to understand people's sentiment about Covid-19 and Government actions in Italy, conditional on their political affiliation.
Language: Jupyter Notebook - Size: 13.4 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

pprablanc/doc_embedding_topic_mod
Improving document embeddings with a weighted average of word embeddings through topic modeling
Language: R - Size: 1.37 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

inimah/Neural-Language-Models
Experiments on Neural Language Embeddings
Language: Python - Size: 187 MB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0
