An open API service providing repository metadata for many open source software ecosystems.

Topic: "document-embedding"

ddangelov/Top2Vec

Top2Vec learns jointly embedded topic, document and word vectors.

Language: Python - Size: 83.4 MB - Last synced at: 6 days ago - Pushed at: 6 months ago - Stars: 3,028 - Forks: 375

dissorial/doc-chatbot

Document chatbot — multiple files, topics, chat windows and chat history. Powered by GPT.

Language: TypeScript - Size: 2.54 MB - Last synced at: about 5 hours ago - Pushed at: almost 2 years ago - Stars: 852 - Forks: 146

BobXWu/FASTopic

A Fast, Adaptive, Stable, and Transferable Topic Model (NeurIPS 2024)

Language: Python - Size: 1.67 MB - Last synced at: 1 day ago - Pushed at: 2 months ago - Stars: 94 - Forks: 6

ddangelov/RESTful-Top2Vec

Expose a Top2Vec model with a REST API.

Language: Python - Size: 243 KB - Last synced at: 23 days ago - Pushed at: over 2 years ago - Stars: 89 - Forks: 20

EQTPartners/pause

🍊 PAUSE (Positive and Annealed Unlabeled Sentence Embedding), accepted by EMNLP'2021 🌴

Language: Python - Size: 83 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 25 - Forks: 1

samhavens/flair-as-service

Container-first, JSON-configurable, NLP REST service based on Flair

Language: Python - Size: 23.4 KB - Last synced at: about 1 year ago - Pushed at: over 5 years ago - Stars: 10 - Forks: 0

chen0040/java-text-embedding

Word embedding in Java

Language: Java - Size: 55.7 KB - Last synced at: 26 days ago - Pushed at: about 4 years ago - Stars: 7 - Forks: 1

marcomoldovan/hierarchical-language-modeling

We address the task of learning contextualized word, sentence and document representations with a hierarchical language model by stacking Transformer-based encoders on a sentence level and subsequently on a document level and performing masked token prediction.

Language: Jupyter Notebook - Size: 6.83 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 6 - Forks: 0

maxoodf/tgnews

Telegram Data Clustering Contest (Bossy Gnu's submission)

Language: C++ - Size: 41 KB - Last synced at: 26 days ago - Pushed at: about 4 years ago - Stars: 4 - Forks: 2

ehtisham-sadiq/Exploring-Word2Vec-and-Doc2Vec

Dive into the world of Word2Vec and Doc2Vec models to uncover insights and applications.

Language: Jupyter Notebook - Size: 2.93 KB - Last synced at: 2 months ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 0

mathiasbruun/politician2vec

Utilities for learning, manipulating, and visualising politician embeddings in semantic space and inferring party positions.

Language: Python - Size: 181 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

eriknovak/python-text-embedding-microservice

Service for producing text representations via word embeddings

Language: Python - Size: 248 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 2

Tobsky/DocuQuery

This Streamlit application demonstrates the integration of ChatGroq (Llama3 model), OpenAIEmbeddings, and FAISS for document embedding and retrieval.

Language: Python - Size: 1.58 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0

jdenes/TopicEmbeddings

An open-source framework to create and test document embeddings using topic models.

Language: Python - Size: 208 MB - Last synced at: about 1 year ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 0

cnuahs/semantic-history-search

A Chrome extension to provide semantic search over your browsing history.

Language: TypeScript - Size: 521 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

leyresv/Book_Recommendation_System

Content-based book recommendation system

Language: Python - Size: 24.4 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

stko-lab/LD-Connect

LD Connect: A Linked Data Portal for IOS Press Scientometrics

Language: JavaScript - Size: 2.96 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

ChiaraDiBonaventura/covid_opinion

Applying NLP to understand people's sentiment about Covid-19 and Government actions in Italy, conditional on their political affiliation.

Language: Jupyter Notebook - Size: 13.4 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

pprablanc/doc_embedding_topic_mod

Improving document embeddings with a weighted average of word embeddings through topic modeling

Language: R - Size: 1.37 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

inimah/Neural-Language-Models

Experiments on Neural Language Embeddings

Language: Python - Size: 187 MB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

Related Topics
word-embeddings (7), nlp (6), topic-modeling (5), semantic-search (3), word-embedding (3), natural-language-processing (3), top2vec (2), text-search (2), clustering (2), sentence-embeddings (2), sentence-encoder (2), langchain (2), machine-learning (2), language-model (2), deep-learning (2), openai (2), embeddings (2), nlp-machine-learning (2), word-vectors (2), llama3 (1), text-similarity (1), semantic-search-engine (1), restful-api (1), rest-api (1), fastapi (1), similarity-search (1), positive-unlabeled-learning (1), rag (1), motherbrain (1), classification-algorithm (1), retreival (1), cpp (1), document-clustering (1), neural-topic-models (1), document-similarity (1), telegram (1), neural-topic-modeling (1), text-analysis (1), word2vec (1), nlp-models (1), neural-network (1), vectorization (1), typescript (1), tailwindcss (1), reactjs (1), pinecone (1), pdf-processing (1), openai-api (1), nextjs (1), mongoose (1), gpt-4 (1), gpt-3 (1), chatbot (1), chat (1), manifest-v3 (1), langchain-js (1), huggingface-transformers (1), chrome-extension (1), browsing-history (1), brave-extension (1), angular (1), topic-vector (1), topic-search (1), topic-modelling (1), text-semantic-similarity (1), sentence-transformers (1), pre-trained-language-models (1), bert (1), topic-model (1), distributed-representations (1), translation-model (1), sequence-to-sequence (1), semi-supervised-learning (1), mono-language (1), cross-languages (1), cross-language-embeddings (1), binary-classification (1), bilingual-word-embedding (1), transformer (1), transfer-learning (1), representation-learning (1), pytorch (1), natural-language-understanding (1), information-retrieval (1), document-retrieval (1), attention-mechanism (1), glove-embeddings (1), glove (1), nlp-apis (1), kubernetes (1), flair-sota-nlp (1), flair-embeddings (1), flair (1), docker (1), microservice (1), cosine-similarity (1), bookrecommendsystem (1), groq (1), generative-ai (1), tsne-algorithm (1)