An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: gensim-doc2vec

pbellot/ANF-TDM

Code, data, and documentation for the workshop "Machine learning for text classification", organized as part of the CNRS-INRAE National Training Action (Action Nationale de Formation) "Document exploration and information extraction" in 2020-21.

Language: Jupyter Notebook - Size: 57.3 MB - Last synced at: 27 days ago - Pushed at: 27 days ago - Stars: 3 - Forks: 1

KrishArul26/Text-Classification-DBpedia-ontology-classes-Using-LSTM

Text classification is the task of assigning a set of predefined categories to free text. Text classifiers can be used to organize, structure, and categorize pretty much anything. For example, new articles can be organized by topics, support tickets can be organized by urgency, chat conversations can be organized by language, brand mentions can be organized by sentiment, and so on.

Language: Python - Size: 27.3 MB - Last synced at: 12 days ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 1

mihirsam/Information-Extraction-using-CNN

Extract a summary from the given text using a convolutional neural network

Language: Jupyter Notebook - Size: 69.9 MB - Last synced at: 7 months ago - Pushed at: about 5 years ago - Stars: 8 - Forks: 5

luminoso/news-keywords-searcher

Proof-of-concept project that implements keyword search (text similarity) over a corpus

Language: Python - Size: 6.23 MB - Last synced at: 6 months ago - Pushed at: over 5 years ago - Stars: 4 - Forks: 0

AmirMotefaker/Analysis-of-Hotel-Customer-Sentiments

Sentiment analysis is an NLP technique that consists of extracting the emotions expressed in raw text.

Language: Jupyter Notebook - Size: 18.7 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

colurw/wiki_abstracts_NLP

Document-level semantic clustering. Unsupervised topic modelling.

Language: Python - Size: 1.26 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

SOUMEE2000/Applicant_Tracking_System

This Streamlit app is built for employers looking to match the best candidate resumes against a particular job description.

Language: Python - Size: 324 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 4

levitation-opensource/LegalSearch

A prototype legal text search engine that uses a semantic search algorithm in order to find related keywords and sort the results by relevance.

Language: Python - Size: 27.3 MB - Last synced at: about 1 month ago - Pushed at: over 4 years ago - Stars: 3 - Forks: 1

PengboLiu/Doc2Vec-Document-Similarity

Compute text similarity using Doc2Vec

Language: Python - Size: 20.5 KB - Last synced at: over 1 year ago - Pushed at: about 7 years ago - Stars: 129 - Forks: 37

akshataupadhye/News-articles-clustering-A-comparative-approach

A project featuring various NLP techniques and ML algorithms, such as topic modelling and paragraph embeddings, for document clustering. 📰📚

Language: Jupyter Notebook - Size: 186 KB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

SeanFlannery/NAR-Data-Discovery

Nucleic Acids Research Data Discovery

Language: Jupyter Notebook - Size: 10.8 MB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

VarunB-17/Personality-Inferencing-ML

Personality Inferencing ML

Language: Jupyter Notebook - Size: 166 KB - Last synced at: over 1 year ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

somjit101/NLP-Star-Trek-Scripts

Uses digital versions of the actual scripts of the 'Star Trek' science fiction series to perform interesting NLP tasks and answer questions on topic modelling, character properties, and the plot as a whole.

Language: Jupyter Notebook - Size: 8.51 MB - Last synced at: about 2 months ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

meet5398/NLP-Natural-Language-Processing-

This repository is a collection of six minor projects focused on Natural Language Processing (NLP) along with relevant datasets. The projects are designed to help individuals gain a better understanding of NLP by applying concepts to real-world problems. Additionally, the repository includes a file that provides a comprehensive overview of NLP.

Language: Jupyter Notebook - Size: 32.3 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 5 - Forks: 0

jacksavage/DocClusterART

Cluster documents with Fuzzy-ART and PV-DM

Language: Python - Size: 6.84 KB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 1

lordtt13/word-embeddings (fork of CYBINT-IN/word-embeddings)

Custom word embeddings created from latent features generated by gensim and Hugging Face models

Language: Jupyter Notebook - Size: 25.7 MB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 7 - Forks: 3

rochimfn/question-answering-konstitusi

Indonesian Constitution question answering system (Telegram bot, Streamlit page, and HTTP API)

Language: Python - Size: 38.1 KB - Last synced at: 21 days ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 0

mehak0503/Mass-media

Understanding the growth pattern in districts of India using mass media data

Language: Jupyter Notebook - Size: 1.89 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

suchetsapre/automated-clinical-note-annotation

Georgetown University Medical Center ICBI summer 2019 research project on automating the annotation of patients' clinical notes.

Language: Jupyter Notebook - Size: 78.6 MB - Last synced at: about 1 month ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

ResearchKernel/datascience

Language: Python - Size: 19.5 KB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

ucals/bettertogether

This app uses Machine Learning NLP/topic modeling/document similarity techniques to group OMSCS CS-6460 Fall 2018 students by interests based on their essays/writing assignments

Language: CSS - Size: 43.5 MB - Last synced at: almost 2 years ago - Pushed at: over 6 years ago - Stars: 2 - Forks: 1

yash111381/Data-Mining-to-Improve-Commercial-Movie-Success

Efficient Word2Vec vectors for Sentiment Analysis to improve Commercial Movie Success, done in two phases, involving machine learning and sentiment analysis.

Language: Jupyter Notebook - Size: 67 MB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0

Related Keywords
gensim-doc2vec (22), nlp (9), gensim-word2vec (7), gensim (6), python (4), doc2vec (4), tf-idf (4), nlp-machine-learning (4), sklearn (3), natural-language-processing (3), machine-learning (3), nltk (3), text-mining (3), matplotlib (3), cnn (2), bert (2), tensorflow2 (2), cosine-similarity (2), topic-modeling (2), spacy (2), text-similarity (2), pandas (2), bag-of-words (2), scikit-learn (2), word-embeddings (2), paragraph-vector (2), word2vec (2), lemmatization (2), clustering (2), nltk-python (2), euclidean-similarity (1), logistic-regression (1), embedding-models (1), web-crawler (1), hierarchicalclustering (1), clustering-analysis (1), lda (1), beautifulsoup4 (1), stopwords-removal (1), nucleic-acids (1), stopwords (1), tokenization (1), research (1), trigrams (1), pandas-dataframe (1), sentiment-analysis (1), textrazor (1), rnn (1), embedding-vectors (1), data-science (1), tfidf (1), indonesia (1), text-processing (1), huggingface-transformers (1), gensim-topic-modeling (1), paragraph2vec (1), fuzzy-art (1), adaptive-resonance-theory (1), tokenizer (1), spacy-nlp (1), regular-expressions (1), fasttext-python (1), bag-of-ngrams (1), star-trek (1), similarity-matrix (1), lda-model (1), json (1), data-mining (1), bert-embeddings (1), lsi-model (1), python3 (1), neural-network (1), flask (1), word2vec-model (1), word2vec-embeddinngs (1), stemming (1), rnn-tensorflow (1), restapi-framework (1), lstm-neural-networks (1), glove-embeddings (1), flask-application (1), bagofwords (1), attention-mechanism (1), workshop-materials (1), weka (1), text-classification (1), tdm (1), naive-bayes-classifier (1), kmeans (1), keras (1), deep-learning (1), bigrams (1), word-embedding (1), semantic-search-algorithm (1), searching-algorithm (1), search-in-text (1), search-algorithm (1), naturallanguageprocessing (1), streamlit (1), nltk-library (1)