Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: doc2vec

stivenramireza/spark-text-mining

Big data processing of news with Text Mining in Apache Spark through 3 fundamental processes: data preparation, searching based on the inverted index and grouping of news by similarity.

Language: Python - Size: 161 KB - Last synced: 13 days ago - Pushed: almost 5 years ago - Stars: 0 - Forks: 1

singhmnprt01/NLP-and-PyTorch

NLP use cases using popular solutions: frequency embeddings, word embeddings (word2vec, doc2vec, GloVe), RNN, LSTM, Transformers (BERT), Sentence Transformers, etc., in PyTorch

Language: Jupyter Notebook - Size: 8.34 MB - Last synced: 25 days ago - Pushed: 26 days ago - Stars: 2 - Forks: 1

machulsky61/Dream-Journal

Project for the subject Data Laboratories, done in Python, using Web Scraping techniques, curation of Data Frames, Data Visualization and Classification, Natural Language Processing and Regression Models.

Language: Jupyter Notebook - Size: 145 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 0 - Forks: 0

thiswillbeyourgithub/AnnA_Anki_neuronal_Appendix

Using machine learning on your anki collection to enhance the scheduling via semantic clustering and semantic similarity

Language: Python - Size: 3.55 MB - Last synced: 24 days ago - Pushed: 24 days ago - Stars: 57 - Forks: 1

vsoch/arxiv-equations

looking for patterns in equation use in arxiv papers

Language: Jupyter Notebook - Size: 144 MB - Last synced: about 1 month ago - Pushed: over 5 years ago - Stars: 2 - Forks: 1

tos-kamiya/d2vg 📦

A Doc2Vec grep. On your desktop.

Language: Python - Size: 11.8 MB - Last synced: about 1 month ago - Pushed: over 2 years ago - Stars: 3 - Forks: 0

dujm/Health_PrecisionMedicine

Personalized Medicine: Redefining Cancer Treatment

Language: Jupyter Notebook - Size: 2.21 MB - Last synced: about 1 month ago - Pushed: over 1 year ago - Stars: 0 - Forks: 2

ibrahimsharaf/doc2vec

:notebook: Long(er) text representation and classification using Doc2Vec embeddings

Language: Python - Size: 12.7 MB - Last synced: about 1 month ago - Pushed: almost 2 years ago - Stars: 104 - Forks: 42

Lab41/altair 📦

Assessing Source Code Semantic Similarity with Unsupervised Learning

Language: Python - Size: 189 MB - Last synced: about 2 months ago - Pushed: over 6 years ago - Stars: 41 - Forks: 14

Tixierae/deep_learning_NLP

Keras, PyTorch, and NumPy Implementations of Deep Learning Architectures for NLP

Language: Jupyter Notebook - Size: 105 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 435 - Forks: 106

natserract/natserract-ai

Using Doc2Vec, Langchain and OpenAI to chat with Natserract blog https://engineering-natserract.vercel.app/

Language: Python - Size: 7.79 MB - Last synced: about 1 month ago - Pushed: 7 months ago - Stars: 4 - Forks: 0

bnosac/doc2vec

Distributed Representations of Sentences and Documents

Language: C++ - Size: 3.2 MB - Last synced: 20 days ago - Pushed: over 2 years ago - Stars: 43 - Forks: 5

danielfrg/word2vec 📦

Python interface to Google word2vec

Language: C - Size: 6.42 MB - Last synced: about 2 months ago - Pushed: about 1 year ago - Stars: 2,548 - Forks: 630

kirs53/Atlas_of_linguistic_research

Language: HTML - Size: 2.05 MB - Last synced: 2 months ago - Pushed: 2 months ago - Stars: 0 - Forks: 0

wangz10/text-classification

A presentation/tutorial on text classification from the basics to advanced.

Language: Jupyter Notebook - Size: 3.09 MB - Last synced: 2 months ago - Pushed: almost 8 years ago - Stars: 1 - Forks: 0

sorodocosmin/feedbackHHC

This project focuses on analyzing patient feedback regarding the treatment provided by home healthcare service agencies.

Language: Python - Size: 5.52 MB - Last synced: 6 days ago - Pushed: 4 months ago - Stars: 1 - Forks: 0

arcoyk/ml

Personal machine learning working pieces to run as-is, edit, or integrate into a bigger piece, in order to learn machine learning techniques from scratch and then create your own ML recipe.

Language: Python - Size: 107 MB - Last synced: 2 months ago - Pushed: over 1 year ago - Stars: 0 - Forks: 1

bigdata-ustc/EduNLP

A library for advanced Natural Language Processing towards multi-modal educational items.

Language: Python - Size: 127 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 49 - Forks: 18

oToToT/Doc2VecC

GPU accelerated implementation for Doc2VecC

Language: Cuda - Size: 329 KB - Last synced: 3 months ago - Pushed: over 3 years ago - Stars: 1 - Forks: 0

faezeh-gholamrezaie/Vectorization-Techniques-tutorial

Vectorization Techniques in Natural Language Processing Tutorial for Deep Learning Researchers

Language: Jupyter Notebook - Size: 1.83 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 0 - Forks: 0

atlijas/citizens_document_clustering

Language: Python - Size: 9.3 MB - Last synced: 3 months ago - Pushed: over 4 years ago - Stars: 1 - Forks: 2

conditg/nlp-grantland

Natural Language Processing: Textual Analysis of Grantland content

Language: Jupyter Notebook - Size: 13.2 MB - Last synced: 3 months ago - Pushed: about 5 years ago - Stars: 1 - Forks: 2

dmeoli/OnlineRetail

Data Mining project 2020/2021 @ University of Pisa

Language: Jupyter Notebook - Size: 235 MB - Last synced: about 1 month ago - Pushed: over 3 years ago - Stars: 11 - Forks: 4

skaghzz/elasticsearch-korquad-cosineSimilarity-Search

Example of document similarity search with Elasticsearch - doc2vec

Language: Python - Size: 6.43 MB - Last synced: 3 months ago - Pushed: almost 3 years ago - Stars: 1 - Forks: 1

aakashjhawar/twitter-sentiment-analysis

Sentiment analysis of tweets to detect negative tweets.

Language: Jupyter Notebook - Size: 5.85 MB - Last synced: 13 days ago - Pushed: almost 5 years ago - Stars: 0 - Forks: 0

MohammedAly22/Semantify

A detailed comparison between 3 different techniques (TF-IDF, Doc2Vec, and Sentence Transformers) for performing semantic search on a huge dataset

Language: Jupyter Notebook - Size: 236 KB - Last synced: 4 months ago - Pushed: 4 months ago - Stars: 0 - Forks: 0

mtaruno/eve-bot

EVE bot, a customer service chatbot to enhance virtual engagement for Twitter Apple Support

Language: Jupyter Notebook - Size: 89.8 MB - Last synced: about 2 months ago - Pushed: over 1 year ago - Stars: 52 - Forks: 23

umutto/HyperparameterLogger

Simple logging wrapper for model hyperparameters from gensim.d2v, sklearn and keras.

Language: Python - Size: 13.7 KB - Last synced: 4 months ago - Pushed: about 6 years ago - Stars: 0 - Forks: 0

jagtapraj123/MealCheck

In this project we build a Docker-deployable adaptive recommendation engine for meals, using Flask, to maintain users' nutritional intake and variety in upcoming meals. We encode the preparation steps of recipes in a vector space to find similarities between recipes using a math formula. We also develop an interactive Android app for users to log daily meals.

Language: Java - Size: 4.55 MB - Last synced: about 2 months ago - Pushed: over 1 year ago - Stars: 1 - Forks: 0

markedg/wfmu-universe

Graph to show the relationships of WFMU DJs and playlists

Language: JavaScript - Size: 153 KB - Last synced: 5 months ago - Pushed: over 6 years ago - Stars: 3 - Forks: 0

searchisko/project-classifier-poc

Searchisko: A semantic search service over categorised content.

Language: Jupyter Notebook - Size: 258 MB - Last synced: about 2 months ago - Pushed: almost 6 years ago - Stars: 6 - Forks: 2

CodeCruncherDS/Sentiment-Analysis-using-Keras

Sentiment Analysis using Doc2vec.

Language: Jupyter Notebook - Size: 25.1 MB - Last synced: 5 months ago - Pushed: over 3 years ago - Stars: 1 - Forks: 0

nphdang/GE-FSG

Graph Embedding via Frequent Subgraphs

Language: Python - Size: 81.9 MB - Last synced: 3 months ago - Pushed: about 4 years ago - Stars: 42 - Forks: 8

yudukikun5120/kdb-doc2vec

Vectorizes course summaries from University of Tsukuba syllabus data with Doc2Vec and displays similar courses

Language: Python - Size: 18.6 KB - Last synced: about 2 months ago - Pushed: 5 months ago - Stars: 2 - Forks: 0

striderxrs/wordsimilarity

Short Machine Learning script using Python, Word2Vec and Doc2Vec to train a classifier on a dataset of job titles.

Language: Jupyter Notebook - Size: 5.25 MB - Last synced: 5 months ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

maxoodf/word2vec

word2vec++ is a Distributed Representations of Words (word2vec) library and tools implementation, written in C++11 from scratch

Language: C++ - Size: 118 KB - Last synced: 5 months ago - Pushed: 8 months ago - Stars: 122 - Forks: 22

Th3Tr00p3r/PrivacyPolicy

PPA breaks down privacy policies, aiming to simplify their understanding. By exploring data and using Doc2Vec modeling, it works toward clearer and more digestible policy insights.

Language: Python - Size: 7.87 MB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 1 - Forks: 0

anjali-tanna/cs4100_final_project

Addressing Political Bias in News Articles with Multinomial Regression

Language: Python - Size: 13.4 MB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 0 - Forks: 0

newsteps8/Text-Classification

Text Classifier for Turkish Text Data

Language: Jupyter Notebook - Size: 3.97 MB - Last synced: 6 months ago - Pushed: about 4 years ago - Stars: 0 - Forks: 0

Snow-White-Group/CSU-K-Toolkit

This is the CSU-K toolkit for spoken call shared task 2. It contains several scripts, models and other data.

Language: Python - Size: 26 MB - Last synced: 7 months ago - Pushed: almost 6 years ago - Stars: 3 - Forks: 0

ryazh3nka/sirius.ai

a case-study within the "sirius.AI" program

Language: Python - Size: 2.17 MB - Last synced: 5 months ago - Pushed: 5 months ago - Stars: 0 - Forks: 0

sudharsan13296/Hands-On-Deep-Learning-Algorithms-with-Python

Master Deep Learning Algorithms with Extensive Math by Implementing them using TensorFlow

Language: Jupyter Notebook - Size: 206 MB - Last synced: 7 months ago - Pushed: over 3 years ago - Stars: 314 - Forks: 203

inejc/paragraph-vectors

:page_facing_up: A PyTorch implementation of Paragraph Vectors (doc2vec).

Language: Python - Size: 821 KB - Last synced: 7 months ago - Pushed: over 1 year ago - Stars: 392 - Forks: 75

jolsfd/songbeamer-duplications

Detect similar songs in your songbeamer archive 🗃️ by using doc2vec

Language: Python - Size: 19.5 KB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 0 - Forks: 0

founchoo/doc2vec_server

A Python server for invoking a doc2vec method, which uses TensorFlow and a BERT model.

Language: Python - Size: 16.6 KB - Last synced: 8 months ago - Pushed: 8 months ago - Stars: 0 - Forks: 0

pranayjoshi/Medico

AI-powered medical terms detection tool.

Language: Python - Size: 79 MB - Last synced: 8 months ago - Pushed: over 2 years ago - Stars: 16 - Forks: 3

extremecode/stress-detection-in-social-networks

stress detection in social networks

Language: R - Size: 5.26 MB - Last synced: 8 months ago - Pushed: over 4 years ago - Stars: 2 - Forks: 2

sharmaroshan/Coursera-Reviews-Analysis

A Natural Language Processing problem in which we determine the sentiment of users who reviewed a course and then classify the reviews as positive or negative.

Language: Jupyter Notebook - Size: 20.1 MB - Last synced: 8 months ago - Pushed: about 5 years ago - Stars: 7 - Forks: 3

memento7/KINCluster

Korean Involute News Cluster; clusters things like KIN.

Language: Python - Size: 222 KB - Last synced: 8 months ago - Pushed: about 7 years ago - Stars: 2 - Forks: 2

tgll/word2vec_withfriends

word embedding with word2vec, doc2vec algorithms on friends tv show corpus/dataset

Language: Jupyter Notebook - Size: 5.73 MB - Last synced: 8 months ago - Pushed: over 4 years ago - Stars: 0 - Forks: 0

vanzytay/CIKM_CUP

CIKM Cup 2016 (1st Place) - Track 1 - Cross Device Entity Linking :smile:

Language: Python - Size: 72.3 KB - Last synced: 8 months ago - Pushed: over 6 years ago - Stars: 19 - Forks: 3

CLT29/semantic_neighborhoods

Preserving Semantic Neighborhoods for Robust Cross-modal Retrieval [ECCV 2020]

Language: Python - Size: 3.17 MB - Last synced: 8 months ago - Pushed: almost 4 years ago - Stars: 9 - Forks: 6

Y-B-Class-Projects/NLP_HEBREW_TF_IDF

Information Retrieval EX03.2+EX04 + EX05

Language: Python - Size: 130 MB - Last synced: 8 months ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0

Kiminjo/Research-area-extract-from-papers

The major research areas are derived using the paper data of the researchers at Seoul National University of Science and Technology. This project was carried out as part of a "Data and Business Innovation Lab" project.

Language: Python - Size: 1.13 GB - Last synced: 8 months ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0

safetyAI/Company_report_public

Anonymized report from one of Safety AI's consulting projects

Size: 14.7 MB - Last synced: 8 months ago - Pushed: about 4 years ago - Stars: 0 - Forks: 0

j-gavran/doc2vec_reports

Finding similar physics lab reports using doc2vec

Language: Python - Size: 267 KB - Last synced: 5 months ago - Pushed: 8 months ago - Stars: 0 - Forks: 0

aniass/Product-Categorization-NLP

Multi-Class Text Classification for products based on their description with Machine Learning algorithms and Neural Networks (MLP, CNN, Distilbert).

Language: Jupyter Notebook - Size: 14 MB - Last synced: 7 months ago - Pushed: 7 months ago - Stars: 69 - Forks: 23

breadfan/lyrics-based-songs-recommender

Recommender that uses song lyrics to improve the quality of its predictions

Language: Jupyter Notebook - Size: 12.7 KB - Last synced: 8 months ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 0

dahsie/spam_classification

This was my first NLP project, in which I built a spam detector using embedding algorithms to encode my texts. I used Random Forest and Multi-Layer Perceptrons for the classification phase, which yielded precision scores of 97% and 98% respectively. I also learned to document my code with Sphinx.

Language: Python - Size: 1.91 MB - Last synced: 4 months ago - Pushed: 8 months ago - Stars: 0 - Forks: 0

PengboLiu/Doc2Vec-Document-Similarity

Computing text similarity with Doc2Vec

Language: Python - Size: 20.5 KB - Last synced: 7 months ago - Pushed: about 6 years ago - Stars: 129 - Forks: 37

pixelneo/movie-database

Search for similar movies based on their wikipedia description

Language: Python - Size: 4.19 MB - Last synced: 10 months ago - Pushed: over 3 years ago - Stars: 0 - Forks: 0

ocseckin/sicss_vaccine_project

The aim of this work is to understand how frequency and sentiment of Covid-19 vaccines' nomenclatures changed over time.

Language: Jupyter Notebook - Size: 4.86 MB - Last synced: 10 months ago - Pushed: almost 3 years ago - Stars: 3 - Forks: 0

daandouwe/svd-doc2vec

Turn documents into vectors by decomposing a PPMI co-occurrence matrix.

Language: HTML - Size: 133 KB - Last synced: 10 months ago - Pushed: over 5 years ago - Stars: 0 - Forks: 0

ahmadj1801/Using-User-Reviews-to-Enrich-Social-Recommender-Systems

Using User Reviews to Enrich Social Recommender Systems

Language: Python - Size: 1.96 MB - Last synced: 10 months ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0

neurite/med-embeddings

Language: Jupyter Notebook - Size: 76.3 MB - Last synced: 10 months ago - Pushed: about 2 years ago - Stars: 0 - Forks: 0

alphagov/govuk-content-similarity 📦

Find similar GOV.UK content to a piece of text or content item

Language: Python - Size: 327 KB - Last synced: 2 months ago - Pushed: about 1 year ago - Stars: 1 - Forks: 3

shriaithal/AlternusVera Fork of aarsanjani/AlternusVera

Alternus Vera Project

Language: Jupyter Notebook - Size: 45.2 MB - Last synced: 10 months ago - Pushed: over 5 years ago - Stars: 0 - Forks: 0

akulez/Reddit_Sentiment_Analysis-Word2Vec_VS_Doc2Vec

Comparing Google's Word2Vec and Doc2Vec Embeddings for Predicting Reddit Sentiment using XGBoost algorithm.

Language: Jupyter Notebook - Size: 1.17 MB - Last synced: 10 months ago - Pushed: 10 months ago - Stars: 0 - Forks: 0

yutaolife/DL4d2v

Language: Python - Size: 111 MB - Last synced: 10 months ago - Pushed: over 6 years ago - Stars: 0 - Forks: 3

nicole1020/eric_bot

Language: Python - Size: 127 KB - Last synced: 10 months ago - Pushed: 10 months ago - Stars: 0 - Forks: 0

vinhqdang/wikipedia_analysis

Different techniques to measure the quality of Wikipedia

Language: Python - Size: 868 MB - Last synced: 10 months ago - Pushed: about 7 years ago - Stars: 8 - Forks: 13

sindbach/doc2vec_pymongo

Machine learning prediction of movies genres using Gensim's Doc2Vec and PyMongo - (Python, MongoDB)

Language: Python - Size: 879 KB - Last synced: 10 months ago - Pushed: over 1 year ago - Stars: 36 - Forks: 14

SeanFlannery/NAR-Data-Discovery

Nucleic Acids Research Data Discovery

Language: Jupyter Notebook - Size: 10.8 MB - Last synced: 10 months ago - Pushed: over 4 years ago - Stars: 0 - Forks: 0

WallaceLiu/word2vec_learn

word2vec_learn

Language: Jupyter Notebook - Size: 224 MB - Last synced: 10 months ago - Pushed: over 3 years ago - Stars: 0 - Forks: 0

PinPinIre/Final-Year-Project

Public repository for my final year project for Integrated Computer Science in Trinity College Dublin

Language: Python - Size: 107 KB - Last synced: 10 months ago - Pushed: over 7 years ago - Stars: 4 - Forks: 0

mehrgod/python

Doc2vec in Python using gensim

Language: Python - Size: 1000 Bytes - Last synced: 10 months ago - Pushed: almost 6 years ago - Stars: 0 - Forks: 0

chmodsss/word2vec_study

Study of word2vec and doc2vec on classifying emotions

Language: Python - Size: 1.36 MB - Last synced: 10 months ago - Pushed: over 7 years ago - Stars: 2 - Forks: 1

Elzawawy/DeftEval

Official contribution for DeftEval 2020, Task 6 Subtask 1 of the SemEval 2020 competition. Solves the NLP problem of "extracting term-definition pairs in free text" with multiple approaches, ranging from very simple to highly complex modern ones.

Language: Jupyter Notebook - Size: 350 KB - Last synced: 10 months ago - Pushed: almost 4 years ago - Stars: 10 - Forks: 1

shrebox/Natural-Language-Processing

Compilation of Natural Language Processing (NLP) codes. BONUS: Link to Information Retrieval (IR) codes compilation. (Check out the README.)

Language: Python - Size: 1.85 MB - Last synced: 11 months ago - Pushed: almost 3 years ago - Stars: 12 - Forks: 0

shrebox/Fake-News-Detection

This repository contains supervised fake news detection on LIAR dataset. Check out the analysis details for more details.

Language: Jupyter Notebook - Size: 713 KB - Last synced: 11 months ago - Pushed: almost 4 years ago - Stars: 14 - Forks: 0

jamesgmccarthy/Unsupervised-Early-Detection-of-Fake-News

Repository for my Master's thesis on the unsupervised early detection of fake news articles

Language: Python - Size: 67.4 KB - Last synced: 11 months ago - Pushed: over 4 years ago - Stars: 1 - Forks: 2

swigroup/semantic-learning

Semantic Classification of OERs with Word Embeddings

Language: Python - Size: 6.84 KB - Last synced: 11 months ago - Pushed: over 4 years ago - Stars: 0 - Forks: 0

t-hashiguchi1995/classification_character

Language: Python - Size: 33.2 KB - Last synced: 11 months ago - Pushed: about 6 years ago - Stars: 0 - Forks: 0

pakshuang/movie-reviews-topic-modelling

Group project for the NUS module "IT1244 Artificial Intelligence: Technology and Impact"

Language: Jupyter Notebook - Size: 3.7 MB - Last synced: 12 months ago - Pushed: 12 months ago - Stars: 0 - Forks: 0

Hamoon1987/TwitterTopicModeling

Topic modeling on tweets. Using doc2vec word embedding and k-means clustering to categorize tweets.

Language: Python - Size: 31.3 KB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 4 - Forks: 0

mizzle-toe/find-your-dream-job

Scraping, processing and analyzing job offers to help job seekers on their journey. Technologies used: Selenium, SQL, Word2Vec/Doc2vec, Google Cloud, Docker, FastAPI, Streamlit. Capstone project for Le Wagon Data Science Bootcamp.

Language: Jupyter Notebook - Size: 31.4 MB - Last synced: 6 months ago - Pushed: about 3 years ago - Stars: 4 - Forks: 2

Shraeyas/Plagiarism-Detection

Plagiarism detection between documents

Language: Jupyter Notebook - Size: 46.5 MB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 1 - Forks: 1

aditeyabaral/doc2sim

A simple command line utility to find similarity in content between documents using Doc2Vec.

Language: Python - Size: 1.53 MB - Last synced: about 1 year ago - Pushed: over 3 years ago - Stars: 0 - Forks: 2

nphdang/Sqn2Vec

Unsupervised Sequence Embedding via Sequential Patterns

Language: Python - Size: 3.2 MB - Last synced: 12 months ago - Pushed: over 5 years ago - Stars: 18 - Forks: 5

jananiarunachalam/Research-Paper-Summarization

Text Summarization for Research Papers

Language: Jupyter Notebook - Size: 170 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 53 - Forks: 17

zakarich/TF-IDF-cranDataSet

Language: Jupyter Notebook - Size: 695 KB - Last synced: about 1 year ago - Pushed: over 4 years ago - Stars: 2 - Forks: 0

HamedBabaei/PAN2019_bots_gender_profiling

PAN 2019, Bots and Gender Profiling Task

Language: Python - Size: 151 KB - Last synced: about 1 year ago - Pushed: almost 5 years ago - Stars: 3 - Forks: 0

lokicui/doc2vec-golang

doc2vec and word2vec, implemented in Go. Word embedding representation.

Language: Go - Size: 4.12 MB - Last synced: 10 months ago - Pushed: about 6 years ago - Stars: 34 - Forks: 10

grvnair/nlp-techniques

Built a Spam Classifier using different vectorization techniques and algorithms.

Language: Jupyter Notebook - Size: 455 KB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0

elifftosunn/Bert-Bank-Model

It is a Turkish BERT-based model that will analyze people's bank complaints and classify them according to one of eight categories. #Acikhack2023

Language: Jupyter Notebook - Size: 5.23 MB - Last synced: over 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 1

haresrv/Case-Study_Fake-Product-Review-Monitoring-and-Removal

Case Study from CSE380

Language: Jupyter Notebook - Size: 46 MB - Last synced: over 1 year ago - Pushed: about 4 years ago - Stars: 3 - Forks: 1

maxamin/Unsupervised-Learning-Sequence-Embeddings-via-Sequential-Patterns

latest update

Language: Python - Size: 41.4 MB - Last synced: over 1 year ago - Pushed: over 2 years ago - Stars: 7 - Forks: 3

YuzhouPeng/Linedin-User-profile-Hybrid-Recommendation

Hybrid recommendation for talents

Language: Python - Size: 162 KB - Last synced: over 1 year ago - Pushed: about 5 years ago - Stars: 18 - Forks: 6

MoinDalvs/Resume_Screening_and_Parser

Business objective: The document classification solution should significantly reduce the manual human effort in HRM. It should achieve a higher level of accuracy and automation with minimal human intervention. Sample data set details: resumes and financial documents.

Language: Jupyter Notebook - Size: 95.9 MB - Last synced: over 1 year ago - Pushed: over 1 year ago - Stars: 3 - Forks: 1

MiteshPuthran/Document_Classification

Python code for classification of documents into different classes using machine learning

Language: Jupyter Notebook - Size: 89.1 MB - Last synced: over 1 year ago - Pushed: about 5 years ago - Stars: 12 - Forks: 6

Related Keywords
doc2vec (214), word2vec (77), nlp (63), python (57), machine-learning (53), gensim (35), natural-language-processing (22), tf-idf (20), deep-learning (14), nlp-machine-learning (14), sentiment-analysis (13), nltk (13), topic-modeling (13), word-embeddings (12), python3 (12), text-classification (12), bert (11), scikit-learn (11), data-science (9), fasttext (9), pandas (9), pytorch (8), lda (8), keras (8), random-forest (8), recommender-system (7), tfidf (7), classification (7), tensorflow (7), logistic-regression (7), data-analysis (7), doc2vec-word2vec (6), neural-network (6), bag-of-words (6), clustering (6), xgboost (6), pca (5), bert-embeddings (5), embeddings (5), svm (5), elasticsearch (5), matplotlib (5), numpy (5), doc2vec-model (4), gensim-doc2vec (4), twitter (4), unsupervised-learning (4), word-embedding (4), sklearn (4), neural-networks (4), visualization (4), machine-learning-algorithms (4), flask (4), text-mining (4), recommendation-engine (4), ai (4), text-analysis (4), pipeline (4), vectorization (3), singlestore (3), naive-bayes-classifier (3), tokenization (3), jupyter-notebook (3), lemmatization (3), data-visualization (3), artificial-intelligence (3), supervised-learning (3), text-processing (3), huggingface-transformers (3), data-mining (3), lstm (3), arxiv (3), inverted-index (3), kmeans-clustering (3), cosine-similarity (3), information-retrieval (3), regression (3), spacy (3), crawler (3), glove-embeddings (3), infersent (3), recurrent-neural-networks (3), recommendation-system (3), latent-dirichlet-allocation (3), search-engine (3), sentence-embeddings (3), chatbot (3), memsql (3), similaridade (3), natural-language-understanding (3), gridsearchcv (2), glove (2), top2vec (2), keras-tensorflow (2), deep-neural-networks (2), item2vec (2), bert-model (2), graph-embedding (2), docker (2), similarity (2)