An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: doc2vec

sudharsan13296/Hands-On-Deep-Learning-Algorithms-with-Python

Master Deep Learning Algorithms with Extensive Math by Implementing them using TensorFlow

Language: Jupyter Notebook - Size: 206 MB - Last synced at: 20 days ago - Pushed at: over 4 years ago - Stars: 345 - Forks: 186

danielfrg/word2vec 📦

Python interface to Google word2vec

Language: C - Size: 6.42 MB - Last synced at: 30 days ago - Pushed at: about 2 years ago - Stars: 2,601 - Forks: 631

atsukoba/LabelEstimator

Simple Unsupervised Document Labeling with MeCab and Pretrained Doc2Vec Model and some experiments about `Doc2Vec.infer_vector()`

Language: Jupyter Notebook - Size: 583 KB - Last synced at: about 2 months ago - Pushed at: about 6 years ago - Stars: 5 - Forks: 0

jananiarunachalam/Research-Paper-Summarization

Text Summarization for Research Papers

Language: Jupyter Notebook - Size: 170 KB - Last synced at: 5 days ago - Pushed at: over 2 years ago - Stars: 75 - Forks: 19

ryogrid/anime-illust-image-searcher

Anime Style Illustration Specific Image Search App with ViT Tagger x BM25/Doc2Vec

Language: Python - Size: 346 KB - Last synced at: about 2 months ago - Pushed at: 4 months ago - Stars: 8 - Forks: 1

aniass/Product-Categorization-NLP

Multi-Class Text Classification for products based on their description with Machine Learning algorithms and Neural Networks (MLP, CNN, Distilbert).

Language: Jupyter Notebook - Size: 14.5 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 93 - Forks: 27

AdrianKlessa/SteamScout

Ensemble recommendation (recommender) system for finding similar games on Steam

Language: Python - Size: 376 KB - Last synced at: 15 days ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

DSXiangLi/Embedding

Embedding模型代码和学习笔记总结

Language: Python - Size: 135 MB - Last synced at: about 2 months ago - Pushed at: almost 4 years ago - Stars: 39 - Forks: 3

thiswillbeyourgithub/AnnA_Anki_neuronal_Appendix

Using machine learning on your anki collection to enhance the scheduling via semantic clustering and semantic similarity

Language: Python - Size: 3.89 MB - Last synced at: 2 months ago - Pushed at: 8 months ago - Stars: 64 - Forks: 1

pko89403/Recommender

Implementation of recommender ( Pytorch & Keras )

Language: Jupyter Notebook - Size: 67.9 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 8 - Forks: 0

davidhelguero/jfk_files

Collection of files related to JFK assassination investigation and conspiracy theories. Includes documents, photos, and transcripts for researchers and history enthusiasts.

Language: Python - Size: 5.19 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

PengboLiu/Doc2Vec-Document-Similarity

利用Doc2Vec计算文本相似度

Language: Python - Size: 20.5 KB - Last synced at: 20 days ago - Pushed at: about 7 years ago - Stars: 138 - Forks: 38

ibrahimsharaf/doc2vec

:notebook: Long(er) text representation and classification using Doc2Vec embeddings

Language: Python - Size: 12.7 MB - Last synced at: 2 months ago - Pushed at: 12 months ago - Stars: 107 - Forks: 42

andrewtavis/wikirec

Recommendation engine framework based on Wikipedia data

Language: Python - Size: 340 MB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 20 - Forks: 10

MiteshPuthran/Document_Classification

Python code for classification of documents into different classes using machine learning

Language: Jupyter Notebook - Size: 89.1 MB - Last synced at: 2 months ago - Pushed at: about 6 years ago - Stars: 28 - Forks: 8

Isa1asN/plagiarism-detector

Plagiarism detection for Amharic language text

Language: Jupyter Notebook - Size: 635 KB - Last synced at: 3 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

MStrzezon/Arxiv-Topic-Trend-Analysis

The comparative evaluation of topic modeling approaches, including LDA, BERTopic, Doc2Vec, and Top2Vec, highlights distinct strengths and optimal use cases for each model. In the following, we out- line their clustering performance, thematic insights, and practical applications

Language: Jupyter Notebook - Size: 97.1 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

bnosac/doc2vec

Distributed Representations of Sentences and Documents

Language: C++ - Size: 3.2 MB - Last synced at: 18 days ago - Pushed at: over 3 years ago - Stars: 48 - Forks: 7

Nikoletos-K/QA-with-SBERT-for-CORD19

⚕️🦠 Developed a document retrieval system to return titles of scientific papers containing the answer to a given user question based on the first version of the COVID-19 Open Research Dataset (CORD-19) ☣️🧬

Language: Jupyter Notebook - Size: 1.53 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 4 - Forks: 1

MoinDalvs/Resume_Screening_and_Parser

Business objective- The document classification solution should significantly reduce the manual human effort in the HRM. It should achieve a higher level of accuracy and automation with minimal human intervention Sample Data Set Details: Resumes and financial documents

Language: Jupyter Notebook - Size: 95.9 MB - Last synced at: about 2 months ago - Pushed at: over 2 years ago - Stars: 7 - Forks: 2

hank-chu/YouTube-View-Prediction-Using-Bi-LSTM

此專案利用自然語言處理 (NLP) 和雙向 LSTM (Bi-LSTM) 深度學習模型,利用 YouTube 影片的標題來預測影片的點閱數。這個專案旨在幫助內容創作者了解哪些影片可能會受歡迎,並提供一種有效的解決方案來預測影片的流量。

Language: Python - Size: 1.1 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

singhmnprt01/NLP-and-PyTorch

NLP use cases using popular solutions: Frequency Embeddings, Word embedding (word2vec, doc2vec, Glove), RNN,LSTM, Transformers-BERT, Sentence_Transformers etc. PyTorch

Language: Jupyter Notebook - Size: 12.1 MB - Last synced at: 2 months ago - Pushed at: 8 months ago - Stars: 2 - Forks: 1

IhabBendidi/sentiment_embeddings

A scientific benchmark and comparison of the performance of sentiment analysis models in NLP on small to medium datasets

Language: Jupyter Notebook - Size: 54 MB - Last synced at: 2 months ago - Pushed at: over 4 years ago - Stars: 13 - Forks: 3

LastStep/Data-Science-Projects

Various data science projects made during my internship at Social Prachar

Language: Jupyter Notebook - Size: 599 MB - Last synced at: 5 months ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0

nphdang/GE-FSG

Graph Embedding via Frequent Subgraphs

Language: Python - Size: 81.9 MB - Last synced at: 3 months ago - Pushed at: about 5 years ago - Stars: 44 - Forks: 8

wilianselzlein/S4

Sugestão de Solução de Salts na Sustentação

Language: Python - Size: 28.4 MB - Last synced at: 10 months ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

seer-lab/bug-severity-prediction

The Automatic Bug Traige (AutoBugTriage) tool allows for the prediction of bug severity at the beginning of the project by using NLP and an organization's historical data.

Language: Jupyter Notebook - Size: 1.11 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 2 - Forks: 0

anjali-tanna/cs4100_final_project

Addressing Political Bias in News Articles with Multinomial Regression

Language: Python - Size: 13.4 MB - Last synced at: 11 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

elifftosunn/Bert-Bank-Model

It is a Turkish BERT-based model that will analyze people's bank complaints and classify them according to one of eight categories.

Language: Jupyter Notebook - Size: 5.23 MB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

aarryasutar/Hate_Speech_Detection

This project aims to detect hate speech on Twitter using advanced NLP and machine learning techniques, exploring feature extraction methods like TF-IDF and sentiment analysis, and evaluating models such as Logistic Regression and SVM.

Language: Jupyter Notebook - Size: 1.83 MB - Last synced at: 12 days ago - Pushed at: 11 months ago - Stars: 1 - Forks: 0

aditeyabaral/doc2sim

A simple command line utility to find similarity in content between documents using Doc2Vec.

Language: Python - Size: 1.53 MB - Last synced at: 3 months ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 2

jjlatval/covid-19

My submission for Covid-19 challenges in Kaggle.

Language: Python - Size: 31.3 KB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

farvath/Resume-Parser-and-Analysis

This application is built for employers looking for candidates against a particular job description .

Language: Python - Size: 369 KB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

CodeNBucket/GraduationProject-MLAlgorithm

Senior Project codes, it will be mostly about natural language process and machine learning

Language: Python - Size: 353 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

iankurgarg/adbi-projects

Projects from the Course on "Algorithms for Data guided Business Intelligence"

Language: Python - Size: 22.5 MB - Last synced at: about 1 year ago - Pushed at: about 7 years ago - Stars: 4 - Forks: 12

lokicui/doc2vec-golang

doc2vec , word2vec, implemented by golang. word embedding representation

Language: Go - Size: 4.12 MB - Last synced at: 12 months ago - Pushed at: about 7 years ago - Stars: 39 - Forks: 11

bigdata-ustc/EduNLP

A library for advanced Natural Language Processing towards multi-modal educational items.

Language: Python - Size: 127 MB - Last synced at: 12 months ago - Pushed at: about 1 year ago - Stars: 50 - Forks: 18

stivenramireza/spark-text-mining

Big data processing of news with Text Mining in Apache Spark through 3 fundamental processes: data preparation, searching based on the inverted index and grouping of news by similarity.

Language: Python - Size: 161 KB - Last synced at: about 1 year ago - Pushed at: almost 6 years ago - Stars: 0 - Forks: 1

faezeh-gholamrezaie/Vectorization-Techniques-tutorial

Vectorization Techniques in Natural Language Processing Tutorial for Deep Learning Researchers

Language: Jupyter Notebook - Size: 1.83 MB - Last synced at: 26 days ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

MohamedAliHabib/easyLearn-Arabic-Text-Recommender-System

Building, Training and Testing Doc2Vec and Word2Vec (Skip-Gram) Model Using Gensim Library for Recommending Arabic Text.

Language: Jupyter Notebook - Size: 1.04 MB - Last synced at: 15 days ago - Pushed at: over 6 years ago - Stars: 9 - Forks: 3

machulsky61/Dream-Journal

Project for the subject Data Laboratories, done in Python, using Web Scraping techniques, curation of Data Frames, Data Visualization and Classification, Natural Language Processing and Regression Models.

Language: Jupyter Notebook - Size: 145 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

n-yU/shisho

Book search and management system for closed environment

Language: Python - Size: 5.67 MB - Last synced at: 5 days ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

tos-kamiya/d2vg 📦

A Doc2Vec grep. On your desktop.

Language: Python - Size: 11.8 MB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 0

Lab41/altair 📦

Assessing Source Code Semantic Similarity with Unsupervised Learning

Language: Python - Size: 189 MB - Last synced at: about 1 year ago - Pushed at: over 7 years ago - Stars: 41 - Forks: 14

papachristoumarios/sade

Code for paper: Software clusterings with vector semantics and the call graph

Language: Java - Size: 1.87 MB - Last synced at: 2 months ago - Pushed at: almost 3 years ago - Stars: 9 - Forks: 0

Tixierae/deep_learning_NLP

Keras, PyTorch, and NumPy Implementations of Deep Learning Architectures for NLP

Language: Jupyter Notebook - Size: 105 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 435 - Forks: 106

luizanisio/Doc2VecRapido

Classe responsável por simplificar o processo de criação de um modelo Doc2Vec (gensim) de forma simples, sem muita configuração. Dicas usando elasticsearch e singlestore.

Language: Python - Size: 7.77 MB - Last synced at: 3 days ago - Pushed at: almost 2 years ago - Stars: 5 - Forks: 0

natserract/natserract-ai

Using Doc2Vec, Langchain and OpenAI to chat with Natserract blog https://engineering-natserract.vercel.app/

Language: Python - Size: 7.79 MB - Last synced at: 2 months ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 0

kirs53/Atlas_of_linguistic_research

Language: HTML - Size: 2.05 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

wangz10/text-classification

A presentation/tutorial on text classification from the basics to advanced.

Language: Jupyter Notebook - Size: 3.09 MB - Last synced at: about 1 year ago - Pushed at: almost 9 years ago - Stars: 1 - Forks: 0

sorodocosmin/feedbackHHC

This project focuses on analyzing patient feedback regarding the treatment provided by home healthcare service agencies.

Language: Python - Size: 5.52 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

roboreport/doc2vec-api

document embedding and machine learning script for beginners

Language: Python - Size: 10.6 MB - Last synced at: 5 months ago - Pushed at: about 2 years ago - Stars: 92 - Forks: 36

arcoyk/ml

Personal machine learning working pieces to simply run, or edit, or integrate into a bigger piece, to learn machine learning techniques from scratch, then create own ML recipe.

Language: Python - Size: 107 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 1

atlijas/citizens_document_clustering

Language: Python - Size: 9.3 MB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 2

conditg/nlp-grantland

Natural Language Processing: Textual Analysis of Grantland content

Language: Jupyter Notebook - Size: 13.2 MB - Last synced at: over 1 year ago - Pushed at: about 6 years ago - Stars: 1 - Forks: 2

dmeoli/OnlineRetail

Data Mining project 2020/2021 @ University of Pisa

Language: Jupyter Notebook - Size: 235 MB - Last synced at: about 1 year ago - Pushed at: over 4 years ago - Stars: 11 - Forks: 4

skaghzz/elasticsearch-korquad-cosineSimilarity-Search

elasticsearch로 문서 유사도 검색 예제 - doc2vec

Language: Python - Size: 6.43 MB - Last synced at: over 1 year ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 1

aakashjhawar/twitter-sentiment-analysis

Sentiment analysis of tweets to detect negative tweets.

Language: Jupyter Notebook - Size: 5.85 MB - Last synced at: 3 months ago - Pushed at: almost 6 years ago - Stars: 0 - Forks: 0

MohammedAly22/Semantify

A detailed comparison between 3 different techniques (TF-IDF, Doc2Vec, and Sentence Transformers) for performing semantic search on a huge dataset

Language: Jupyter Notebook - Size: 236 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

jteijema/asreview-plugin-wide-doc2vec 📦

This plugin adds a new feature extractor based on doc2vec with a wider vector.

Language: Python - Size: 25.4 KB - Last synced at: 8 months ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

mtaruno/eve-bot

EVE bot, a customer service chatbot to enhance virtual engagement for Twitter Apple Support

Language: Jupyter Notebook - Size: 89.8 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 52 - Forks: 23

umutto/HyperparameterLogger

Simple logging wrapper for model hyperparameters from gensim.d2v, sklearn and keras.

Language: Python - Size: 13.7 KB - Last synced at: over 1 year ago - Pushed at: about 7 years ago - Stars: 0 - Forks: 0

ppontisso/Text-Search-Engine-using-Doc2Vec-and-TF-IDF

How to make a search engine using Doc2Vec and TF-IDF models

Language: Jupyter Notebook - Size: 18 MB - Last synced at: about 2 months ago - Pushed at: about 7 years ago - Stars: 15 - Forks: 3

jagtapraj123/MealCheck

In this project we build docker deployable adaptive recommendation engine for meals to maintain users’ nutritional intake and variety in upcoming meals using Flask. We encode preparation steps of recipes in vector space to find similarities between recipes using math formula. We develop interactive Android App for users to log daily meals.

Language: Java - Size: 4.55 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

searchisko/project-classifier-poc

Searchisko: A semantic search service over categorised content.

Language: Jupyter Notebook - Size: 258 MB - Last synced at: about 1 year ago - Pushed at: almost 7 years ago - Stars: 6 - Forks: 2

tomislijepcevic/medline_embedding

Language: Jupyter Notebook - Size: 21 MB - Last synced at: 6 months ago - Pushed at: over 7 years ago - Stars: 2 - Forks: 0

CodeCruncherDS/Sentiment-Analysis-using-Keras

Sentiment Analysis using Doc2vec.

Language: Jupyter Notebook - Size: 25.1 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

yudukikun5120/kdb-doc2vec

筑波大学シラバスデータを用いて科目概要を Doc2Vec でベクトル化し、類似した科目を表示する

Language: Python - Size: 18.6 KB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

striderxrs/wordsimilarity

Short Machine Learning script using Python, Word2Vec and Doc2Vec to train a classifier on a dataset of job titles.

Language: Jupyter Notebook - Size: 5.25 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

maxoodf/word2vec

word2vec++ is a Distributed Representations of Words (word2vec) library and tools implementation, written in C++11 from the scratch

Language: C++ - Size: 118 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 122 - Forks: 22

Th3Tr00p3r/PrivacyPolicy

PPA breaks down privacy policies, aiming to simplify their understanding. By exploring data and using Doc2Vec modeling, it works toward clearer and more digestible policy insights.

Language: Python - Size: 7.87 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

newsteps8/Text-Classification

Text Classifier for Turkish Text Data

Language: Jupyter Notebook - Size: 3.97 MB - Last synced at: over 1 year ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

Snow-White-Group/CSU-K-Toolkit

This is the CSU-K toolkit for spoken call shared task 2. It contains several scripts, models and other data.

Language: Python - Size: 26 MB - Last synced at: over 1 year ago - Pushed at: almost 7 years ago - Stars: 3 - Forks: 0

ryazh3nka/sirius.ai

a case-study within the "sirius.AI" program

Language: Python - Size: 2.17 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

inejc/paragraph-vectors

:page_facing_up: A PyTorch implementation of Paragraph Vectors (doc2vec).

Language: Python - Size: 821 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 392 - Forks: 75

luizanisio/Doc2VecFacil

Classe responsável por simplificar o processo de criação de um modelo Doc2Vec (gensim) com facilitadores para geração de um vocab personalizado e com a geração de arquivos de curadoria. Dicas usando elasticsearch e singlestore.

Language: Python - Size: 31.9 MB - Last synced at: 3 days ago - Pushed at: over 2 years ago - Stars: 5 - Forks: 1

jolsfd/songbeamer-duplications

Detect similar songs in your songbeamer archive 🗃️ by using doc2vec

Language: Python - Size: 19.5 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

jayfunc/doc2vec_server

A Python server to be invoked doc2vec method, which uses TensorFlow and BERT model.

Language: Python - Size: 16.6 KB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

pranayjoshi/Medico

AI-powered medical terms detection tool.

Language: Python - Size: 79 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 16 - Forks: 3

extremecode/stress-detection-in-social-networks

stress detection in social networks

Language: R - Size: 5.26 MB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 2 - Forks: 2

sharmaroshan/Coursera-Reviews-Analysis

It is a Natural Language Processing Problem where we have to decide the sentiments of the users who reviewed the course. and then classifying the reviews into positive and negative.

Language: Jupyter Notebook - Size: 20.1 MB - Last synced at: over 1 year ago - Pushed at: about 6 years ago - Stars: 7 - Forks: 3

memento7/KINCluster

Korean Involute News Cluster, KIN같은걸 클러스터링 합니다.

Language: Python - Size: 222 KB - Last synced at: over 1 year ago - Pushed at: about 8 years ago - Stars: 2 - Forks: 2

tgll/word2vec_withfriends

word embedding with word2vec, doc2vec algorithms on friends tv show corpus/dataset

Language: Jupyter Notebook - Size: 5.73 MB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

vanzytay/CIKM_CUP

CIKM Cup 2016 (1st Place) - Track 1 - Cross Device Entity Linking :smile:

Language: Python - Size: 72.3 KB - Last synced at: over 1 year ago - Pushed at: over 7 years ago - Stars: 19 - Forks: 3

CLT29/semantic_neighborhoods

Preserving Semantic Neighborhoods for Robust Cross-modal Retrieval [ECCV 2020]

Language: Python - Size: 3.17 MB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 9 - Forks: 6

Y-B-Class-Projects/NLP_HEBREW_TF_IDF

Information Retrieval EX03.2+EX04 + EX05

Language: Python - Size: 130 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

Kiminjo/Research-area-extract-from-papers

The major research areas are derived using the paper data of the researchers at Seoul National University of Science and Technology. This project was carried out as part of "Data and Business Innovation Lab."'s project.

Language: Python - Size: 1.13 GB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

safetyAI/Company_report_public

Anonymized report from one of Safety AI's consulting projects

Size: 14.7 MB - Last synced at: over 1 year ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

j-gavran/doc2vec_reports

Finding similar physics lab reports using doc2vec

Language: Python - Size: 267 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

breadfan/lyrics-based-songs-recommender

Recommender that uses song's lyrics to improve quality of the predictions

Language: Jupyter Notebook - Size: 12.7 KB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

dahsie/spam_classification

Ce fut mon prémier projet NLP où j'ai réalisé la détection de spam en utilisant les algorithmes d'embedding pour encorder mes textes. J'ai utilisé Random Forest et Milti-Layres Perceptrons pour la phase de classification. Ce qui a pemit l'obtension des précisions respective de 97% et 98%. J'ai aussi appris à documenter mes codes via sphinx

Language: Python - Size: 1.91 MB - Last synced at: 2 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

pixelneo/movie-database

Search for similar movies based on their wikipedia description

Language: Python - Size: 4.19 MB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

ocseckin/sicss_vaccine_project

The aim of this work is to understand how frequency and sentiment of Covid-19 vaccines' nomenclatures changed over time.

Language: Jupyter Notebook - Size: 4.86 MB - Last synced at: almost 2 years ago - Pushed at: almost 4 years ago - Stars: 3 - Forks: 0

ahmadj1801/Using-User-Reviews-to-Enrich-Social-Recommender-Systems

Using User Reviews to Enrich Social Recommender Systems

Language: Python - Size: 1.96 MB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

neurite/med-embeddings

Language: Jupyter Notebook - Size: 76.3 MB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

alphagov/govuk-content-similarity 📦

Find similar GOV.UK content to a piece of text or content item

Language: Python - Size: 327 KB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 3

shriaithal/AlternusVera Fork of aarsanjani/AlternusVera

Alternus Vera Project

Language: Jupyter Notebook - Size: 45.2 MB - Last synced at: almost 2 years ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0

akulez/Reddit_Sentiment_Analysis-Word2Vec_VS_Doc2Vec

Comparing Google's Word2Vec and Doc2Vec Embeddings for Predicting Reddit Sentiment using XGBoost algorithm.

Language: Jupyter Notebook - Size: 1.17 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

yutaolife/DL4d2v

Language: Python - Size: 111 MB - Last synced at: almost 2 years ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 3

nicole1020/eric_bot

Language: Python - Size: 127 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

Related Keywords
doc2vec 232 word2vec 82 nlp 69 python 61 machine-learning 56 gensim 41 natural-language-processing 30 tf-idf 21 deep-learning 16 nlp-machine-learning 15 nltk 15 topic-modeling 14 sentiment-analysis 13 word-embeddings 12 python3 12 scikit-learn 12 text-classification 12 bert 12 pandas 10 random-forest 10 data-science 9 recommender-system 9 lda 9 pytorch 9 fasttext 9 doc2vec-word2vec 8 logistic-regression 8 keras 8 classification 7 tensorflow 7 tfidf 7 data-analysis 7 numpy 7 flask 6 bag-of-words 6 matplotlib 6 neural-network 6 clustering 6 svm 6 xgboost 6 bert-embeddings 5 search-engine 5 sklearn 5 text-mining 5 embeddings 5 elasticsearch 5 pca 5 text-analysis 5 tokenization 4 docker 4 neural-networks 4 cosine-similarity 4 chatbot 4 visualization 4 twitter 4 ai 4 machine-learning-algorithms 4 huggingface-transformers 4 doc2vec-model 4 pipeline 4 unsupervised-learning 4 gensim-doc2vec 4 streamlit 4 recommendation-engine 4 word-embedding 4 regression 3 naive-bayes 3 similaridade 3 data-mining 3 sentence-embeddings 3 information-retrieval 3 big-data 3 text-processing 3 spacy 3 infersent 3 glove-embeddings 3 naive-bayes-classifier 3 supervised-learning 3 kmeans-clustering 3 top2vec 3 stopwords-removal 3 memsql 3 singlestore 3 glove 3 artificial-intelligence 3 jupyter-notebook 3 lstm 3 recurrent-neural-networks 3 lemmatization 3 data-visualization 3 transformer 3 vectorization 3 inverted-index 3 recommendation-system 3 crawler 3 natural-language-understanding 3 latent-dirichlet-allocation 3 arxiv 3 prediction 2 plotting 2