Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: stopwords-removal

yeshamittal/sentimentAnalysis

Language: Python - Size: 27.3 KB - Last synced: 26 days ago - Pushed: almost 7 years ago - Stars: 0 - Forks: 0

antononcube/Raku-Lingua-StopwordsISO

Raku package for stop words of different languages and stop words deletion. Provides corresponding CLI scripts.

Language: Raku - Size: 94.7 KB - Last synced: 28 days ago - Pushed: about 2 years ago - Stars: 1 - Forks: 1

Aalaa4444/Text_Processing-and-Unique_Word_Extraction_fromHTML

Extract text content from an HTML page, process it, and extract unique words from the processed text. This notebook utilizes various text processing techniques including cleaning, normalization, tokenization, lemmatization or stemming, and stop words removal.

Language: Jupyter Notebook - Size: 12.7 KB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 0 - Forks: 0

JanviSK/SMS-Spam-Detection-using-NLP

This project implements NLP and Classification models for Spam SMS detection

Language: Jupyter Notebook - Size: 242 KB - Last synced: about 2 months ago - Pushed: about 2 months ago - Stars: 0 - Forks: 0

ctomtom/Word.cloud

Language: Python - Size: 345 KB - Last synced: 2 months ago - Pushed: over 4 years ago - Stars: 0 - Forks: 0

Blitz464/Call-Transcript-classification

This was a hackathon project that I worked on for BestBuy around classifying the call transcripts using ML & NLP techniques

Language: Jupyter Notebook - Size: 10.7 KB - Last synced: 2 months ago - Pushed: 2 months ago - Stars: 0 - Forks: 0

juanantoniodelgado/StopWords

PHP StopWords removal library with support for multiple languages.

Language: PHP - Size: 146 KB - Last synced: 24 days ago - Pushed: about 1 month ago - Stars: 5 - Forks: 6

pictureinthenoise/gotstopwords

Python package that makes it easy to use stop words lists in Python projects.

Language: Python - Size: 299 KB - Last synced: 4 days ago - Pushed: 4 months ago - Stars: 1 - Forks: 0

nishi1612/Email-Spam-Classification-using-SVM

The uploaded codes help to classify emails into spam and non spam classes by using Support Vector Machine classifier.

Language: Python - Size: 463 KB - Last synced: 7 months ago - Pushed: almost 4 years ago - Stars: 25 - Forks: 13

kennedyCzar/NLP-PROJECT-BOOK-INSIGHTS-WITH-PLOTLY

Plotly-Dash NLP project. Document similarity measure using Latent Dirichlet Allocation, principal component analysis and finally follow with KMeans clustering. Project is completed with dynamic visual interaction.

Language: Python - Size: 171 MB - Last synced: 7 months ago - Pushed: over 1 year ago - Stars: 10 - Forks: 5

Kheem-Dh/Text-Data-Preprocessing-

Text Data Preprocessing

Language: Jupyter Notebook - Size: 15.6 KB - Last synced: 8 months ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

AqilaFadia/app-Restaurant-Review

This application serves as a powerful tool for categorizing restaurant reviews as either negative or positive. Its primary purpose is to provide restaurateurs and managers with an efficient means of evaluating customer feedback. By distinguishing between negative and positive comments, this app aims to enhance the quality of service.

Language: Jupyter Notebook - Size: 413 KB - Last synced: 6 months ago - Pushed: 6 months ago - Stars: 0 - Forks: 0

TawfiqAbuArrh/SearchEngine

An Implement of search Engine

Language: Java - Size: 18.6 KB - Last synced: 9 months ago - Pushed: over 6 years ago - Stars: 0 - Forks: 0

Foysal87/bn_nlp

Bangla NLP toolkit.

Language: Python - Size: 6.84 MB - Last synced: 8 months ago - Pushed: over 1 year ago - Stars: 7 - Forks: 9

anuragkr29/TweetAnalysis

Work with a set of Tweets about US airlines and examine their sentiment polarity.The aim is to learn to classify Tweets as either “positive”, “neutral”, or “negative” by using two classifiers and pipelines for pre-processing and model building.

Language: Scala - Size: 4.88 KB - Last synced: 9 months ago - Pushed: almost 5 years ago - Stars: 0 - Forks: 0

mattlyons0/Blackboard-Test-Grader

A Chrome Extension which enables automatic grading (currently using Porter's Stemmer and Stopword Removal) of (short) Free Response questions in Blackboard.

Language: JavaScript - Size: 1.25 MB - Last synced: 9 months ago - Pushed: over 6 years ago - Stars: 0 - Forks: 0

SeanFlannery/NAR-Data-Discovery

Nucleic Acids Research Data Discovery

Language: Jupyter Notebook - Size: 10.8 MB - Last synced: 10 months ago - Pushed: over 4 years ago - Stars: 0 - Forks: 0

kevinmastascusa/CORD_19_Research

"KZM COVID Informatics: A repository for data analysis and insight extraction from the CORD-19 dataset, focused on advancing our understanding of the COVID-19 pandemic."

Language: Jupyter Notebook - Size: 88.2 MB - Last synced: 8 months ago - Pushed: 8 months ago - Stars: 1 - Forks: 0

Naren-7701/DEVICE-RECOMMENDATION-SYSTEM

Device Recommendation System using Cosine Similarity. It will recommend Electronic Gadgets based on Similar Configuration.

Language: Jupyter Notebook - Size: 38.1 KB - Last synced: 12 months ago - Pushed: 12 months ago - Stars: 1 - Forks: 0

D4rkisek/Sentiment_Classification_NLP

NLP methods for distinguishing positive and negative reviews written about movies.

Language: Jupyter Notebook - Size: 0 Bytes - Last synced: 12 months ago - Pushed: 12 months ago - Stars: 0 - Forks: 0

eklem/stopword-trainer

A module for creating stopword lists for any language, based on a set of documents.

Language: JavaScript - Size: 5.96 MB - Last synced: 3 days ago - Pushed: 6 months ago - Stars: 14 - Forks: 0

UtkarshTiwari123/Information-Retrieval-System

The aim of the code is to present a solution for retrieving specific passages or paragraphs from documents along with the document names based on user queries.

Language: Jupyter Notebook - Size: 659 KB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0

mesc08/movie-reviews-sentiment-analysis

A MACHINE LEARNING PROJECT IMPLEMENTATION ON REAL LIFE EXAMPLE

Language: Jupyter Notebook - Size: 58.6 KB - Last synced: 10 months ago - Pushed: almost 4 years ago - Stars: 0 - Forks: 0

JoyeBright/stopwords_guilannlp

A python package to be used in removing stopwords in different languages.

Language: Python - Size: 89.8 KB - Last synced: 24 days ago - Pushed: about 1 year ago - Stars: 5 - Forks: 1

bryanchw/Traditional-Chinese-Stopwords-and-Punctuations-Library

Created a Python library specifically for Traditional Chinese stopwords and punctuations removal

Language: Python - Size: 43.9 KB - Last synced: about 1 year ago - Pushed: almost 2 years ago - Stars: 2 - Forks: 1

Shakiba-Alipour/Information-Retrieval-on-CISI

Implementation and evaluation an information retrieval system

Language: Jupyter Notebook - Size: 1.8 MB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 0 - Forks: 0

Balajirvp/Sentiment-Analysis-of-Movie-Reviews

Performed Sentiment Analysis of Movie reviews using Bag of Words and TF-IDF Vectorizers.

Language: Jupyter Notebook - Size: 501 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

manasik29/Named-Entity-Recognition-Emotion-Mining-on-Apple-reviews

Named Entity recognition and emotion mining on Apple Macbook reviews.

Language: Jupyter Notebook - Size: 3.04 MB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

chiaszu/youtube-comments-sentiment-analysis

YouTube Comments as a Corpus of Sentiment Analysis is the final project of DFLL672 Corpus Linguistics.

Language: Jupyter Notebook - Size: 860 KB - Last synced: about 1 year ago - Pushed: almost 2 years ago - Stars: 1 - Forks: 0

manasik29/Sentiment_Analysis_on_Elon_Musk_Tweets

Performed Sentiment Analysis on Elon Musk's Tweets. Extracting Positive or Negative Sentiment.

Language: Jupyter Notebook - Size: 442 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 1

irfandythalib/python-indonesia-stopwords-remover

This code is used to remove stopwords using Tala stopwords library for Indonesia. Very useful for text processing

Language: Jupyter Notebook - Size: 6.84 KB - Last synced: about 1 year ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 1

meng-ucalgary/ensf-612-assignment-2

An assignment on preprocessing of text including tokenization, stop word removal, noise reduction, and stemming

Language: HTML - Size: 84.3 MB - Last synced: about 1 year ago - Pushed: about 2 years ago - Stars: 0 - Forks: 0

devshashwat/Tweets-Vector-Space-Model

Using Vector Space Model in Simple Tweets Database with Custom Test Cases for COVID-19 related Misinformation Data.

Language: Java - Size: 4.56 MB - Last synced: 12 months ago - Pushed: about 2 years ago - Stars: 0 - Forks: 0

meng-ucalgary/ensf-612-assignment-1

An assignment on preprocessing of text including tokenization, stop word removal

Language: HTML - Size: 84.5 MB - Last synced: about 1 year ago - Pushed: about 2 years ago - Stars: 0 - Forks: 0

elmurod1202/survey-clustering

K-means clustering of texts (survey answers) using word-embeddings, finding optimal elbow-point, and averaging multiple-word expressions.

Language: Python - Size: 1.23 MB - Last synced: about 1 year ago - Pushed: about 2 years ago - Stars: 1 - Forks: 0

ananyaroy1011/Fake-News-Classification

Given the title of a fake news article A and the title of a coming news article B, program classifies B into agree, disagree, and unrelated.

Language: HTML - Size: 410 KB - Last synced: about 1 year ago - Pushed: almost 3 years ago - Stars: 1 - Forks: 0

Rajdeep2121/NLP-Fundamentals

Basics of Natural Language Processing

Language: Python - Size: 2.79 MB - Last synced: about 1 year ago - Pushed: almost 4 years ago - Stars: 0 - Forks: 0

share424/Android-Sastrawi

Android Sastrawi is a Natural Language Processing Toolkit for Bahasa Indonesia

Language: Kotlin - Size: 362 KB - Last synced: about 1 year ago - Pushed: about 4 years ago - Stars: 2 - Forks: 0

djuliar/ir_stemming

Simpel aplikasi untuk Tokenisasi, Stopword Removal, dan Stemming pada Information Retrieval dengan Codeigniter

Language: PHP - Size: 1.91 MB - Last synced: about 1 year ago - Pushed: over 4 years ago - Stars: 0 - Forks: 1

okzapradhana/stopword-analysis

Stopword Analysis on Text Mining - With dataset from Kaggle: https://www.kaggle.com/nltkdata/web-text-corpus

Language: Jupyter Notebook - Size: 1.71 MB - Last synced: about 1 year ago - Pushed: about 5 years ago - Stars: 1 - Forks: 0

Euno257/Sentimental-Analysis-on-Twitter-US-Airline-dataset

Language: Jupyter Notebook - Size: 1.84 MB - Last synced: about 1 year ago - Pushed: about 5 years ago - Stars: 0 - Forks: 0

anaustinbeing/spark-remove-stopwords

Prints contents of file after filtering out stopwords.

Language: Python - Size: 0 Bytes - Last synced: about 1 year ago - Pushed: about 5 years ago - Stars: 0 - Forks: 0

atul04/TopicClassificationChallenge

Long english text passages are given, a genuine topic is needed to be assigned to the particular text passage. After cleaning the dataset, features were learnt using thidf approach, Linear SVC is used to get the final prediction

Language: Python - Size: 15.8 MB - Last synced: about 1 year ago - Pushed: almost 2 years ago - Stars: 0 - Forks: 0

mraduldubey/bostonbombing

Getting started with Twitter data analysis.

Language: Jupyter Notebook - Size: 9.55 MB - Last synced: about 1 year ago - Pushed: almost 7 years ago - Stars: 0 - Forks: 1

Related Keywords
stopwords-removal 44 nlp 14 stemming 10 stopwords 9 tokenization 9 lemmatization 8 nltk 7 python 6 bag-of-words 5 nlp-machine-learning 5 information-retrieval 4 jupyter-notebook 4 stemmer 4 tokenizer 4 tf-idf 4 natural-language-processing 4 sentiment-analysis 4 logistic-regression 3 rdd 3 naive-bayes-classifier 3 beautifulsoup 3 dataset 3 text-processing 3 pyspark 3 preprocessing 3 cosine-similarity 3 machine-learning-algorithms 2 python3 2 vectorization 2 java 2 text 2 wordcloud 2 udf 2 mlp-classifier 2 sentiment-classification 2 tf-idf-vectorizer 2 text-mining 2 regular-expression 2 covid19-data 2 tfidf-vectorizer 2 tweets 2 spark 2 support-vector-machines 2 noise-reduction 2 text-classification 2 word-embeddings 2 pandas-dataframe 2 databricks 2 document-processing 1 spelling-correction 1 wildcard 1 baseline-model 1 ngrams 1 wordcount 1 cantonese 1 punctuation 1 traditional-chinese 1 cisi 1 normalization 1 fake-news-classification 1 recoomendation-system 1 csv 1 reasearch-papers 1 data-visualization 1 covid19 1 covid-19 1 web-crawler 1 research 1 nucleic-acids 1 gensim-doc2vec 1 gensim 1 doc2vec 1 clustering-analysis 1 clustering 1 beautifulsoup4 1 fishingsms 1 lemmetization 1 multi-layer-perceptron 1 multinomial-logistic-regression 1 naive-bayes 1 similarity-measures 1 tf-idf-features-num2words 1 nbviewer 1 wordnet 1 android 1 kotlin 1 sastrawi 1 mutual-information 1 zipfs-law 1 us-airline-dataset 1 classification 1 featureselect 1 linearsvc 1 sklearn-library 1 topic 1 ipynb 1 twitter-data-analysis 1 posting-list 1 tf 1 count-vectorizer 1