An open API service providing repository metadata for many open source software ecosystems.

Topic: "text-preprocessing"

adbar/trafilatura

Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML

Language: Python - Size: 33.8 MB - Last synced at: about 9 hours ago - Pushed at: about 1 month ago - Stars: 4,151 - Forks: 289

jbesomi/texthero

Text preprocessing, representation and visualization from zero to hero.

Language: Python - Size: 22.1 MB - Last synced at: 10 days ago - Pushed at: over 1 year ago - Stars: 2,904 - Forks: 240

jfilter/clean-text

🧹 Python package for text cleaning

Language: Python - Size: 157 KB - Last synced at: 7 days ago - Pushed at: almost 2 years ago - Stars: 975 - Forks: 79

lyeoni/prenlp

Preprocessing Library for Natural Language Processing

Language: Python - Size: 156 KB - Last synced at: 11 days ago - Pushed at: over 2 years ago - Stars: 161 - Forks: 12

berknology/text-preprocessing

A python package for text preprocessing task in natural language processing.

Language: Python - Size: 40 KB - Last synced at: 18 days ago - Pushed at: over 2 years ago - Stars: 63 - Forks: 7

ezgisubasi/turkish-tweets-sentiment-analysis

This sentiment analysis project determines whether the tweets posted in the Turkish language on Twitter are positive or negative.

Language: Jupyter Notebook - Size: 1.96 MB - Last synced at: 2 months ago - Pushed at: over 1 year ago - Stars: 60 - Forks: 14

CDSoft/panda

Panda is a Pandoc Lua filter that works on internal Pandoc's AST. Panda is heavily inspired by [abp](http:/cdelord.fr/abp) reimplemented as a Pandoc Lua filter.

Language: Lua - Size: 265 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 51 - Forks: 5

Lipairui/textgo

Text preprocessing, representation, similarity calculation, text search and classification. Let's go and play with text!

Language: Python - Size: 532 KB - Last synced at: 9 days ago - Pushed at: about 3 years ago - Stars: 44 - Forks: 2

ksnugroho/basic-text-preprocessing

Basic text preprocessing for Bahasa with Python.

Language: Jupyter Notebook - Size: 14.6 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 34 - Forks: 10

csebuetnlp/normalizer

This python module is an easy-to-use port of the text normalization used in the paper "Not low-resource anymore: Aligner ensembling, batch filtering, and new datasets for Bengali-English machine translation". It is intended to be used for normalizing / cleaning Bengali and English text.

Language: Python - Size: 15.6 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 28 - Forks: 5

jeongukjae/python-mecab 📦

A repository to bind mecab for Python 3.5+. Not using swig nor pybind. (Not Maintained Now)

Language: C++ - Size: 1.29 MB - Last synced at: about 2 months ago - Pushed at: almost 4 years ago - Stars: 28 - Forks: 7

fmpr/texttk

Text Preprocessing in Python

Language: Python - Size: 33.2 KB - Last synced at: about 2 years ago - Pushed at: over 8 years ago - Stars: 19 - Forks: 2

lanl/T-ELF

Tensor Extraction of Latent Features (T-ELF). Within T-ELF's arsenal are non-negative matrix and tensor factorization solutions, equipped with automatic model determination (also known as the estimation of latent factors - rank) for accurate data modeling. Our software suite encompasses cutting-edge data pre-processing and post-processing modules.

Language: Python - Size: 45.5 MB - Last synced at: 8 days ago - Pushed at: 18 days ago - Stars: 17 - Forks: 5

jangedoo/jange

Easy NLP in Python

Language: Python - Size: 2.06 MB - Last synced at: 12 days ago - Pushed at: over 3 years ago - Stars: 17 - Forks: 4

Ankur3107/nlp_preprocessing

Text Preprocessing Package includes cleaning, tokenization, dataset preparation ...etc

Language: JavaScript - Size: 5.19 MB - Last synced at: 8 months ago - Pushed at: over 4 years ago - Stars: 17 - Forks: 7

Abhishekmamidi123/100DaysOfMLCode

Learning Machine Learning and showcasing my work for 100 Days.

Language: Jupyter Notebook - Size: 2.81 MB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 16 - Forks: 6

venkat-0706/Sentimental-Analysis

Build a model to classify text as positive, negative, or neutral. Apply NLP techniques for preprocessing and machine learning for classification. Aim for accurate sentiment prediction on various text formats.

Language: Jupyter Notebook - Size: 280 KB - Last synced at: 8 days ago - Pushed at: 8 months ago - Stars: 13 - Forks: 2

CDSoft/ypp

Yet a PreProcessor

Language: Lua - Size: 191 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 12 - Forks: 1

alaradirik/TR-NLP-workshop

2020 Açık Seminer - Turkish NLP workshop

Language: Jupyter Notebook - Size: 8.43 MB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 12 - Forks: 3

VipinJain1/VIP-Machine-Learning-Exercises-and-Practices

VIP Machine Learning Exercises and Practices

Language: Jupyter Notebook - Size: 15.4 MB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 10 - Forks: 6

bademiya21/Topic-Modeling-with-Automated-Determination-of-the-Number-of-Topics

My version of topic modelling using Latent Dirichlet Allocation (LDA) which finds the best number of topics for a set of documents using ldatuning package which comes with different metrics

Language: R - Size: 272 KB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 10 - Forks: 0

ku-nlp/text-cleaning

A powerful text cleaner for Japanese web texts

Language: Python - Size: 56.6 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 9 - Forks: 2

VivekChoudhary77/Textify-text-Preprocessing

A text preprocessing web application

Language: HTML - Size: 1.03 MB - Last synced at: 6 days ago - Pushed at: about 3 years ago - Stars: 8 - Forks: 0

omar-sherif9992/Dialect-LLM-Bachelor-Project

The aim of the Bachelor project is to innovate a new way for Arabic (Egyptian-Dialect) Sentiment Analysis , Forecasting and Topic Modeling using Machine Learning , Deep Learning and Transformers!

Language: Jupyter Notebook - Size: 16.6 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 7 - Forks: 0

giocoal/reddit-tldr-summarizer-and-topic-modeling

Extreme Extractive Text Summarization and Topic Modeling (using LSA and LDA techniques) over Reddit Posts from TLDRHQ dataset.

Language: Python - Size: 52.5 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 7 - Forks: 1

SayamAlt/Resume-Classification-using-fine-tuned-BERT

Successfully developed a resume classification model which can accurately classify the resume of any person into its corresponding job with a tremendously high accuracy of more than 99%.

Language: Jupyter Notebook - Size: 1.19 MB - Last synced at: 19 days ago - Pushed at: over 2 years ago - Stars: 7 - Forks: 4

AndyTheFactory/article-extraction-dataset

Article title, authors, date and body extraction dataset.

Language: HTML - Size: 31.9 MB - Last synced at: 2 months ago - Pushed at: about 1 year ago - Stars: 6 - Forks: 1

byam/mnlp

MNLP: Mongolian Natural Language Processing.

Language: Jupyter Notebook - Size: 7.81 KB - Last synced at: 7 months ago - Pushed at: over 4 years ago - Stars: 6 - Forks: 4

AbeerAbuZayed/Hate-Speech-Detection_OSACT4-Workshop

Quick and Simple Approach for Detecting Hate Speech in Arabic Tweets.

Language: Jupyter Notebook - Size: 855 KB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 6 - Forks: 5

anshul1004/InformationRetrieval

Performs tokenization, stemming, lemmatization, index creation, index compression and ranked retrieval of Cranfield documents

Language: Python - Size: 1.99 MB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 6 - Forks: 1

khuyentran1401/Extract-text-from-article

Language: Jupyter Notebook - Size: 82 KB - Last synced at: 8 days ago - Pushed at: about 5 years ago - Stars: 6 - Forks: 0

praneetmehta/reSEARCH

Vector Space based Search Engine for Arxiv Research Publications

Language: Python - Size: 43.4 MB - Last synced at: over 1 year ago - Pushed at: about 7 years ago - Stars: 6 - Forks: 2

Ailln/proces

🐨 text preprocess.

Language: Python - Size: 42 KB - Last synced at: 24 days ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 0

SayamAlt/Language-Detection-using-fine-tuned-XLM-Roberta-Base-Transformer-Model

Successfully developed a language detection transformer model that can accurately recognize the language in which any given text is written.

Language: Jupyter Notebook - Size: 1.09 MB - Last synced at: 19 days ago - Pushed at: over 2 years ago - Stars: 5 - Forks: 4

vaitybharati/Assignment-11-Text-Mining-01-Elon-Musk

Assignment-11-Text-Mining-01-Elon-Musk, Perform sentimental analysis on the Elon-musk tweets (Exlon-musk.csv), Text Preprocessing: remove both the leading and the trailing characters, removes empty strings, because they are considered in Python as False, Joining the list into one string/text, Remove Twitter username handles from a given twitter text. (Removes @usernames), Again Joining the list into one string/text, Remove Punctuation, Remove https or url within text, Converting into Text Tokens, Tokenization, Remove Stopwords, Normalize the data, Stemming (Optional), Lemmatization, Feature Extraction, Using BoW CountVectorizer, CountVectorizer with N-grams (Bigrams & Trigrams), TF-IDF Vectorizer, Generate Word Cloud, Named Entity Recognition (NER), Emotion Mining - Sentiment Analysis.

Language: Jupyter Notebook - Size: 1.34 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 5 - Forks: 3

carrliitos/NLPInformationExtraction 📦

My 2020 project focusing on NLP - Information Extraction

Language: Python - Size: 115 MB - Last synced at: almost 2 years ago - Pushed at: about 4 years ago - Stars: 5 - Forks: 1

paul-pias/Text-Preprocessing-in-Bangla-and-English

Language: Python - Size: 29.3 KB - Last synced at: about 2 years ago - Pushed at: about 6 years ago - Stars: 5 - Forks: 4

chlaudiah/Sentiment-Classification-FD-Reviews

Text Classification for Sentiment Analysis using Female Daily's Reviews Dataset

Language: Jupyter Notebook - Size: 470 KB - Last synced at: over 1 year ago - Pushed at: about 6 years ago - Stars: 5 - Forks: 4

GyanPrakashkushwaha/MobileRecommenderSystem

Mobile Recommendation System (Recommendation using cosine-similarity)

Language: Jupyter Notebook - Size: 89.5 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 0

SayamAlt/Emotion-Detection-using-fine-tuned-BERT-Transformer

Successfully developed a fine-tuned BERT transformer model which can effectively perform emotion classification on any given piece of texts to identify a suitable human emotion based on semantic meaning of the text.

Language: Jupyter Notebook - Size: 971 KB - Last synced at: 19 days ago - Pushed at: over 2 years ago - Stars: 4 - Forks: 1

LeeTaylorLondon/Dissertation

My Undergraduate dissertation project.

Language: Jupyter Notebook - Size: 170 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 3 - Forks: 0

krisograbek/text-preprocessing

Text preprocessing in Python. Libs include string, re, nltk, spacy, gensim, textblob, unidecode, autocorrect, pyspellchecker

Language: Jupyter Notebook - Size: 81.1 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 1

splAcharya/Imdb_Reviews_Sentiment_Analysis

Sentiment Analysis of IMDB movie reviews using CLassical Machine Learning Algorithms, Ensemble of CLassical Machine Learning Algorithms and Deep Learning using Tensorflow Keras Framework.

Language: Jupyter Notebook - Size: 105 MB - Last synced at: 5 months ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 2

Isurie/Text-Classification-Module

Sinhala text extraction, preprocessing, and classification considering subject and domain.

Language: Jupyter Notebook - Size: 2.7 MB - Last synced at: 10 months ago - Pushed at: almost 4 years ago - Stars: 3 - Forks: 2

MusfiqDehan/data-preprocessors

🛠️An easy to use tool for Data Preprocessing specially for Text Preprocessing

Language: Python - Size: 193 KB - Last synced at: 5 days ago - Pushed at: 8 months ago - Stars: 2 - Forks: 1

mim-solutions/mim_nlp

A Python package with ready-to-use models for various NLP tasks and text preprocessing utilities. The implementation allows fine-tuning.

Language: Jupyter Notebook - Size: 413 KB - Last synced at: 30 days ago - Pushed at: 9 months ago - Stars: 2 - Forks: 0

prashver/movie-recommendation-system

This recommendation system employs content-based filtering and NLP preprocessing to suggest similar movies based on user preferences and movie data. It fetches movie posters via APIs and is deployed on Streamlit for easy access.

Language: Jupyter Notebook - Size: 1.97 MB - Last synced at: about 2 months ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 0

umapornp/textprepro

👀 Everything Everyway All At Once Text Preprocessing for Natural Language Processing.

Language: Python - Size: 1.3 MB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 0

olga1590/NLP_projects

NLP with Python

Language: Jupyter Notebook - Size: 8.45 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 0

SayamAlt/SMS-Spam-Classification-using-fine-tuned-RoBERTa-Base-Transformer

Successfully developed a fine-tuned RoBERTa transformer model which can almost perfectly classify whether any given SMS is spam or not.

Language: Jupyter Notebook - Size: 822 KB - Last synced at: 2 months ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

pri1311/TweeToxicity

TweeToxicity is a program that analyzes user profiles or hashtags based on recent tweets. The program utilizes machine learning to give Twitter users an appropriate score according to their tweets or retweets. This program is meant for educational purposes and no ill intetions existed prior to creating this program.

Language: Jupyter Notebook - Size: 36.6 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 1

hello-abyannaufal/Sentiment-Analysis-ID

Sentiment Analysis for Indonesian Language with probability 2 output, Bullying or Non-Bullying.

Language: Jupyter Notebook - Size: 161 KB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

imeghri-sami/vse

A search engine that helps you to perform a search inside a video

Language: Python - Size: 41.5 MB - Last synced at: almost 2 years ago - Pushed at: almost 3 years ago - Stars: 2 - Forks: 0

bhattbhavesh91/clean-text-demo

Tutorial on Clean-Text which is a Python package for text cleaning

Language: Jupyter Notebook - Size: 19.5 KB - Last synced at: about 1 month ago - Pushed at: almost 4 years ago - Stars: 2 - Forks: 1

bhattbhavesh91/texthero-demo

Tutorial to demonstrate the power of Texthero which is a library used for Text preprocessing, representation and visualization from zero to hero.

Language: Jupyter Notebook - Size: 511 KB - Last synced at: about 1 month ago - Pushed at: almost 4 years ago - Stars: 2 - Forks: 0

prince-makwana/Autotagging-of-Question-and-Answers-of-Stackoverflow-Questions

This is a multi label text classification problem. The task is to extract keywords from Stack overflow question and answers, i.e., the problem of auto-tagging of question and answers.

Language: Jupyter Notebook - Size: 946 KB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 2 - Forks: 0

Nourshosharah/introduction-to-natural-language-processing-in-python

my exercises of course natural language processing datacamp

Language: Jupyter Notebook - Size: 1.7 MB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 2 - Forks: 4

ashwin4glory/Quora-Question-Pair-Similarity

We have to build a machine learning model to predict whether two questions asked on quora are similar or not . So that the similar questions asked may have the same answers which have been given earlier for the previously asked similar question.

Language: Jupyter Notebook - Size: 5.34 MB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 2 - Forks: 0

Avinraj01/SHL-Grammar-Scoring-Engine-for-Voice-Samples

This model predicts grammar scores (1–5) from audio files. It uses Whisper to transcribe speech to text, cleans the text, and extracts features with TF-IDF. A Random Forest Regressor is trained to learn grammar score patterns. Evaluation via Pearson Correlation showed good results.

Language: Jupyter Notebook - Size: 40 KB - Last synced at: about 7 hours ago - Pushed at: about 8 hours ago - Stars: 1 - Forks: 0

ssciwr/mailcom

Recognize and pseudonymize named entities in emails

Language: Python - Size: 16 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 1 - Forks: 1

jhlopesalves/CorpusAid

Automated text preprocessing pipeline for large corpora. Features customizable filters for diacritics, stop words, punctuation, and regex.

Language: Python - Size: 1.33 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 1 - Forks: 0

pngo1997/N-gram-Language-Models

Builds N-gram language modes and applies text generation.

Language: Jupyter Notebook - Size: 4.73 MB - Last synced at: about 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

AjayKumar095/Natural_Language_Processing

Explore cutting-edge Natural Language Processing (NLP) techniques in this GitHub repository. Includes pre-trained models, custom NLP pipelines, text preprocessing tools, sentiment analysis, text classification, and more. Ideal for research, learning, and deploying NLP solutions in Python.

Language: Jupyter Notebook - Size: 2.32 MB - Last synced at: 12 days ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

DeepakMishra99/Natural_Language_Processing_Practice

Natural Language Processing

Language: Jupyter Notebook - Size: 354 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

adilrasheed139/AI-Powered-Resume-Screening-using-BERT

Successfully developed a resume classification model which can accurately classify the resume of any person into its corresponding job with a tremendously high accuracy of more than 99%.

Language: Jupyter Notebook - Size: 1.19 MB - Last synced at: 17 days ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

SayamAlt/Fake-News-Classification-using-fine-tuned-BERT

Successfully developed a text classification model to predict whether a given news text is fake or not by fine-tuning a pretrained BERT transformed model imported from Hugging Face.

Language: Jupyter Notebook - Size: 18 MB - Last synced at: 15 days ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

Jesly-Joji/Spam-Ham-Classifier

Used Naive Bayes Algorithm, NLP Text Preprocessing Techniques

Language: Jupyter Notebook - Size: 961 KB - Last synced at: 14 days ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

p208p2002/wikitext-table-parser

A WikiText table parser written in Rust.

Language: Rust - Size: 98.6 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 1 - Forks: 0

HannahIgboke/Sentiment-Analysis-of-Real-time-Flipkart-Product-Reviews

Integration of a trained sentiment classification model into a Flask web app for real-time inference on product reviews from Flipkart store.

Language: Jupyter Notebook - Size: 3.43 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 1 - Forks: 0

SayamAlt/Financial-News-Sentiment-Analysis

Successfully developed a fine-tuned DistilBERT transformer model which can accurately predict the overall sentiment of a piece of financial news up to an accuracy of nearly 81.5%.

Language: Jupyter Notebook - Size: 745 KB - Last synced at: 2 months ago - Pushed at: 12 months ago - Stars: 1 - Forks: 0

shinho123/23.08-23.12-KOREA-BASIC-SCIENCE-INSTITUTE-

기관 협업 공동 연구(23.08~12) - 한국기초과학지원연구원(Korea Basic Science Institute)

Language: Jupyter Notebook - Size: 15.7 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 1 - Forks: 0

Muhammad-Sheraz-ds/Natural-Language-Processing

This repository serves as a comprehensive resource for learning and implementing Natural Language Processing (NLP) techniques. The content is organized to provide an understanding of NLP challenges, real-world applications, and various approaches used to solve NLP use cases.

Language: Jupyter Notebook - Size: 93.6 MB - Last synced at: 12 months ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

Arbazkhan-cs/Movies-Recommend-System

Our innovative platform takes the guesswork out of choosing your next movie by offering personalized recommendations based on similar films you love.❤️

Language: Jupyter Notebook - Size: 9.58 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

KashifMoin1410/AmazonFoodReview

This ML binary classification model uses text handling techniques like stemming, lemmatisation, and stop word removal to classify reviews as positive or negative.

Language: Jupyter Notebook - Size: 86.9 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

amirezzati/Reviews-Sentiment-Analysis

Sentiment Analysis for Reviews

Language: Jupyter Notebook - Size: 441 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

SarangGami/Topic-modeling-on-News-Articles-Unsupervised-Learning

In this project, task involves analyzing the content of the articles to extract key concepts and themes that are discussed across the articles to identify major themes/topics across a collection of BBC news articles.

Language: Jupyter Notebook - Size: 7.26 MB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

SayamAlt/News-Category-Classification

Successfully developed a news category classification model using fine-tuned BERT which can accurately classify any news text into its respective category i.e. Politics, Business, Technology and Entertainment.

Language: Jupyter Notebook - Size: 3.69 MB - Last synced at: 2 months ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

Emeierkeio/harrypotter-textmining

🪄⚡️ Documentation, data and code used to do Text Processing, Text Representation, Topic Modeling and Text Summarization for 2022/23 Text Mining project @ University of Milano-Bicocca

Language: Jupyter Notebook - Size: 2.86 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

SayamAlt/Abstractive-Text-Summarization-of-News-Articles

Successfully developed an encoder-decoder based sequence to sequence (Seq2Seq) model which can summarize the entire text of an Indian news summary into a short paragraph with limited number of words.

Language: Jupyter Notebook - Size: 4.83 MB - Last synced at: 2 months ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 1

Salma-AZIZ/NLP_First_Steps_Python

Natural Language Processing First Steps with Python

Language: Jupyter Notebook - Size: 60.5 KB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

KelvinLam05/sentiment_analysis

Sentiment classification of shoppers' reviews using machine learning techniques.

Language: Jupyter Notebook - Size: 3.86 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

adhilcodes/Annvi-classifier

Malayalam Gender Classifier - Using Machine Learning to Predict Gender of Individuals using their name

Language: Jupyter Notebook - Size: 10.7 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 3

kelvinlim3/Twitter-Rumour-Detection

BERT model that classifies the source tweet of a Twitter thread as a rumour or non-rumour.

Language: Jupyter Notebook - Size: 27.9 MB - Last synced at: 4 months ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 0

nainiayoub/demystifying-nlp

demistifying nlp with a series of nlp implementation notebooks.

Language: Jupyter Notebook - Size: 4.49 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

vaitybharati/Assignment-11-Text-Mining-02-Amazon-Product-Reviews

NLP: Sentiment Analysis or Emotion Mining on Amazon Product Reviews - Part-1. Let’s learn the NLP techniques to perform Sentiment Analysis or Emotion Mining on extracted Product Reviews from Amazon. Part-1 covers Text preprocessing and Feature extraction, the next part covers Sentiment Analysis or Emotion Mining on text corpus. https://medium.com/@vaitybharati/nlp-sentiment-analysis-or-emotion-mining-on-amazon-product-reviews-part-1-428d43112027

Language: Jupyter Notebook - Size: 1.29 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 2

Shakilgithub20/Text-Preprocessing

Language: Jupyter Notebook - Size: 3.77 MB - Last synced at: about 2 months ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

copev313/Chatbot-Using-Deep-Learning

We build a chatbot by implementing machine learning and natural language processing.

Language: Jupyter Notebook - Size: 368 KB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

Abrar2652/Unintended-Bias-Toxicity-Detection-Project

This is the second project to be completed in Upskill ISA Intelligent Machines. The project was done after the end of the competition. The ensemble of BERT, GPT2, XLNet was used in this model that obtained 0.94656 private scores on Kaggle.

Language: Jupyter Notebook - Size: 49.8 KB - Last synced at: almost 2 years ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 0

aqaqsubin/Article-Summarizer

Generate an Abstractive summary of Article.

Language: Jupyter Notebook - Size: 874 MB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 1 - Forks: 1

itissandeep98/StackOverFlow-Tag-Predictor

develop a predictor that predicts tags for given questions using machine learning models like Naive Bayes, Logistic Regression, Decision Trees and SVM

Language: Jupyter Notebook - Size: 24.5 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 1

Andrews2017/kkltk

The Kinyarwanda and Kirundi Languages Toolkit (KKLTK) is a Python package for Kinyarwanda and Kirundi languages processing. KKLTK currently provides the sets of stopwords for both languages and other preprocessing tools such as Kinyarwanda and Kirundi tokenizers will be added soon. KKLTK requires Python 3.0, 3.5, 3.6, 3.7, or 3.8.

Language: Python - Size: 25.4 KB - Last synced at: 3 days ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 2

DedeBrahma/Basic-NLP

Basic Practical Guide of Natural Language Processing (NLP)

Language: Jupyter Notebook - Size: 61.5 KB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

amirali-saj/Persian-News-Information-Retrieval-System

An Information retrieval system for Persian news with ranked retrieval of documents according to relevance to the query.

Language: Python - Size: 76.1 MB - Last synced at: 11 months ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

ajpar94/flair-extra

A collection of NLP related scripts and notebooks for using the framework flair (https://github.com/flairNLP/flair)

Language: Jupyter Notebook - Size: 1.1 MB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 0

dqhuy140598/Learn-NLP

Language: Python - Size: 2.64 MB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 0

NatLee/On-the-Role-of-Text-Preprocessing-in-Neural-Network-Architectures-For-IMDB 📦

Unofficial code with the paper "On the Role of Text Preprocessing in Neural Network Architectures" for IMDb dataset.

Language: Python - Size: 23.9 MB - Last synced at: about 2 years ago - Pushed at: almost 6 years ago - Stars: 1 - Forks: 0

bademiya21/Supervised-Classification-of-Text-Categories

This repo describes a supervised approach to text classification using different features and classifiers. This, obviously, is good to use if there is labelled data available.

Language: Jupyter Notebook - Size: 215 KB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 0

young-zonglin/text-preprocessing

Preprocess text and transfer them into format used by language model.

Language: Python - Size: 37.1 KB - Last synced at: about 1 year ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 0

Ad-Chekk/EchoAI

Web Content Analyzer with LLMs is a powerful tool for scraping, processing, and analyzing web content using advanced Machine Learning (ML) and Natural Language Processing (NLP) techniques. It leverages state-of-the-art models such as RoBERTa for extractive question answering, BART for summarization, and various other NLP models for tasks like senti

Language: Python - Size: 37.1 KB - Last synced at: about 7 hours ago - Pushed at: about 8 hours ago - Stars: 0 - Forks: 0

Raj-UtsaV/IMDB_Movies_Review

"A sentiment analysis project using IMDb movie reviews with NLP and machine learning techniques to classify reviews as positive or negative."

Language: Jupyter Notebook - Size: 108 MB - Last synced at: about 7 hours ago - Pushed at: about 8 hours ago - Stars: 0 - Forks: 0

Related Topics
nlp 73 python 56 natural-language-processing 56 machine-learning 43 text-classification 39 sentiment-analysis 25 nltk 22 deep-learning 21 nlp-machine-learning 18 tf-idf 17 text-processing 17 text-cleaning 14 tokenization 14 bag-of-words 12 text-mining 11 text 10 text-tokenization 10 python3 10 model-evaluation 9 topic-modeling 9 spacy 9 data-science 9 lemmatization 9 pandas 9 stemming 8 exploratory-data-analysis 8 feature-engineering 8 model-training-and-evaluation 8 sklearn 7 text-analysis 7 fine-tuning-bert 7 text-summarization 7 data-visualization 7 named-entity-recognition 7 tfidf-vectorizer 7 regular-expression 7 cosine-similarity 7 logistic-regression 6 data-cleaning 6 scraping 6 wordcloud 6 matplotlib 6 model-inference 6 bert-model 6 word-embeddings 6 embeddings 6 count-vectorizer 6 tensorflow 6 classification 6 nltk-python 6 information-retrieval 6 lda 5 jupyter-notebook 5 data-analysis 5 tokenizer 5 word2vec 5 web-scraping 5 word-cloud 5 scikit-learn 5 text-generation 5 clustering 5 feature-extraction 5 text-representation 5 bert-fine-tuning 5 pos-tagging 4 multiclass-classification 4 porter-stemmer 4 bert-embeddings 4 hugging-face-transformers 4 tf-idf-vectorizer 4 distilbert-model 4 sentiment-classification 4 tfidf 4 keras 4 supervised-learning 4 preprocessing 4 text-clustering 4 dataset 4 eda 4 pytorch 4 n-grams 4 random-forest 4 text-vectorization 4 ngrams 4 naive-bayes-classifier 4 lstm 3 glove-embeddings 3 ner 3 java 3 dimensionality-reduction 3 summarization 3 data-preprocessing 3 arabic-nlp 3 glove 3 twitter 3 machine-learning-algorithms 3 streamlit 3 neural-networks 3 text-extraction 3 corpus-builder 3