An open API service providing repository metadata for many open source software ecosystems.

Topic: "text-vectorization"

ContextLab/hypertools

A Python toolbox for gaining geometric insights into high-dimensional data

Language: Python - Size: 95.3 MB - Last synced at: 2 days ago - Pushed at: about 1 year ago - Stars: 1,843 - Forks: 161

mkearney/wactor

Word Factor Vectors

Language: R - Size: 378 KB - Last synced at: 10 days ago - Pushed at: over 5 years ago - Stars: 32 - Forks: 2

amansrivastava17/bns-short-text-similarity

📖 Use Bi-normal Separation to find document vectors which is used to compute similarity for shorter sentences.

Language: Python - Size: 5.86 KB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 24 - Forks: 3

Rishabbh-Sahu/information_retrieval

Given a document, identifying the closest documents within the list of documents using tf-idf matrix and cosine similarity

Language: Python - Size: 1.39 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 4 - Forks: 2

rtimbro185/syr_mads_ist736_text_mining

Syracuse University, Masters of Applied Data Science - IST 736 Text Mining

Language: Jupyter Notebook - Size: 75.4 MB - Last synced at: over 1 year ago - Pushed at: about 5 years ago - Stars: 3 - Forks: 0

Minku-Koo/Comment-Sentiment-Analysis

Comment Sentiment Analysis using Deep Learning

Language: Python - Size: 252 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 2

sergio11/headline_generation_lstm_transformers

Explore advanced neural networks for crafting captivating headlines! Compare LSTM 🔄 and Transformer 🔀 models through interactive notebooks 📓 and easy-to-use wrapper classes 🛠️. Ideal for content creators and data enthusiasts aiming to automate and enhance headline generation ✨.

Language: Jupyter Notebook - Size: 6.37 MB - Last synced at: 20 days ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

markiskorova/Machine-Learning-NLP-Predict-Author

Machine Learning & Natural Language Processing: Reads Classic Novels and Predicts the Author of a Phrase

Language: Python - Size: 3.49 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

dmitriitimoshenko/govectorize

This is a Go lib that helps converting a slice of strings into a slice of vectors

Language: Go - Size: 3.91 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0

vladimiralbrekhtccr/topic_modeling_top2vec_scientific-texts

A diploma project focused on vectorizing scientific texts using the Top2Vec algorithm, with the aim of analyzing thematic groups, identifying trends, and visualizing the dynamics of interest in various topics in the field of computer science.

Language: Jupyter Notebook - Size: 4.16 MB - Last synced at: 5 months ago - Pushed at: 10 months ago - Stars: 1 - Forks: 0

IanCarmona/Recommendation-Songs-Taylor-Swift

This program is a project carried out in the Natural Language Processing course, which is a Taylor Swift song recommender. It utilizes topics such as sentiment analysis in texts, text vectorization, and the removal of stopwords.

Language: Python - Size: 845 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

Vidhi1290/ScienceQA-Insights-Exploring-with-LLMs

Predictive Text Analysis project! This repository contains code for predicting answers to science exam questions using advanced natural language processing techniques. Check out the code and results!

Language: Jupyter Notebook - Size: 5.29 MB - Last synced at: 24 days ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

Chronos-Asteri/movie-recommender-system-v1

Using text-vectorization and similarity-based-matrix computation

Language: Jupyter Notebook - Size: 34.3 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

SarangGami/Topic-modeling-on-News-Articles-Unsupervised-Learning

In this project, task involves analyzing the content of the articles to extract key concepts and themes that are discussed across the articles to identify major themes/topics across a collection of BBC news articles.

Language: Jupyter Notebook - Size: 7.26 MB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

ni3choudhary/Toxic-Comment-Classifier

A DL project that helps in classifying Toxic Comment weather it is positive or not.

Language: Jupyter Notebook - Size: 766 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

nainiayoub/demystifying-nlp

demistifying nlp with a series of nlp implementation notebooks.

Language: Jupyter Notebook - Size: 4.49 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

singh-l/Clustering_Repo

Clustering text using text vectorization

Language: Jupyter Notebook - Size: 371 KB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

rosette-api-community/visualize-embeddings

A simple Python script for transforming a corpus of documents into text vectors suitable for visualization

Language: Python - Size: 6.84 KB - Last synced at: about 2 months ago - Pushed at: about 8 years ago - Stars: 1 - Forks: 1

ns-nexus/Movie-Recommender-System

Movie Recommender System leverages a content-based approach, suggesting films to users based on the attributes of movies they have previously enjoyed. By analyzing movie metadata such as genre, cast, director, keywords, etc., this project offers personalized recommendations aligned with users' cinematic tastes.

Language: Jupyter Notebook - Size: 19.7 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

nikhil1209ui/movie_recommender

Movie Recommender based on Content based filtering.

Language: Jupyter Notebook - Size: 24.4 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

Ganesh2409/Course-Recommendation-System

🚀 Course Recommendation System is a machine learning-powered web application designed to recommend similar courses from Coursera's vast dataset of over 3,000 courses. Built using Python, Scikit-learn, and Streamlit, the app preprocesses course data, applies text vectorization, and leverages cosine similarity to offer personalized recommendations.

Language: Jupyter Notebook - Size: 75.3 MB - Last synced at: about 2 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

vlada-pv/Prediction-Sociolinguistic-Data-Based-on-the-Diaries-Texts-of-the-Prozhito-Project

The repository contains notebooks created for collecting and preprocessing the corpus of diary entries and for experiments on creating models for predicting gender, age groups of authors and the time period of text creation.

Language: Jupyter Notebook - Size: 2.06 MB - Last synced at: about 1 month ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

thatdamncoder/whats-next-on-netflix

Content Based Movie Recommendation System | Python

Language: Jupyter Notebook - Size: 1.75 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

Vanaub22/Movie-Recommender-App

This is a Content-based Movie Recommendation App wherein the user can type in a particular movie that he/she has enjoyed and can get the names and posters of top 5 similar movies.

Language: Jupyter Notebook - Size: 1.06 MB - Last synced at: 12 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

NikosMav/FakeNews-Classification

In this notebook we analyze and classify news articles using machine learning techniques, including Logistic Regression, Naive Bayes, Support Vector Machines, and Random Forests. Explore text vectorization and NLP for accurate news categorization.

Language: Jupyter Notebook - Size: 2.87 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

SayamAlt/E-Commerce-Text-Classification

Successfully established a machine learning model that can accurately classify an e-commerce product into one of four categories, namely "Books", "Clothing & Accessories", "Household" and "Electronics", based on the product's description.

Language: Jupyter Notebook - Size: 10.6 MB - Last synced at: 2 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

kulwinderkk/recipe_recommender_nlp

This project is an unsupervised NLP-based recipe recommender system designed to provide personalized recipe suggestions. The system employs content-based filtering techniques, utilizing cosine similarity to measure the resemblance between user inputs and a database of recipes.

Language: Jupyter Notebook - Size: 1.66 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

avd1729/Movie-Reviews-Classification

IMDB movie review classification using neural network (text-vectorization v/s word-embeddings)

Language: Jupyter Notebook - Size: 207 KB - Last synced at: 2 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

rid17pawar/Sentiment-Analysis-Model-Experiments

Experiments in the field of Sentiment Analysis using ML Algorithms namely Logistic Regression, Naive Bayes along with tfidf, one hot encoding, bag of words vectorization. Different MLP and RNN models viz. LSTM, GRU, Bidirectional LSTM. Lastly, state of the art BERT model

Language: Jupyter Notebook - Size: 1.8 MB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 1

LorenzoRottigni/ML-text-vectorization

Machine Learning course of Piero Savastano 3: CountVectorizer

Language: Python - Size: 0 Bytes - Last synced at: about 2 months ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

ndgigliotti/ndg-tools

Data science and NLP tools developed for my own use.

Language: Python - Size: 16.1 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

jaunel/Content-Based---Movie-Recommender-System

This project is a Recommendation based project. Similar movies will be recommended based upon the content of the movie. I have used ML algorithm & BOW technique of NLP for our model. An interactive web page is also designed using streamlit library for next level user experience.

Language: Jupyter Notebook - Size: 9.77 MB - Last synced at: almost 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

aditya-bhatt-coder/Movie-Recommender-System

A dynamic Movie Recommender web app using ML text vectorization technique to suggest the user with similar kind of movies and their posters.

Language: Jupyter Notebook - Size: 5.9 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

alla-g/infosearch_hw

Homeworks and final project for Infosearch course

Language: Python - Size: 5.62 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

nykolai-d/nlp_tag_prediction

Tag prediction on Stack Overflow using TensorFlow Keras and Text Vectorization

Language: Jupyter Notebook - Size: 76.2 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

andreytsimbalov/News_Classification_and_Vectorization

Evaluation of the accuracy of vectorization and text classification methods

Language: Jupyter Notebook - Size: 258 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

fratambot/Sentiment_analysis_RNN

Performing sentiment analysis on movie reviews using RNN (LSTM) in keras

Language: Jupyter Notebook - Size: 69.2 MB - Last synced at: 3 days ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

dibya-pati/Naive-Bayes

Naive Bayes classifier with text parser and vectorization libs

Language: Python - Size: 4.98 MB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 0

KaavyaRekanar/Master_Theses

Text Classification of Legitimate and Rogue Online Privacy Policies: A manual analysis and an experimental procedure

Language: Java - Size: 2.29 MB - Last synced at: over 1 year ago - Pushed at: about 8 years ago - Stars: 0 - Forks: 0

Related Topics
machine-learning 11 nlp 11 python 9 natural-language-processing 6 bag-of-words 6 neural-networks 5 text-classification 5 cosine-similarity 5 tf-idf 4 word-embeddings 4 deep-learning 4 sentiment-analysis 4 topic-modeling 4 streamlit-webapp 4 text-preprocessing 4 keras 3 tensorflow 3 text 3 data-science 3 streamlit 3 sentiment-classification 3 naive-bayes 3 recommender-system 3 nltk 3 logistic-regression 3 tfidf-vectorizer 3 nlp-machine-learning 3 naive-bayes-classifier 3 model-training-and-evaluation 2 exploratory-data-analysis 2 python3 2 lstm-neural-networks 2 lstm 2 api 2 decision-trees 2 transformers 2 model-training 2 twitter-sentiment-analysis 2 visualization 2 model-comparison 1 lstm-model 1 embedding 1 web-hosting 1 computer-science 1 similar-patterns 1 similarity-search 1 keras-tensorflow 1 tag-prediction 1 content-based-recommendation 1 porter-stemmer 1 similarity-matrix 1 stop-word-removal 1 streamlit-application 1 gensim 1 latent-dirichlet-allocation 1 latent-semantic-analysis 1 spacy 1 tmdb-api 1 jupyter-notebook 1 categorize-data 1 clustering-text 1 knowledge 1 data-collection 1 deployments 1 feature-selection-and-engineering 1 model-building 1 rnn 1 tfidf 1 transformer-architecture 1 docker 1 recommendation-system 1 text-embedding 1 tsv 1 text-tokenization 1 author-profiling 1 bilstm 1 convol 1 convolutional-neural-networks 1 diary-entries 1 recurrent-neural-networks 1 sociolinguistics 1 tf-idf-vectorizer 1 data-visualization 1 data-wrangling 1 high-dimensional-data 1 time-series 1 text-generation 1 counter-vectorizer 1 r 1 r-package 1 rstats 1 text-processing 1 word-vectors 1 word2vec 1 interactive-visualizations 1 kaggle 1 kaggle-competition 1 multi-class-classification 1 predictive-text-analysis 1 random-forest-classifier 1