GitHub topics: preprocessing-data
damaniayesh/Cognifyz_Internship_Tasks
The project contains four tasks assigned by Cognifyz Technology.
Language: Jupyter Notebook - Size: 1.14 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

jdenisova/user-churn-prediction
Machine learning project for solving a binary classification problem using logistic regression and gradient boosting
Language: Jupyter Notebook - Size: 130 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

iremhttp/DepressionDetection
Text-Based Depression Detection By Machine Learning
Language: Jupyter Notebook - Size: 10.2 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

chollette/SEDNet_Shallow-Encoder-Decoder-Network-for-Brain-Tumor-Segmentation
Official Implementation for SEDNet
Language: Jupyter Notebook - Size: 57.9 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Minarose/Resting-State-fMRI-Analysis
some of the work I've done with resting-state fMRI
Language: Jupyter Notebook - Size: 119 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 1

Rubenmarbez/Proyecto-HomeFinder
HomeFinder aims to create a tool that lets users find the best offers suited to their needs and preferences, through analysis of sales data for second-hand properties in Madrid.
Language: Jupyter Notebook - Size: 2.62 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

functorism/snapcrop
CLI for crop/resize of large amounts of images with configurable resolutions
Language: Rust - Size: 17.5 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

wasifijaz/Airbnb-Listings-Success-Classification
Airbnb Listings Success Label Classification
Language: Python - Size: 238 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Shipra-09/ML-Project-KNN-Classification
This GitHub repository contains projects related to KNN classification. It explores insights and inferences by performing EDA on the given project data (iPhone purchase and Bangalore house price).
Language: Jupyter Notebook - Size: 1010 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

BalajiN743/Multi-Linear-Regression-examples
Language: Jupyter Notebook - Size: 2.19 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

EslamElbassel/MNIST-Dataset-Classification-with-KNN-using-centroid-preprocessing
MNIST is a dataset of handwritten digit images. Classification with KNN, extracting features using centroids.
Language: Python - Size: 1.71 MB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 1 - Forks: 0

PankajVispute/I-phone-purchase-project--Prediction-with-KNN-Classification
Predicting whether a customer will purchase an iPhone using a KNN classifier model and multiple supervised ML models.
Language: Jupyter Notebook - Size: 598 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

bilaloumehdi/TP_NLP
Language: Jupyter Notebook - Size: 71.3 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

L98S/step-by-step-credit-card-approvals-prediction
This repository provides a step-by-step guide for predicting credit card approval using machine learning techniques.
Language: Jupyter Notebook - Size: 1.16 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

nourhenehanana/Big-Mart-Sales-Prediction
Build a predictive model that helps Big Mart (a retail chain) understand the properties of products and stores that play a key role in increasing sales.
Language: Jupyter Notebook - Size: 376 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

mrsaraei/autoprep
Automated Data Preprocessing Python Package for CSV-based Clinical Data
Language: Python - Size: 23.9 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

lawl2/object-detection-and-spatial-relation
Language: Python - Size: 3.17 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

zaha2020/Machine_Learning
Machine Learning projects
Language: Jupyter Notebook - Size: 167 KB - Last synced at: almost 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

shahed-adnan/ML-House-Price-Prediction
The data contains information from the 1990 California census and is used in the second chapter of Aurélien Géron's recent book 'Hands-On Machine Learning with Scikit-Learn and TensorFlow'. The dataset is used to train and test machine learning models using regression and random forest.
Language: Jupyter Notebook - Size: 5.16 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Shalini210688/CustomerAnalysis
Customer Analysis
Language: Jupyter Notebook - Size: 21.5 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

sorrychoe/RBigKinds
BigKinds Data Analysis Toolkit for R
Language: R - Size: 12.4 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

ghulam-ahmad-1/Credit_Card_fraud_Detection
Credit Card Fraud Detection using RANDOM FOREST CLASSIFIER
Language: Jupyter Notebook - Size: 28.3 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

cecivieira/cotas-genero-eleicoes-e-proposicoes-legislativas
Data analysis of gender quotas and their impact on elections and legislative proposals in the Federal Chamber of Deputies between 1934 and 2021. Part of the final thesis (TCC) for the postgraduate program in Artificial Intelligence and Machine Learning at @pucminas
Language: Jupyter Notebook - Size: 121 MB - Last synced at: 3 months ago - Pushed at: about 3 years ago - Stars: 9 - Forks: 0

rafaelaqfc/Duplicate-Questions-Classifier
This is my first project on NLP algorithms and techniques to identify duplicate questions.
Language: Jupyter Notebook - Size: 21.2 MB - Last synced at: almost 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

boomalope/misc
Growing collection of scripts that manipulate text data.
Language: Python - Size: 11.7 KB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

NM001007/Suicidal_Ideation_Detection_Using_GAT_and_GCN
In this project, three different models based on GAT, GCN and SAGE have been implemented to examine their performance on two prominent social networking platforms, namely Twitter and Reddit.
Language: Python - Size: 18.5 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

AmestOsipyan/Portfolio_Data-Analytics
This repository contains a portfolio of data analyst projects that I have completed, showcasing my skills and experience
Language: Jupyter Notebook - Size: 20.2 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

PedramPeiro/Customer-Health-Score-Prediction
This project was done for Didar CRM, a leading CRM company in Iran. The aim was to assign a Health Score to each customer in order to identify unhealthy customers and decrease the churn rate.
Language: Jupyter Notebook - Size: 1.21 MB - Last synced at: almost 2 years ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 0

cjean-pierre/Scoring_FastAPI
REST API for predicting default scores
Language: Python - Size: 7.61 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

m92vyas/Implementing_Attention_Mechanism_Language_Translation
Bahdanau Attention Mechanism | TensorFlow Custom Layers/Model/Loss Function/Metrics | LSTM | Encoder | Decoder | Cross-Attention | Language Translation | BLEU Score | Dropout
Language: Jupyter Notebook - Size: 48.4 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

tuanio/backend-recommender-system-book
Flask REST API for Recommender System Book App on Android
Language: Jupyter Notebook - Size: 1.14 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 7 - Forks: 1

tmard/Deep_Learning_Challenge
Non-profit foundation funding predictor using deep learning and neural networks.
Language: Jupyter Notebook - Size: 894 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

AnastasiaNehodova/credit_scoring
Borrower reliability study: analysis of banking data
Language: Jupyter Notebook - Size: 743 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

thien20/DSDV_project
Scrape manga from the web and preprocess that data
Language: Jupyter Notebook - Size: 0 Bytes - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

bharadwaj-chukkala/Data-driven-motion-planning-using-various-machine-learning-algorithms
ENPM808A: Introduction to Machine Learning Final Project
Language: Jupyter Notebook - Size: 4.52 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 3 - Forks: 0

angelicavelez/predict_the_amount_of_gold_mined
Model to predict the amount of gold extracted from gold ore.
Language: Jupyter Notebook - Size: 19.2 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

Shaheer-khan-github/Natural-Language-Processing-in-Python-DataCamp
Language: Jupyter Notebook - Size: 10.2 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

imyjk729/Memristor
In-sensor reservoir computing for language learning via two-dimensional memristors
Language: Jupyter Notebook - Size: 458 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 4 - Forks: 2

NgKhaiPhu/Data-preprocessing
Different methods of data preprocessing
Language: Jupyter Notebook - Size: 2.64 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

triyoza/Loan-Prediction-HCI
A final task in the Virtual Internship Experience Program: Data Scientist, Home Credit Indonesia
Language: Jupyter Notebook - Size: 4.29 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

Shakilgithub20/Improving-Classification
Language: Jupyter Notebook - Size: 3.76 MB - Last synced at: 4 months ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 0

knavoid/stock-reports-data (fork of LightOne-Capstone/reports_data)
Extract and analyze keywords in the stock reports
Size: 2.33 MB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

dwiputris/Data_Pre-processing_credit_scoring
In this project data pre-processing is employed to handle a dataset that is peppered with problems, like missing values, explicit duplicates, implicit duplicates, and numerous categories.
Language: Jupyter Notebook - Size: 977 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

markbader/prepro_split_by_time_signature
A MIDI preprocessing script to avoid time signature changes in data.
Language: Python - Size: 5.86 KB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

marti1999/Decision-Tree-Implementation
Creating a Decision Tree Classifier using Python
Language: Python - Size: 1.25 MB - Last synced at: almost 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

harshith20/nutrient_recogniser
Identify the name and nutrient facts of a vegetable or fruit by uploading a picture of it
Language: Jupyter Notebook - Size: 20.1 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

victorchendra02/Students-Performance-in-Exams
Dataset by Aman Chauhan from kaggle.com
Language: HTML - Size: 2.74 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

caesarmario/data-warehouse-credit-card-applicant-using-pentaho
This repository contains the OLTP, ETL process (using Pentaho Data Integration), and OLAP of a credit card dataset. The dataset is taken from Kaggle (https://www.kaggle.com/rikdifos/credit-card-approval-prediction) and is part of the author's Capstone Project.
Size: 1010 KB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 1 - Forks: 0

etetteh/production_ml
Language: Python - Size: 35.2 KB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

ShubhamAgr09/Chennai_Housing-Price_Pridiction
Regression model to precisely predict the price of a house based on various proposed features, and to help sellers understand which factors fetch more money for their houses.
Language: Jupyter Notebook - Size: 862 KB - Last synced at: 7 months ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 0

marcadeant/DWA
Drinking water access study with Tableau Software
Language: Jupyter Notebook - Size: 7.47 MB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

starkjones/House-Prices-Advanced-RegressionKaggle-Exercise
Predicting housing prices using feature engineering, XGBoost, and Sequential models
Language: Jupyter Notebook - Size: 2.71 MB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

marizombie/text-preprocessing-examples
Basic text preprocessing operations shown in a Jupyter notebook. You can play with them and see what they do. There are different options for stemming and lemmatization; I show only the ones I prefer to use. The repository contains data to play with, taken from Kaggle (it can also be found here on GitHub), but I attach it here for convenience.
Language: Jupyter Notebook - Size: 61.5 KB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

RuntimeTerror-Plotify/plotify
A web app that lets users visualize data and perform statistical operations in the machine learning and data science domain without writing any code.
Language: EJS - Size: 10.4 MB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 1

huiyi999/concordiacrawler
web crawler
Language: Python - Size: 42.5 MB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

ironymint/tl_preprocessor
Just add a few sample images and run transfer learning. It classifies tons of images like magic.
Language: Python - Size: 38.1 KB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

r-research/Honours_Final_Code
This is the repository for K S Rome's Honours Code in 2021.
Language: Jupyter Notebook - Size: 144 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

xxl4tomxu98/learn-you-from-text
This app predicts an author's sentiment and personality traits by analyzing simple text input that he or she writes. PyTorch neural network models optimized for the M1 MacBook.
Size: 179 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

abderrahman-bns/Data-Cleaning-and-Preprocessinng-with-Pandas
Introducing you to the fundamentals of the quintessential Python data analysis library, pandas, and its core data structures – the Series and DataFrame objects.
Language: Jupyter Notebook - Size: 604 KB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 2

Faroja/Practice-Machine-Learning-11
Machine learning practice with ensemble bagging models: detailed EDA, a preprocessing scheme, selecting the model with the best F1 score, hyperparameter tuning for the best models, and interpretation
Language: Jupyter Notebook - Size: 93.8 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

Navaneeth-Sharma/Speech_Recognition_of_Digits
This project recognizes spoken digits and converts them to text. It uses signal processing techniques such as MFCC and other advanced signal processing techniques to preprocess the data. The preprocessed data is then used by neural network algorithms to learn the pattern or structure of the sound.
Language: Jupyter Notebook - Size: 1.6 MB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 3 - Forks: 0

adarshnitt/House-Prediction
Kaggle competition under the "30 Days of ML" programme by Alexis Cook
Language: Jupyter Notebook - Size: 6.08 MB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

rbsathish/keras-Datapreprocessing-Handling
In this repository you can find dataset preprocessing and handling.
Language: Python - Size: 97.7 KB - Last synced at: over 2 years ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

MaxBubblegum47/Preprocessing
Preprocessing method for an Information Retrieval System
Language: Python - Size: 13.7 KB - Last synced at: 3 months ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

YassirMatrane/featureEngeneeringModules
Instead of hand-coding the preprocessing of data within a data science life cycle project, you can make use of these modules to automatically preprocess your data
Language: Jupyter Notebook - Size: 310 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

Jeffresh/wine-data-preprocessing
Preprocessing the wine data set using MATLAB/Octave
Language: Jupyter Notebook - Size: 147 KB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0
