An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: preprocessing-data

damaniayesh/Cognifyz_Internship_Tasks

The project contains four tasks assigned by Cognifyz Technology.

Language: Jupyter Notebook - Size: 1.14 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

jdenisova/user-churn-prediction

Machine learning project for solving a binary classification problem using logistic regression and gradient boosting

Language: Jupyter Notebook - Size: 130 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

iremhttp/DepressionDetection

Text-Based Depression Detection By Machine Learning

Language: Jupyter Notebook - Size: 10.2 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

chollette/SEDNet_Shallow-Encoder-Decoder-Network-for-Brain-Tumor-Segmentation

Official Implementation for SEDNet

Language: Jupyter Notebook - Size: 57.9 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Minarose/Resting-State-fMRI-Analysis

Some of the work I've done with resting-state fMRI

Language: Jupyter Notebook - Size: 119 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 1

Rubenmarbez/Proyecto-HomeFinder

HomeFinder aims to create a tool that lets users find the best offers suited to their needs and preferences, through the analysis of sales data for second-hand properties in Madrid.

Language: Jupyter Notebook - Size: 2.62 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

functorism/snapcrop

CLI for crop/resize of large amounts of images with configurable resolutions

Language: Rust - Size: 17.5 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

wasifijaz/Airbnb-Listings-Success-Classification

Airbnb Listings Success Label Classification

Language: Python - Size: 238 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Shipra-09/ML-Project-KNN-Classification

This GitHub repository contains projects related to KNN classification, exploring insights and inferences through EDA on the given project data (iPhone purchase and Bangalore house price).

Language: Jupyter Notebook - Size: 1010 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

BalajiN743/Multi-Linear-Regression-examples

Language: Jupyter Notebook - Size: 2.19 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

EslamElbassel/MNIST-Dataset-Classification-with-KNN-using-centroid-preprocessing

MNIST is a dataset of handwritten digit images. Classification with KNN, extracting features using centroids.

Language: Python - Size: 1.71 MB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 1 - Forks: 0

PankajVispute/I-phone-purchase-project--Prediction-with-KNN-Classification

Predicting whether a customer will purchase an iPhone using a KNN classifier and multiple other supervised ML models.

Language: Jupyter Notebook - Size: 598 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

bilaloumehdi/TP_NLP

Language: Jupyter Notebook - Size: 71.3 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

L98S/step-by-step-credit-card-approvals-prediction

This repository provides a step-by-step guide for predicting credit card approval using machine learning techniques.

Language: Jupyter Notebook - Size: 1.16 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

nourhenehanana/Big-Mart-Sales-Prediction

Build a predictive model that helps Big Mart (a retail chain) understand which properties of products and stores play a key role in increasing sales.

Language: Jupyter Notebook - Size: 376 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

mrsaraei/autoprep

Automated Data Preprocessing Python Package for CSV-based Clinical Data

Language: Python - Size: 23.9 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

lawl2/object-detection-and-spatial-relation

Language: Python - Size: 3.17 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

zaha2020/Machine_Learning

Machine Learning projects

Language: Jupyter Notebook - Size: 167 KB - Last synced at: almost 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

shahed-adnan/ML-House-Price-Prediction

The data contains information from the 1990 California census and is used in the second chapter of Aurélien Géron's book 'Hands-On Machine Learning with Scikit-Learn and TensorFlow'. The dataset is used to train and test machine learning models using regression and random forest.

Language: Jupyter Notebook - Size: 5.16 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Shalini210688/CustomerAnalysis

Customer Analysis

Language: Jupyter Notebook - Size: 21.5 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

sorrychoe/RBigKinds

BigKinds Data Analysis Toolkit for R

Language: R - Size: 12.4 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

ghulam-ahmad-1/Credit_Card_fraud_Detection

Credit Card Fraud Detection using RANDOM FOREST CLASSIFIER

Language: Jupyter Notebook - Size: 28.3 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

cecivieira/cotas-genero-eleicoes-e-proposicoes-legislativas

Data analysis of gender quotas and their impact on elections and legislative proposals in the Federal Chamber of Deputies between 1934 and 2021. Part of the final project (TCC) for the postgraduate program in Artificial Intelligence and Machine Learning at @pucminas

Language: Jupyter Notebook - Size: 121 MB - Last synced at: 3 months ago - Pushed at: about 3 years ago - Stars: 9 - Forks: 0

rafaelaqfc/Duplicate-Questions-Classifier

This is my first project on NLP algorithms and techniques to identify duplicate questions.

Language: Jupyter Notebook - Size: 21.2 MB - Last synced at: almost 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

boomalope/misc

Growing collection of scripts that manipulate text data.

Language: Python - Size: 11.7 KB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

NM001007/Suicidal_Ideation_Detection_Using_GAT_and_GCN

In this project, three different models based on GAT, GCN and SAGE have been implemented to examine their performance on two prominent social networking platforms, namely Twitter and Reddit.

Language: Python - Size: 18.5 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

AmestOsipyan/Portfolio_Data-Analytics

This repository contains a portfolio of data analyst projects that I have completed, showcasing my skills and experience.

Language: Jupyter Notebook - Size: 20.2 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

PedramPeiro/Customer-Health-Score-Prediction

This project was done for Didar CRM, a leading CRM company in Iran. The aim was to assign a health score to each customer in order to identify unhealthy customers and decrease the churn rate.

Language: Jupyter Notebook - Size: 1.21 MB - Last synced at: almost 2 years ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 0

cjean-pierre/Scoring_FastAPI

REST API for predicting default scores

Language: Python - Size: 7.61 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

m92vyas/Implementing_Attention_Mechanism_Language_Translation

Bahdanau Attention Mechanism | TensorFlow Custom Layers/Model/Loss Function/Metrics | LSTM | Encoder | Decoder | Cross-Attention | Language Translation | BLEU Score | Dropout

Language: Jupyter Notebook - Size: 48.4 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

tuanio/backend-recommender-system-book

Flask REST API for Recommender System Book App on Android

Language: Jupyter Notebook - Size: 1.14 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 7 - Forks: 1

tmard/Deep_Learning_Challenge

Non-profit foundation funding predictor using deep learning and neural networks.

Language: Jupyter Notebook - Size: 894 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

AnastasiaNehodova/credit_scoring

Borrower reliability study: analysis of banking data

Language: Jupyter Notebook - Size: 743 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

thien20/DSDV_project

Scrape manga from the web and preprocess the data

Language: Jupyter Notebook - Size: 0 Bytes - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

bharadwaj-chukkala/Data-driven-motion-planning-using-various-machine-learning-algorithms

ENPM808A: Introduction to Machine Learning Final Project

Language: Jupyter Notebook - Size: 4.52 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 3 - Forks: 0

angelicavelez/predict_the_amount_of_gold_mined

Model to predict the amount of gold extracted from gold ore.

Language: Jupyter Notebook - Size: 19.2 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

Shaheer-khan-github/Natural-Language-Processing-in-Python-DataCamp

Language: Jupyter Notebook - Size: 10.2 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

imyjk729/Memristor

In-sensor reservoir computing for language learning via two-dimensional memristors

Language: Jupyter Notebook - Size: 458 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 4 - Forks: 2

NgKhaiPhu/Data-preprocessing

Different methods of data preprocessing

Language: Jupyter Notebook - Size: 2.64 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

triyoza/Loan-Prediction-HCI

Final task in the Virtual Internship Experience Program: Data Scientist at Home Credit Indonesia

Language: Jupyter Notebook - Size: 4.29 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

Shakilgithub20/Improving-Classification

Language: Jupyter Notebook - Size: 3.76 MB - Last synced at: 4 months ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 0

knavoid/stock-reports-data (fork of LightOne-Capstone/reports_data)

Extract and analyze keywords in the stock reports

Size: 2.33 MB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

dwiputris/Data_Pre-processing_credit_scoring

In this project data pre-processing is employed to handle a dataset that is peppered with problems, like missing values, explicit duplicates, implicit duplicates, and numerous categories.

Language: Jupyter Notebook - Size: 977 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

markbader/prepro_split_by_time_signature

A MIDI preprocessing script to avoid time signature changes in data.

Language: Python - Size: 5.86 KB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

marti1999/Decision-Tree-Implementation

Creating a Decision Tree Classifier using Python

Language: Python - Size: 1.25 MB - Last synced at: almost 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

harshith20/nutrient_recogniser

Identify the name and nutrition facts of a vegetable or fruit by uploading a picture of it

Language: Jupyter Notebook - Size: 20.1 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

victorchendra02/Students-Performance-in-Exams

Dataset by Aman Chauhan from kaggle.com

Language: HTML - Size: 2.74 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

caesarmario/data-warehouse-credit-card-applicant-using-pentaho

This repository contains the OLTP, ETL process (using Pentaho Data Integration), and OLAP for a credit card dataset. The dataset is taken from Kaggle (https://www.kaggle.com/rikdifos/credit-card-approval-prediction) and is part of the author's Capstone Project.

Size: 1010 KB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 1 - Forks: 0

etetteh/production_ml

Language: Python - Size: 35.2 KB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

ShubhamAgr09/Chennai_Housing-Price_Pridiction

Regression model to predict house prices based on various proposed features and to help sellers understand which factors fetch more money for their houses.

Language: Jupyter Notebook - Size: 862 KB - Last synced at: 7 months ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 0

marcadeant/DWA

Drinking water access study with Tableau Software

Language: Jupyter Notebook - Size: 7.47 MB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

starkjones/House-Prices-Advanced-RegressionKaggle-Exercise

Predicting housing prices using feature engineering, XGBoost, and Sequential models

Language: Jupyter Notebook - Size: 2.71 MB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

marizombie/text-preprocessing-examples

Basic text preprocessing operations shown in a Jupyter notebook. You can play with them and see what they do. There are different options for stemming and lemmatization; I show only the ones I prefer to use. The repository contains the data to play with, taken from Kaggle (it can also be found here on GitHub), attached here for convenience.

Language: Jupyter Notebook - Size: 61.5 KB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

RuntimeTerror-Plotify/plotify

A web app that lets users visualize data and perform statistical operations in the machine learning and data science domain without writing any code.

Language: EJS - Size: 10.4 MB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 1

huiyi999/concordiacrawler

web crawler

Language: Python - Size: 42.5 MB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

ironymint/tl_preprocessor

Just add a few sample images and run transfer learning. It classifies tons of images like magic.

Language: Python - Size: 38.1 KB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

r-research/Honours_Final_Code

This is the repository for K S Rome's Honours Code in 2021.

Language: Jupyter Notebook - Size: 144 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

xxl4tomxu98/learn-you-from-text

This app predicts an author's sentiment and personality traits by analyzing simple text input that he or she writes. PyTorch neural network models optimized for the M1 MacBook.

Size: 179 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

abderrahman-bns/Data-Cleaning-and-Preprocessinng-with-Pandas

Introducing you to the fundamentals of the quintessential Python data analysis library, pandas, and its core data structures – the Series and DataFrame objects.

Language: Jupyter Notebook - Size: 604 KB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 2

Faroja/Practice-Machine-Learning-11

Machine learning practice with ensemble bagging models: detailed EDA, a preprocessing scheme, selecting the model with the best F1 score, hyperparameter tuning for the best models, and interpretation

Language: Jupyter Notebook - Size: 93.8 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

Navaneeth-Sharma/Speech_Recognition_of_Digits

This project recognizes spoken digits and converts them to text. It uses signal processing techniques such as MFCC, along with other advanced signal processing techniques, to preprocess the data. The preprocessed data is then used by neural network algorithms to learn the pattern or structure of the sound.

Language: Jupyter Notebook - Size: 1.6 MB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 3 - Forks: 0

adarshnitt/House-Prediction

Kaggle competition under the "30 Days of ML" programme by Alexis Cook

Language: Jupyter Notebook - Size: 6.08 MB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

rbsathish/keras-Datapreprocessing-Handling

This repository covers dataset preprocessing and handling.

Language: Python - Size: 97.7 KB - Last synced at: over 2 years ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

MaxBubblegum47/Preprocessing

Preprocessing method for Information Retrieval System

Language: Python - Size: 13.7 KB - Last synced at: 3 months ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

YassirMatrane/featureEngeneeringModules

Instead of hand-coding the preprocessing of data within a data science life cycle project, you can make use of these modules to automatically preprocess your data

Language: Jupyter Notebook - Size: 310 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

Jeffresh/wine-data-preprocessing

Preprocessing the wine data set using MATLAB/Octave

Language: Jupyter Notebook - Size: 147 KB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

Related Keywords
preprocessing-data (166), machine-learning (60), python (52), data-science (26), exploratory-data-analysis (20), data-visualization (19), preprocessing (18), pandas (17), data-analysis (17), machine-learning-algorithms (11), scikit-learn (11), numpy (11), feature-engineering (11), seaborn (9), logistic-regression (8), data (8), eda (8), dataset (8), classification (8), deep-learning (8), feature-selection (7), matplotlib (6), random-forest (6), clustering (6), tensorflow (6), artificial-intelligence (6), python3 (6), predictive-modeling (6), cleaning-data (6), nlp (6), datacleaning (5), random-forest-classifier (5), data-mining (5), jupyter-notebook (5), csv (5), linear-regression (5), powerbi (5), knn-classification (5), svm-classifier (4), data-engineering (4), sklearn (4), dimensionality-reduction (4), statistics (4), nltk-python (4), data-cleaning (4), neural-network (4), keras-tensorflow (4), sklearn-library (4), flask (3), regression-models (3), statistical-analysis (3), business-analytics (3), matplotlib-pyplot (3), preprocessor (3), svm-model (3), data-structures (3), r (3), machinelearning (3), streamlit (3), supervised-learning (3), numpy-library (3), neural-networks (3), sentiment-analysis (3), twitter (3), datascience (3), scikitlearn-machine-learning (3), decision-tree-classifier (3), keras (3), plotly (3), feature-extraction (3), nltk-library (3), hyperparameter-tuning (3), nlp-machine-learning (3), natural-language-processing (3), analysis (3), cnn-classification (2), standard-scaler (2), vizualization (2), wordcloud (2), critical-thinking (2), open-source (2), time-series (2), evaluation-metrics (2), svm (2), data-exploration (2), gradient-boosting (2), streamlit-webapp (2), preprocessing-techniques (2), tableau (2), hypothesis-testing (2), descision-tree (2), model-evaluation (2), jupyter (2), data-scraping (2), string-manipulation (2), string-formatter (2), standardization (2), standardscaler (2), joblib (2), vizualize-data (2)