GitHub topics: data-preparation
chahelgupta/DEP-videogames-dataset
The data extraction and processing involved thorough exploration, preprocessing, and visualization of the "Video Game Sales with Ratings" dataset.
Language: Jupyter Notebook - Size: 2.11 MB - Last synced at: about 2 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

cevheryilmaz/Honey_Production_in_the_USA_in_Machine_Learning
Language: Jupyter Notebook - Size: 7.81 KB - Last synced at: about 1 year ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

DataRish/MBTI-Personality-Predictor
This project predicts MBTI personality types from users' recent 50 posts using NLP and ML techniques.
Language: Jupyter Notebook - Size: 24.3 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

Chan-dre-yi/industry-4.0-exploratory-data-analysis
An exploratory data analysis of an Industry 4.0 dataset uncovered insights indicating that Business Intelligence and IoT systems will have the greatest impact in the field over the next decade.
Language: MATLAB - Size: 1.32 MB - Last synced at: about 1 year ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

aditijoshi613/Brazilian-E-commerce-Analytics
Analytics for a leading Brazilian E-commerce firm, Olist Store
Size: 41.5 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 2

kakarot11/Logistic_Regression_NeuralNetwork
Multiple models for binary classification and checking the accuracy with each model.
Language: Jupyter Notebook - Size: 3.5 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

Arckitechttt/Data-Preprocessing-Projects
Perform Data Preprocessing including “Handling Missing Values”, “Handling Outliers”, “Handling Irrelevant Data”, “Handling Imbalanced Dataset”, “Handling Unstandardized Data”, and “Feature Selection based on Features Reduction algorithms and Features Correlation method”.
Language: Jupyter Notebook - Size: 212 KB - Last synced at: about 1 year ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

Ganeshkarwa/Diwali-Sales-Analysis-Project-
Diwali-Sales-Analysis-Project
Language: Jupyter Notebook - Size: 893 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

Ogefest/refinator-site
Public repo for refinator.xyz webstie. My new project, no-code tool to work with messy data
Language: HTML - Size: 7.73 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

dustin-decker/featuremill
general-purpose fast, stateless, and deterministic feature extractor written in golang for use in machine learning
Language: Go - Size: 64.5 KB - Last synced at: about 1 month ago - Pushed at: about 7 years ago - Stars: 12 - Forks: 0

brprojects/MLS_model
In this project I predict the 2016 MLS season using historical data and Poisson regression. The project includes cleaning, preprocessing and analyzing the dataset, building and evaluating predictive models for match outcomes, forecasting team performance and simulating the league table. It uses Pandas, Numpy, MatPlotLib and StatsModel libraries.
Language: Python - Size: 1.35 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

victorantoniassi/jr_analytics_engineer_practical_test
Minha resolução para um teste prático de uma vaga de Analytics Engineer Júnior
Language: Python - Size: 34 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

bodybuilders-team/ist-meic-cd-g03
Data Science project of group 03 - MEIC @ IST 2023/2024.
Language: Python - Size: 146 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

ArthurSrz/Introduction-aux-Interactions-Homme-Donn-es Fork of microsoft/Data-Science-For-Beginners
Un cours pour apprendre à construire des interactions homme-données
Language: Jupyter Notebook - Size: 79.3 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

piebro/simple-image-classification-labeling-website
A simple website to label images for classification locally.
Language: HTML - Size: 5.86 KB - Last synced at: about 2 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 1

lprtk/pyTCTK
Python Text Cleaning ToolKit library (pyTCTK)
Language: Python - Size: 21.5 KB - Last synced at: about 1 year ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 0

damaniayesh/KPMG_FORAGE_JOB_SIMULATIONS
The project describes the client on customer targeting with the Data, Analytics & Modelling team. Assessed data quality and completeness in preparation for analysis. The Analysed data to target high-value customers based on demographics and attributes
Language: Jupyter Notebook - Size: 6.45 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Cyrill98/Extract-Invoice-PDF-file-to-CSV
Language: Jupyter Notebook - Size: 266 KB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

ELToulemonde/dataPreparation
Data preparation for data science projects.
Language: R - Size: 5.18 MB - Last synced at: 8 days ago - Pushed at: almost 2 years ago - Stars: 31 - Forks: 10

ka00ri/sumIT
Computes the sum or difference of two digits, given two images and an operation to perform +/-.
Language: Python - Size: 14.6 KB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

dmandache/sleek-patch
Python 3 Package for optimally sampling big images with texture-aware patchification based on SLIC superpixels. So Sleek !
Language: Python - Size: 30.1 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

ArtemKornev0/Data_preparation-resume_analysis
Подготовка данных (анализ резюме из HeadHunter) / Data preparation (resume analysis from HeadHunter)
Language: Jupyter Notebook - Size: 2.37 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

oliverweissl/KnowledgeAndData-Project
Visualisation of Codon-useage for species in the NCBI Taxonomy.
Size: 55.9 MB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

OgeAno/HR--Employee-Turnover-Analysis
An analysis of employee turnover for a given 12-month period
Language: Jupyter Notebook - Size: 187 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

itzhak0estrella/Energy_Data_Analytics_GNN
Undergraduate research project that was funded by the ECE Next Program. Contributed with Professor Hao Zhu and with my grad. mentors Shaohui Liu and Young-ho Cho .
Language: Jupyter Notebook - Size: 1.27 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

AnvithaChaluvadi/Whale-Analysis_Module4Challenge
In this assignment, I'll get to use what I've learned this week to evaluate the performance among various algorithmic, hedge, and mutual fund portfolios and compare them against the S&P 500 Index.
Language: Jupyter Notebook - Size: 6.67 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

mpokojovy/COVID.LOS.prep
Time-to-Event Modeling for Hospital Length of Stay Prediction for COVID-19 Patients: Data Preparation
Language: R - Size: 7.81 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

FarhanaTeli/Factors-Influencing-US-Home-Prices
Using publicly available data for the national factors that impact supply and demand of homes in US, build a data science model to study the effect of these variables on home prices.
Language: Jupyter Notebook - Size: 4.08 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

SanaeSaccomano/Intelligence-Artificielle
Résumé de mes projets d'Intelligence artificielle
Language: Jupyter Notebook - Size: 2.89 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

parsa-abbasi/Data-Preparation-and-Visualization-in-Python
Data Preparation and Visualization in Python
Language: Jupyter Notebook - Size: 1.53 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 1

tirthgala/Automation-of-Operations-for-Zola
This repository contains my work on VBA macros while working in the e-commerce department of an Indian fashion brand called Zola.
Language: HTML - Size: 14.1 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

nisheethjaiswal/Data-Annotator-for-SpaCy
🚀SpAnnor annotator for Named Entity Recognition easy to use tool. The annotator allows users to quickly assign custom labels to one or more entities in the text. Easy to setup for Data Training for SpaCy 🔥.
Language: HTML - Size: 3.71 MB - Last synced at: 5 months ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 1

pablo14/data-science-live-book
An open source book to learn data science, data analysis and machine learning, suitable for all ages!
Language: TeX - Size: 58.4 MB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 215 - Forks: 106

georgezoto/Tableau-Advanced
Udemy's Tableau 10 Advanced Training: Master Tableau in Data Science. Harness the power of your data. Unleash the potential of your team. Learn data visualization through Tableau and create opportunities for you or key decision makers to discover data patterns such as customer purchase behavior, sales trends, or production bottlenecks.
Size: 742 KB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 3

nischaybikramthapa/Physical-Activity-Recognition
Can we predict what a person is doing based on their movements?
Language: Jupyter Notebook - Size: 23.4 MB - Last synced at: over 1 year ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

franc136/2022_Cyclistic_Case_Study
A case study analyzing 2022 bicycle rideshare data, to identify trends in rider behavior.
Size: 3.39 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

lemarigo/PortifolioProjects-Data-Prep-and-Machine-Learning
Folder contains python scripts and reports around Data Preparation and Machine Learning implementation.
Language: Jupyter Notebook - Size: 37.8 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

whwu95/MVFNet
【AAAI'2021】MVFNet: Multi-View Fusion Network for Efficient Video Recognition
Language: Python - Size: 20.3 MB - Last synced at: over 1 year ago - Pushed at: about 3 years ago - Stars: 136 - Forks: 12

vzhomeexperiments/R_selflearning
Developing self learning robot
Language: R - Size: 89 MB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 12 - Forks: 35

WaliUllahbaig/OCR-with-VisionEncoderDecoder-Model
The project focuses on building an OCR system using state-of-the-art deep learning models, specifically VisionEncoderDecoder models, which have demonstrated impressive performance in various computer vision tasks.
Language: Jupyter Notebook - Size: 386 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

WaliUllahbaig/Exploring-Hyperparameters-and-Weight-Initializations-in-Neural-Networks
This project delves into artificial neural networks, using Python and Keras, to build and analyze these networks. Neural networks are computational models inspired by the human brain, consisting of interconnected nodes (neurons) that process information.
Language: Jupyter Notebook - Size: 1.24 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

bayhaqy/Data-Preparation-Analysis-Mico
Simple Way to Data Preparation and Analysis with Miro
Language: Python - Size: 3.91 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

sdixit5/Analysis-of-Barack-Obama-s-Presidency
This Project compares the Effective Minimum Wage and Unemployment Rate statistically and Analytically at the start and end of former President Barack Obama's term.
Language: Jupyter Notebook - Size: 3.54 MB - Last synced at: over 1 year ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 0

nilot-pal/Membrane-permeability-using-ML
Source code for "Prediction of Membrane Permeability of Molecules Using Machine Learning"
Language: Python - Size: 4.42 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

OpsDataHub/data_analytics_portfolio
Portfolio containing projects to showcase data skills
Language: HTML - Size: 2.52 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 1

SamuelBarbosaDev/Roof_Imoveis_Data_Analysis
The company hired you because they want to know what would be the 5 properties they should invest in and why, and which 5 you would not recommend investing in at all.
Language: Jupyter Notebook - Size: 4.31 MB - Last synced at: about 1 month ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

SamuelBarbosaDev/Walrmart_Data_Analysis
You have been hired by Walmart to survey the revenue of their stores in the USA and point out which store would be best to expand its size. It is necessary to analyze the weekly sales of each store, calculate some important information that will be asked, and at the end of it all, indicate which store should be invested in.
Language: Jupyter Notebook - Size: 2.94 MB - Last synced at: about 1 month ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

kbelisar/datalark
Like the mudlark finding treasures on the foreshore, the datalark seeks treasures hidden within messy data!
Language: R - Size: 32.2 KB - Last synced at: 20 days ago - Pushed at: over 3 years ago - Stars: 4 - Forks: 0

martamanevska/Big-Data-Kaggle-Dataset-Project
Finding insights for further marketing decisions using dataset: from order status, price, payment and freight performance to customer location, product and reviews. According to the description of the dataset available on Kaggle, the collection of dataset used to develop the project refers to orders made at multiple marketplaces in Brazil.
Size: 20.1 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

noernimat/data_preparation_covid19_dataset
Data preparation covid19 dataset for Machine Learning Model
Language: Jupyter Notebook - Size: 5.45 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 1

dartwinshu/rakamin-digital-festival-data-science
Data Science course by Rakamin Academy
Language: Jupyter Notebook - Size: 448 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

dartwinshu/revou-mini-couse-data-analytics
Data analytics course by RevoU
Size: 7.81 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

ghulam-ahmad-1/Movie_Recommendation_system
Movie Recommendation System
Language: Jupyter Notebook - Size: 225 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

MiladNooraei/Quera-Superstore Fork of FarzanehSoltanzadeh/Quera-Superstore
Conducted data pre-processing, optimized data warehousing, applied statistical analysis and machine learning techniques, and created visually compelling Power BI visualizations to derive valuable insights for informed decision-making.
Language: Jupyter Notebook - Size: 22.8 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

E7su/hypno
Data analysis with pandas, numpy, scikit-learn
Language: Python - Size: 2.85 MB - Last synced at: almost 2 years ago - Pushed at: over 8 years ago - Stars: 1 - Forks: 0

viniavskyi-ostap/recommender_systems
Implementation of different approaches to recommendation on Amazon Review dataset
Language: Jupyter Notebook - Size: 7.28 MB - Last synced at: almost 2 years ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

halil/sau-ml
SAU Makine Öğrenmesi Eğitim İçerikleri
Language: Python - Size: 14.8 MB - Last synced at: almost 2 years ago - Pushed at: about 7 years ago - Stars: 13 - Forks: 3

neuro-ml/reskit
A library for creating and curating reproducible pipelines for scientific and industrial machine learning
Language: Jupyter Notebook - Size: 36.4 MB - Last synced at: 10 days ago - Pushed at: almost 8 years ago - Stars: 27 - Forks: 7

sadnanMohosin/Data-Science-Machine-Learning-Literacy
The purpose of this repository to learn the underlying theory and concept of DL/ML from data preparation to implementing prepared data to the models.
Size: 2.96 MB - Last synced at: almost 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

mymickiewicz/data-preprocessor
A data preprocessing tool for `MyMickiewicz`.
Language: TypeScript - Size: 40 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

kyaw-yethu/data-preparation-toolkit
A toolkit to help preparing data for machine learning projects
Language: Python - Size: 2.24 MB - Last synced at: 3 months ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

RodrigoSdeCarvalho/rsEasyML
Rust version of my machine learning framework that provides data preprocessing, feature selection, classification, regression and even more complex deep learning models, model persistence, autoencoders and anomaly detection
Language: Rust - Size: 3.91 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

CSFelix/Data-Science-Mental-Maps
🐍 Mental Maps Related to Contents in Data Science 🐍
Size: 51.8 KB - Last synced at: 5 days ago - Pushed at: over 2 years ago - Stars: 7 - Forks: 0

doratako/Data-Quality-Assurance
Data validation and data cleansing
Language: Jupyter Notebook - Size: 54.7 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

virchan/predictive_modeling_workflow
This project explores the predictive modeling workflow using the Kaggle competition "Titanic - Machine Learning from Disaster." It emphasizes key stages like data analysis and model evaluation, aiming to identify the optimal model. Through a real-world approach, we enhance our understanding of the workflow and emphasize rigorous model evaluation.
Language: Jupyter Notebook - Size: 2.48 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

LucienCastle/loan-delinquency-prediction
Predicts if a customer will delinquent using ML classification models
Language: Jupyter Notebook - Size: 6.28 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

lucoliv23/KC-Roasters-Classification-
Language: Jupyter Notebook - Size: 0 Bytes - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

lucoliv23/Celestial-Object-Detection
Language: Jupyter Notebook - Size: 1.16 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

lucoliv23/Genomic-Data-Clustering
Language: Jupyter Notebook - Size: 0 Bytes - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

carpentries-incubator/rna-seq-data-for-ml
RNA-Seq: Data Readiness for Machine Learning Applications
Language: R - Size: 47.5 MB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 2 - Forks: 2

ejay34/01_real_estate_market
Используя данные сервиса Яндекс.Недвижимость, определить рыночную стоимость объектов недвижимости и типичные параметры квартир
Language: Jupyter Notebook - Size: 581 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

ejay34/06_recovery_of_gold
На основании сырых данных с параметрами добычи и очистки золотоносной руды построить прототип модели для предсказания коэффициента восстановления золота из золотоносной руды с лучшей метрикой sMAPE.
Language: Jupyter Notebook - Size: 416 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

ejay34/05_location_for_the_well
На основании данных о геологоразведке построить модели прогноза запасов нефтяных скважин для регионов, выбрать регион для разработки с приемлемым порогом риска безубыточности и наиболее перспективными ресурсами.
Language: Jupyter Notebook - Size: 230 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

ejay34/04_churn_forecast
На основании данных о поведении клиентов построить модель с максимально большим значением F1 для задачи классификации, которая будет определять клиентов, склонных к оттоку.
Language: Jupyter Notebook - Size: 135 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

ejay34/03_recommendations_tariff_plan
На основании данных о поведении клиентов построить модель с максимально большим значением accuracy для задачи классификации, которая предложит подходящий тариф.
Language: Jupyter Notebook - Size: 11.7 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

ejay34/02_computer_games_sales
Используя исторические данные о продажах компьютерных игр, оценки пользователей и экспертов, жанры и платформы, выявить закономерности, определяющие успешность игры.
Language: Jupyter Notebook - Size: 386 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

NotTheStallion/Data_preparation_4_ML_algorithm
This project will focus on data preparation and will follow the steps : data cleaning, handling text and categorical attributes, and feature scaling.
Language: Jupyter Notebook - Size: 1.65 MB - Last synced at: 2 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

konradmalik/ann-laminar-burning-velocity 📦
Models trained in my article on LBV predictions.
Language: C - Size: 3.91 KB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 4 - Forks: 0

Benazir023/BookReviewAnalysis_efficient_workflow
This is a Dataquest project that focuses on creating an efficient workflow
Size: 318 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

MouhtaramSoufiane/Projets-Machine-Learning
this repository contains two projects : the first it s applying ML algorithm (Logistic regression) for classification on Titanic dataset From scratch and with use Sickit-Learn and the second for analyze this data : Understanding data - data preprocessing
Language: Jupyter Notebook - Size: 1.28 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

hrtnisri2016/celestial-bodies-database
This is one of the required projects to earn the Relational Databases certification from freeCodeCamp. For this project, I built a database of celestial bodies using PostgreSQL.
Language: Jupyter Notebook - Size: 729 KB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 4

alirezaniki/DPSA
A GUI-based seismic data processing and source analysis app leveraging KIWI tools and Pyrocko package.
Language: Shell - Size: 101 KB - Last synced at: almost 2 years ago - Pushed at: about 2 years ago - Stars: 3 - Forks: 0

mrsaraei/AutoPDPGLCM
Automated Pixel Data Preparation based on GLCM
Language: Python - Size: 6.81 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

mrsaraei/AutoDP
Automated Data Preparation Model for Machine Learning
Language: Python - Size: 235 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

AlecVail/Preparing_Data_Using_Alteryx
Alteryx Academy Challenge #363
Size: 41 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

Taiyo-ai/pt-mesh-pipeline
Use this template repository to write projects and tenders data ingestion pipelines
Language: Python - Size: 111 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 145

azeezat123/Bank-statement-Analysis
Documenting the data cleaning process on a bank statement dataset using the python libraries, NumPy and Pandas.
Language: Jupyter Notebook - Size: 13.7 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 5 - Forks: 0

GB49/GABB
R package giving tools for RDA and PCA analyses, from data preparation to the creation and visualization of nice graphics
Language: R - Size: 32.2 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

praathapj/UsedCarPriceCalc
End to End ML model with Deployment
Language: Jupyter Notebook - Size: 26.6 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

to-schi/ASR-Deepspeech2-Tensorflow
An end-to-end speech recognition engine similar to DeepSpeech2
Language: Jupyter Notebook - Size: 2.19 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

Arvindhh931/Mileage-prediction
Fuel Efficiency of car in miles per gallon
Language: Jupyter Notebook - Size: 3.05 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 2

Tanya-Tandon02/Retail-Data-Analysis
A Retail store is required to analyze the day-to-day transactions and keep a track of its customers spread across various locations along with their purchases/returns across various categories.
Language: Jupyter Notebook - Size: 719 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

Tanya-Tandon02/Cyber-Security
The objective of the project is to build network intrusion detection system to detect anomalies and attacks in the network.
Language: Jupyter Notebook - Size: 6.31 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

priya-explorer/Response_Model_for_Marketing_Campaign
Build a machine learning model to find target audience and important feature to maximize the ROI of the next marketing campaign
Language: Jupyter Notebook - Size: 1.53 MB - Last synced at: almost 2 years ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

Talend/data-prep 📦
OS code of Data-prep project
Language: Java - Size: 67.2 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 56 - Forks: 28

Bharat-Reddy/Bank-Marketing-Analysis
The data is related with direct marketing campaigns (phone calls) of a Portuguese banking institution. The classification goal is to predict if the client will subscribe a term deposit.
Language: Jupyter Notebook - Size: 2.34 MB - Last synced at: about 2 years ago - Pushed at: about 6 years ago - Stars: 12 - Forks: 9

mateuszdorobek/Machine-Learning-Classification
Project made for Advanced Methods in Machine Learning subject at MINI PW
Language: Python - Size: 3.69 MB - Last synced at: about 2 years ago - Pushed at: almost 6 years ago - Stars: 1 - Forks: 0

rmsandu/segmentation-eval
Extract and evaluate radiomics for liver cancer tumors from DICOM segmentation masks. Using SimpleITK, PyRadiomics and PyDicom.
Language: Python - Size: 1.44 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 13 - Forks: 5

DohaElHady/DatasetPreparation-LifeSaving-Scripts
This repository contains different random scripts for machine learning dataset preparations.
Language: Python - Size: 11.7 KB - Last synced at: almost 2 years ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

SagarGaniga/Data-Preprocessing
Data preprocessing is a data mining technique that involves transforming raw data into an understandable format.
Language: Jupyter Notebook - Size: 422 KB - Last synced at: almost 2 years ago - Pushed at: about 7 years ago - Stars: 15 - Forks: 21
