An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: preprocessing-data

LuisFelipePoma/Machine_Learning

Learning about the algorithms used in machine learning, along with techniques for training and testing models.

Language: Jupyter Notebook - Size: 17.3 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 2 - Forks: 0

Mohammed061/Transportation-and-logistics-Challenge

Analyzing logistics data to optimize shipment efficiency, reduce delays, and enhance supply chain visibility using Power BI. Insights include top routes, delays, supplier trends, and peak shipments.

Language: Jupyter Notebook - Size: 3.36 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

Multiomics-Analytics-Group/acore

Functionality to preprocess and analyse multi-omics data

Language: Python - Size: 2.74 MB - Last synced at: 2 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

JoseRuiz01/ChestXRayPneumoniaDetection

Pneumonia detection using Convolutional Neural Networks

Language: Jupyter Notebook - Size: 1.46 GB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 0 - Forks: 0

Naeem1144/segmentation-project

Customer Segmentation using Machine learning models for clustering analysis

Language: Jupyter Notebook - Size: 16.8 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 0 - Forks: 0

Lummy-A/montgomery-county-crime-analysis

Analysis of crime patterns in Montgomery County (2018-2022) using Python data science tools to identify trends, spatial hotspots, and temporal distributions across crime types. Includes visualizations and insights to inform prevention strategies.

Language: Jupyter Notebook - Size: 5.24 MB - Last synced at: 12 days ago - Pushed at: 13 days ago - Stars: 0 - Forks: 0

vanderschaarlab/hyperimpute

A framework for prototyping and benchmarking imputation methods

Language: Python - Size: 428 KB - Last synced at: about 11 hours ago - Pushed at: about 2 years ago - Stars: 183 - Forks: 14

ArthurMangussi/pymdatagen

A Python Library for the Generation of Artificial Missing Data

Language: Python - Size: 2.36 MB - Last synced at: 11 days ago - Pushed at: 3 months ago - Stars: 5 - Forks: 2

Tomaslopera/Fifa_Analysis

Language: Jupyter Notebook - Size: 8.58 MB - Last synced at: 25 days ago - Pushed at: 25 days ago - Stars: 0 - Forks: 0

ddihora1604/Advanced_Business_Analytics_on_World_Bank_Global_Financial_Inclusion_Data_2021

Bridging the Gaps in Financial Inclusion: Understanding the Cash-Credit Paradox, Divide between Cash and Digital Payments, and Financial Resilience.

Language: Jupyter Notebook - Size: 27 MB - Last synced at: 18 days ago - Pushed at: 27 days ago - Stars: 0 - Forks: 0

Shakilgithub20/News-Classification

Language: Jupyter Notebook - Size: 11 MB - Last synced at: 12 days ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 1

BHARGAVPRAVEEN-CHINTAPALLI/Uber-Trends-Analysis

UBER TREND ANALYSIS

Size: 2.33 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

DavidRichardson02/Standardized_CSV_Data_Analysis

Given the pathname of a file, it automates data extraction, statistical analysis, and modeling via MATLAB plotting scripts, facilitating a streamlined approach to handling analysis of datasets. This project provides a robust, standardized pipeline for reading, preprocessing, analyzing, and modeling data from CSV(or similarly delimited) files.

Language: C - Size: 2.88 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

courtois-neuromod/ds_prep

All the scripts to prepare the Courtois-Neuromod dataset

Language: Python - Size: 67.1 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 2 - Forks: 4

shellynagar27/Transportation-and-logistics-Challenge

Analyzing logistics data to optimize shipment efficiency, reduce delays, and enhance supply chain visibility using Power BI. Insights include top routes, delays, supplier trends, and peak shipments.

Language: Jupyter Notebook - Size: 3.38 MB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

sonjaove/ML-hands-on

repo for some hands on stuff

Language: Jupyter Notebook - Size: 135 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

r-a-j/Social-Scope

"SocialScope harnesses the power of data science to Instagram's vast content, providing insightful analytics and trend predictions for informed decision-making."

Language: SCSS - Size: 16.6 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 2 - Forks: 0

HoangLeminh17/Ranks-Prediction-for-LOL

A method to predict rankings based on performances of players for game League Of Legends

Language: Jupyter Notebook - Size: 10.9 MB - Last synced at: about 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

NavyaTrilok/Earthquake-Analysis-Dashboard

We have designed an Earthquake dashboard for Researchers, Emergency Response Teams and Educators studying earthquake patterns and trends.

Size: 4.88 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

Jingvu/Anime-Database-Preprocessing-R-Project

During the data preprocessing step, I identified three tasks that I believe are crucial and require careful attention: data transformation, handling outliers, and managing missing values. This repository serves as a resource to share what I've learned on these topics for anyone interested.

Size: 16.3 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

BirchKwok/spinesUtils

A library that provides template code for Python development to shorten the project development cycle.

Language: Python - Size: 209 KB - Last synced at: about 1 month ago - Pushed at: 2 months ago - Stars: 2 - Forks: 0

lisek75/ts_forecasting_notebook

Time series forecasting using ML models (ARIMA, SARIMA, SARIMAX and Prophet)

Language: Jupyter Notebook - Size: 22.9 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

Himank-Khatri/ClassiFlow

A web app that automates tedious data preprocessing and machine learning model testing.

Language: Python - Size: 258 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

Tszon/Data-Science-Projects

Included are all the worth-noting Data Science projects in my learning journey.

Language: Jupyter Notebook - Size: 1.69 MB - Last synced at: about 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

RafiQamar/Customer-Churn-Prediction-App

Built and deployed a Streamlit-based customer churn prediction app using ML models. Preprocessed data with encoding and scaling, improving model accuracy. Designed for churn prediction and retention insights.

Language: Jupyter Notebook - Size: 876 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

nlqthinh/WeaviateAnime

Explore your favorite anime with this interactive search app! 🚀 This project leverages Weaviate for vector search and Gradio for a seamless user interface. Using embeddings from a custom anime dataset, you can perform quick and accurate similarity searches for anime titles

Language: Python - Size: 8.87 MB - Last synced at: about 2 months ago - Pushed at: 2 months ago - Stars: 2 - Forks: 0

bindugayatri02/Employee-Data-Preprocessing-for-Tableau-Analysis-Coursera-Project-

For this project, I preprocessed employee data sourced from three Excel files hosted on Tableau Public: "Employee names," "Employee data," and "Employee travel responses." This dataset encompasses employee IDs, names, hire dates, travel survey responses, and other relevant information. The source files and the final processed data are attached.

Size: 120 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

RafiQamar/HR-Analytics-Project

Cleaned and processed HR data using Python for analysis and visualization. Analyzed employee trends and performance using SQL and Python. Built an interactive Power BI dashboard connected to MySQL for dynamic insights.

Language: Jupyter Notebook - Size: 4.71 MB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 2 - Forks: 0

msche81/2-Jedha_Fullstack

450h Data Scientist training - Collect and store large amounts of data - Build prediction models in Machine Learning and Deep Learning - Deploy your models in real conditions

Language: Jupyter Notebook - Size: 248 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 2 - Forks: 0

ArtZaragozaGitHub/CV--Plants-Seedling-Classification

A robust image classifier using CNNs to efficiently classify different plant seedlings and weeds to improve crop yields and minimize the extensive human effort to do this manually.

Language: Jupyter Notebook - Size: 7.81 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

khangbdd/Data-processing-CLI

CLI tools for preprocess csv data

Language: Python - Size: 23.4 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

GMeghana19/solar-power-output

Solar power prediction using liner regression

Language: Jupyter Notebook - Size: 1.44 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

drleniaw/Analysis_Sentiment_Twitter_Free_Sex_In_Indonesian

Analysis Sentiment on Twitter Free Sex In Indonesia

Language: Jupyter Notebook - Size: 2.35 MB - Last synced at: 3 months ago - Pushed at: about 2 years ago - Stars: 3 - Forks: 0

RafiQamar/IMDb-Movie-Analysis

This project involves web scraping, data preprocessing, database storage and visualization of IMDb movie data from the last decade (2014-2024). The dataset includes details of 10,000 movies such as name, release year, genre, ratings, metascore and more. The project culminates in an interactive Power BI dashboard for in-depth insights and reporting.

Language: Jupyter Notebook - Size: 24.1 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

iliavrtn/final-project

This project explores whether Mathematics and Computer Science texts still retain enough linguistic patterns (metalanguage) for classification once domain-specific words are removed. 🤖📚

Language: Jupyter Notebook - Size: 15.5 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

WaodeAnisaNurdinia/PreprocessingModelKNN

22.114966_Waode Nurdinia Anisa

Language: Jupyter Notebook - Size: 1.33 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

MatanNafshi/Wine-Quality-Prediction-Machine-Learning-Python

This project predicts wine quality using machine learning based on chemical properties like acidity, sugar content, and alcohol. It includes data exploration, preprocessing, and applying models like Linear Regression, Random Forest, and SVM. Models are evaluated for accuracy to determine the best predictor of wine quality.

Language: Jupyter Notebook - Size: 2.24 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

Nouran246/Credit-Card-Approval-Prediction-Classification Fork of Rowlkh/Credit-Card-Approval-Prediction-Classification-

This project predicts credit card application approval by analyzing applicant data. It includes EDA, preprocessing, feature selection with Genetic Algorithms, and classification using KNN, Decision Trees, and MLP models.

Language: Jupyter Notebook - Size: 8.25 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 1

sarahloree/Project-2--Bank-Loan-Marketing-Model

This is the second project I completed as part of the Machine Learning Module from my post-graduate certification in AI/ Machine Learning from University of Texas' McCombs School of Business.

Language: Jupyter Notebook - Size: 3.62 MB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

lucianoscarpaci/News-Data-Classification

Using the Reuters dataset, this example illustrates the process of data preprocessing, model definition and training, and performance evaluation.

Language: Jupyter Notebook - Size: 94.7 KB - Last synced at: 2 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

shahzadsiddiqi/NLP

This repository contains implementations and workflows for key NLP tasks like text classification, Generative AI, sentiment analysis, and entity recognition. It includes preprocessing scripts, annotated datasets, and fine-tuning methods for frameworks like Hugging Face and spaCy. Ideal for building and deploying scalable NLP solutions.

Size: 1000 Bytes - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

alvaro-concha/animal-behavior-preprocessing

animal-behavior-preprocessing is a Python repository to preprocess animal behavior data. It works on the output spreadsheets from video-tracking of animal body parts with LEAP or DeepLabCut. It applies a Median Filter, an Ensemble Kalman Filter, transforms data to joint angles and computes their Morlet Wavelet Spectra.

Language: Python - Size: 251 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 2 - Forks: 0

AlejandroLara11/MachineLearningCourse

Machine Learning Basics: From Setup to Clustering

Language: Python - Size: 1.1 MB - Last synced at: about 2 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

Pritam3355/Scripts

This Repository contains differnt scripts for data collection

Language: HTML - Size: 2.12 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

girgisadel/RegressionUsingCsharp

A machine learning project to predict taxi fares using ML.NET. This solution includes end-to-end data preprocessing, training, evaluation, and prediction, designed for both learning and practical deployment.

Language: C# - Size: 0 Bytes - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

thiwaK/preprocess-50k-tiles-sri-lanka

Preprocessing scripts for 1:50K tiles issued by the survey department, Sri Lanka

Language: Python - Size: 16.6 KB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

karthik-d/nyc-taxi-dataset-eda

Clearning, transformation and analysis large datasets as part of coursework for UCS1629: Data Warehousing and Data Mining.

Language: Jupyter Notebook - Size: 9.79 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 1 - Forks: 1

saadhaniftaj/AI-EssayScore-Automated-Essay-Scoring-Using-LSTM

AI-EssayScore is an automated essay scoring system using LSTM neural networks. It tokenizes and pads essays, processes them through an LSTM model, and predicts scores. The project includes data preprocessing, model training, evaluation, and saving the model for future use.

Language: Jupyter Notebook - Size: 8.8 MB - Last synced at: about 2 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

ChristianGoueguel/specProc

The specProc package is a collection of preprocessing tools for spectroscopy data analysis.

Language: R - Size: 68.6 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 4 - Forks: 0

Sabaudian/Music_Genre_Classification_project

Audio Pattern Recognition project - Music Genres Classification

Language: Python - Size: 1.33 GB - Last synced at: 2 months ago - Pushed at: 11 months ago - Stars: 2 - Forks: 0

PhilaController/gun-violence-dashboard-data

Python toolkit for preprocessing data for the City Controller's Gun Violence Dashboard

Language: Python - Size: 355 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 1 - Forks: 1

Pawel-Tomasz-Nowak/Scientific-collaboration

The repository highlights the results of my scientific collaboration with Dr. Eng. Adam Zagdański

Language: Jupyter Notebook - Size: 111 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

neosaffana/TugasDataMining1

Tugas 1 Mata Kuliah Data Mining

Language: Jupyter Notebook - Size: 25.4 KB - Last synced at: about 2 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

ELHoussineT/AutoDataCleaner

Simple and automatic data cleaning in one line of code! It performs one-hot encoding, date & time casting to datetime dtype, detects binary columns, safely convert non-numeric columns to numeric dtypes, cleaning dirty/empty values, normalizing values and removing unwanted columns all in one line of code. Get your data ready for model training and fitting quickly.

Language: Python - Size: 647 KB - Last synced at: 23 days ago - Pushed at: almost 4 years ago - Stars: 19 - Forks: 4

UniFeat/unifeat

An open-source tool for performing feature selection process in different areas of research

Language: Java - Size: 30.5 MB - Last synced at: 5 months ago - Pushed at: almost 2 years ago - Stars: 9 - Forks: 3

sorrychoe/pyBigKinds

BigKinds Data Analysis Toolkit for python

Language: Python - Size: 31.1 MB - Last synced at: 10 days ago - Pushed at: 7 months ago - Stars: 1 - Forks: 0

Animesh-Chourey/Loan-Classifier

Trained machine learning algorithms (Logistic Regression, KNN, SVM, Decision Tree) specifically, after performing visualization and pre-preocessing tasks on a loan dataset. Executed the evaluation metrics such as F1-score, Log loss and jaccard-similarity score to assess the algorithms performance.

Language: Jupyter Notebook - Size: 29.3 KB - Last synced at: 2 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

AmruhaAhmed/Data-Cleaning-on-New-York-Airbnb-Listings

Language: Jupyter Notebook - Size: 3.11 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0

mariotruss/ML-supportticket-classifyer-prep

🔬 For a paper on AI / ML in Support Ticket Systems, I used this code to clean my data.

Language: Python - Size: 6.84 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

eZWALT/MVA-MultiVariate-Analysis

MDS-FIB Multivariate-Analysis (MVA) subject 2024-25 Q1

Language: R - Size: 135 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

DavidRichardson02/CSV_DataSet_Analysis

The program processes CSV files to capture and format file contents, generate custom directories of files, extract data, perform analysis, and generate MATLAB script(s) for visualization and further analysis.

Language: C - Size: 128 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 2 - Forks: 0

istolesweetroll/Elimination-of-entry-preprocessing-errors

R language Shiny application using shiny.fluent, presenting methods of applying machine learning algorithms in elimination of entry preprocessing errors.

Size: 0 Bytes - Last synced at: 9 months ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

BhavinPatel4199/Machine-Learning-Framework

This repository, showcases various projects that explore key concepts in both supervised and unsupervised learning, with a focus on real-world applications. The projects utilize a range of machine learning techniques, including data preprocessing, feature selection, exploratory data analysis (EDA), and model optimization.

Language: Jupyter Notebook - Size: 13.8 MB - Last synced at: about 1 month ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

AndrewDettor/YouTubeMostPopularVideos

ETL data pipeline using YouTube API, AWS EC2, and AWS RDS, with EDA and Tableau visualizations.

Language: Jupyter Notebook - Size: 7.59 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

ManjiriSDS/Data-Science-Case-Study

Language: Jupyter Notebook - Size: 104 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

blleshi/Neural_Network_Binary_Classification

Venture Funding with Deep Learning (Neural Network Binary Classification)

Language: Jupyter Notebook - Size: 278 KB - Last synced at: 2 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

weiglszonja/meeg-tools

EEG/MEG data preprocessing and analyses framework

Language: Jupyter Notebook - Size: 120 MB - Last synced at: 7 days ago - Pushed at: about 3 years ago - Stars: 12 - Forks: 5

Jatin-Mehra119/Flight-Price-Prediction

This study aims to analyze flight booking data from "Ease My Trip" website, using statistical tests and linear regression to extract insights. By understanding this data, valuable information can be gained to benefit passengers using the platform.

Language: Jupyter Notebook - Size: 10.9 MB - Last synced at: 2 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

ItsCodeBakery/K-Means-Clustering

Music Recommendation System using K-Means Clustering

Language: Jupyter Notebook - Size: 2.69 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 1 - Forks: 0

IRoyalX/Dataset_Preprocessing_Sample

UNI S6: Preprocessing in Data Mining using ucimlrepo

Language: Python - Size: 3.91 KB - Last synced at: about 2 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

mohd-faizy/Preprocess_ML

This repository hosts Python code that utilizes the Scikit-learn preprocessing API for data preprocessing. The code presents a comprehensive range of tools that handle missing data, scale data, encode categorical variables, and perform other functions.

Language: Jupyter Notebook - Size: 1.33 MB - Last synced at: 2 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

amorim-dev/DSA_Data-Science-Course

Data Dcience Education in Data Science Academy. The course is in Portuguese and online.

Language: Jupyter Notebook - Size: 161 MB - Last synced at: 11 months ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

nagababumo/Preprocessing-Unstructured-Data-for-LLM-Applications

Language: Jupyter Notebook - Size: 37.1 KB - Last synced at: 2 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 2

karimelmolla/fakenews-detection

"Factual Eye" is a Fake News Detection mobile application developed as our graduation project. Utilizing machine learning and deep learning models, our project aims to combat misinformation effectively.

Language: Jupyter Notebook - Size: 71.7 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

akshupande/Sales-Analysis-Enhancing-Customer-Experience-and-Boosting-Sales-through-Data-Insights

Unlock valuable data insights with Sales Analysis, a project focused on analyzing sales data to identify trends, patterns, and recommendations for enhancing customer experience and increasing sales.

Language: Jupyter Notebook - Size: 807 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 1 - Forks: 0

DavidRichardson02/CSV_Data_Set_Analysis

The program processes CSV files to capture and format file contents, generate custom directories of files, extract data, perform analysis, and generate MATLAB script(s) for visualization and further analysis.

Language: C - Size: 255 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 1 - Forks: 0

lucasDSBR/AI-data-preprocessing

Data preprocessing for Artificial Intelligence

Language: Jupyter Notebook - Size: 2.6 MB - Last synced at: 3 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

emkr-13/model_ta

Model buat TA Sentimen and Topik Berita Indonesia

Language: Jupyter Notebook - Size: 70 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 1 - Forks: 0

Unstructured-IO/community 📦

Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.

Size: 5.7 MB - Last synced at: 12 months ago - Pushed at: about 2 years ago - Stars: 19 - Forks: 6

azevedontc/dataPreprocessing

Introduction to KDD and data preprocessing / Introdução ao KDD e pré-processamento de dados

Language: Python - Size: 396 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

CCaribe9/AdaptStdEPF

Code and experiments related to the paper: 'An adaptive standardisation model for Day-Ahead electricity price forecasting'

Language: Jupyter Notebook - Size: 96.3 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 0

Rahafzsh/DataMiningTasks

Language: Jupyter Notebook - Size: 6.2 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

XuanyiJennyMa/pupil_cloud_data_preprocessing_Phase_1

Scripts for pre-processing eye-tracker data from pupil cloud

Language: Python - Size: 2.08 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 0

kkmk11/BLIGHT-VISION

This is a ML based Web App that aims to detect the presence of late blight or early blight on potato leaves, which are the primary causes of crop damage. Additionally, the system recommends appropriate precautions and pesticides to help farmers eliminate the blight and protect their crops and increasing their yields.

Language: PureBasic - Size: 79.3 MB - Last synced at: about 1 year ago - Pushed at: almost 2 years ago - Stars: 3 - Forks: 0

HayatiYrtgl/audio_processing_for_cnn_network

Spectrum creation is the most important thing while dealing with audio data

Language: Jupyter Notebook - Size: 581 KB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

kenza-ily/24UCL_HospitalReadmissionPred-DiabeticPatients

ML prediction of hospital readmissions for diabetic patients

Language: Jupyter Notebook - Size: 48.6 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

mr-daniyalkhan/Cardiovascular-Disease-Prediction

This Repository contains End to End Project of Cardiovascular Disease Prediction System.

Language: Jupyter Notebook - Size: 15.6 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

juliorodrigues07/ny_rent_pricing 📦

Rent pricing prediction on NY properties with interactive dashboards.

Language: Jupyter Notebook - Size: 13.9 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

RAQUELFONT/Master-s-Projects

A compilation of impactful projects undertaken during my master's degree studies. 🎓

Language: Jupyter Notebook - Size: 4.06 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

prakhar8922/WhatsApp-chat-analyzer

WhatsApp Chat Analyzer is a user-friendly Python application designed to provide insightful analyses of your WhatsApp group or individual chats.

Language: Python - Size: 11.7 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

SkullkyAI/ML-CLASIFICACION-TITANIC-KAGGLE

Práctica de clasificación con Machine Learning en el dataset del Titanic, abordando exploración de datos, preprocesamiento, selección de métricas y modelos, con el objetivo de analizar detalladamente los resultados obtenidos.

Language: Jupyter Notebook - Size: 7.97 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

damaniayesh/Cognifyz_Internship_Tasks

The project provides Four Tasks which is given by Cognifyz Technology.

Language: Jupyter Notebook - Size: 1.14 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

jdenisova/user-churn-prediction

Machine learning project for solving binary classification problem using logistic regression and gradient boostin

Language: Jupyter Notebook - Size: 130 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

iremhttp/DepressionDetection

Text-Based Depression Detection By Machine Learning

Language: Jupyter Notebook - Size: 10.2 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

chollette/SEDNet_Shallow-Encoder-Decoder-Network-for-Brain-Tumor-Segmentation

Official Implementation for SEDNet

Language: Jupyter Notebook - Size: 57.9 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Minarose/Resting-State-fMRI-Analysis

some of the work I've done with resting-state fMRI

Language: Jupyter Notebook - Size: 119 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 1

Rubenmarbez/Proyecto-HomeFinder

Con HomeFinder se busca crear una herramienta que permita a sus usuarios encontrar las mejores ofertas que se adapten a sus necesidades y preferencias, a través del análisis de datos de venta de inmuebles de segunda mano en Madrid.

Language: Jupyter Notebook - Size: 2.62 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

functorism/snapcrop

CLI for crop/resize of large amounts of images with configurable resolutions

Language: Rust - Size: 17.5 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

wasifijaz/Airbnb-Listings-Success-Classification

Airbnb Listings Success Label Classification

Language: Python - Size: 238 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Shipra-09/ML-Project-KNN-Classification

This Github repository contains projects related to KNN classification. Exploring Insights/Inferences by performing EDA on the given project data (Iphone purchase and Bangalore house price).

Language: Jupyter Notebook - Size: 1010 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Related Keywords
preprocessing-data 158 machine-learning 58 python 48 data-science 25 exploratory-data-analysis 19 data-visualization 19 pandas 17 preprocessing 16 data-analysis 16 feature-engineering 11 numpy 11 machine-learning-algorithms 10 scikit-learn 10 seaborn 9 logistic-regression 8 dataset 8 classification 8 deep-learning 8 data 7 feature-selection 7 eda 7 clustering 6 cleaning-data 6 python3 6 tensorflow 6 matplotlib 6 artificial-intelligence 5 random-forest-classifier 5 knn-classification 5 csv 5 jupyter-notebook 5 linear-regression 5 predictive-modeling 5 powerbi 5 data-mining 5 random-forest 5 keras-tensorflow 4 sklearn 4 datacleaning 4 sklearn-library 4 nlp 4 data-engineering 4 data-cleaning 4 statistics 4 neural-network 4 dimensionality-reduction 4 svm-model 3 nltk-python 3 neural-networks 3 twitter 3 sentiment-analysis 3 scikitlearn-machine-learning 3 numpy-library 3 r 3 preprocessor 3 matplotlib-pyplot 3 machinelearning 3 streamlit 3 supervised-learning 3 keras 3 business-analytics 3 data-structures 3 flask 3 feature-extraction 3 regression-models 3 statistical-analysis 3 analysis 3 decision-tree-classifier 3 plotly 3 svm-classifier 3 datascience 3 joblib 2 standardscaler 2 vizualize-data 2 cnn-classification 2 confusion-matrix 2 hyperparameter-optimization 2 regression 2 mysql-database 2 graph 2 pandas-python 2 modeling 2 hypothesis-testing 2 svm 2 docker 2 streamlit-webapp 2 lstm 2 audio-processing 2 natural-language-processing 2 wordcloud 2 generalization 2 data-exploration 2 descision-tree 2 pandas-dataframe 2 hyperparameter-tuning 2 nlp-machine-learning 2 jupyter 2 naive-bayes-classifier 2 text-processing 2 text-analysis 2