An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: cleansing-data

Wb-az/ML-airbnb-paris-analytics-and-price-prediction

Airbnb Paris - analytics and accommodation price prediction

Language: Jupyter Notebook - Size: 36.8 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

daddel80/notepadpp-multireplace

MultiReplace is a Notepad++ plugin for advanced multi-string replacements. It supports saving and loading replacement lists, CSV column targeting, match highlighting, and external hash tables for DQ cleansing and anonymization. Conditional and mathematical operations are fully integrated.

Language: C - Size: 41.8 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 31 - Forks: 7

autistic-symposium/ml-netclean-py 📦

👾 package to cleanse complex networks data, extracted from the ml-graph-network-analyser

Language: Python - Size: 6.96 MB - Last synced at: 6 days ago - Pushed at: 6 months ago - Stars: 11 - Forks: 0

ws-garcia/VBA-CSV-interface

The power you need to cleanse, filter, sort, reshape, manage and analyze data from CSV files.

Language: VBA - Size: 146 MB - Last synced at: 26 days ago - Pushed at: 26 days ago - Stars: 73 - Forks: 9

tomexiskandar/bintang

A tiny and temporary db for quick data cleansing and transformation. It is a high-level python coding and would help any Pythonistas up to speed with ETL work.

Language: Python - Size: 28.8 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 2 - Forks: 1

NouranHaitham/ML_WaterQuality

A notebook aimed at predicting and improving water safety by analyzing contaminants and pollution levels in water sources, enhancing public health and ensuring access to clean drinking water.

Language: Jupyter Notebook - Size: 4.81 MB - Last synced at: about 1 month ago - Pushed at: 7 months ago - Stars: 1 - Forks: 1

intigration/CVA

CVA highlights shifts in the process mean, making it particularly effective for detecting small changes.

Language: Python - Size: 15.6 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

JonathanPollyn/Diabetic-Prediction

Diabetes prediction utilizing established characteristics. The objective of this exercise is to showcase the efficacy of Machine learning. The dataset comprises various health-related attributes gathered to facilitate the creation of predictive models for detecting potential diabetes risks.

Language: Jupyter Notebook - Size: 1.28 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

bensetiawan/API_Cleansing_Data_Tweets_with_Python_Regex

API Flask for Cleansing Input Text and Tweet File

Language: Python - Size: 9.72 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

sai0299/RawDataAnalysis

Cleaning and Analysing the raw data containing customers details of a grocery store based on their age, gender and products they purchase.

Size: 15.6 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

cristhianc001/movie-recommendation-system

Movie recommender system based on content using Tfid-idf vectorizer and cosine similarity to calculate the scores and render.com free trail to deploy the results

Language: Jupyter Notebook - Size: 63.2 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 0

christadel27/2300944_09_ADE_hate-speech_Challenge-Gold

Tugas Gold Challenge

Language: Python - Size: 6.91 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

microsoft/AutoBrewML

With AutoBrewML Framework the time it takes to get production-ready ML models with great ease and efficiency highly accelerates.

Language: Jupyter Notebook - Size: 141 MB - Last synced at: 14 days ago - Pushed at: over 1 year ago - Stars: 24 - Forks: 32

sgrams/ci 📦

computational intelligence, university of gdańsk 2019-2020

Language: Jupyter Notebook - Size: 1.28 MB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

vcwild/data-cleaning 📦

Notebooks and scripts with data cleansing methods.

Language: Jupyter Notebook - Size: 911 KB - Last synced at: 3 days ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

SuryaRao007/Python-SQL-like-analysis

Data preparation for machine learning involves a. Removing data with blanks b. Picking up only those rows where there X value column has valid values

Language: Jupyter Notebook - Size: 23.4 KB - Last synced at: 3 months ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

Related Keywords
cleansing-data 16 machine-learning 7 python 6 data-science 4 deep-learning 2 sql 2 csv 2 data 2 anomaly-detection 2 gridsearchcv 2 randomforestclassifier 1 render 1 recommender-system 1 recommendation-system 1 regression-models 1 fastapi 1 exploratory-data-analysis 1 dimensionality-reduction 1 cosine-similarity 1 cleaning-data 1 raw-data-analysis 1 raw-data 1 excel 1 prediction 1 dataanalysis 1 cleaned-data 1 regex 1 predictive-modeling 1 water-quality 1 threshold 1 statistical-analysis 1 machine-learning-algorithms 1 analytics 1 dataframes 1 r 1 data-cleaning 1 text-mining 1 neural-networks 1 naive-bayes-classification 1 knn-classification 1 genetic-algorithm 1 deep-neural-networks 1 data-compression 1 data-c 1 association-rules 1 3-satisfiability 1 text-summarization 1 text-classification 1 text-analysis 1 sampling-strategies 1 responsible-ml 1 nlp-machine-learning 1 microsoft 1 datavisualization 1 azure-automl 1 swagger 1 flask 1 api 1 vectorization 1 scikit-learn 1 logistic-regression 1 string-replace 1 replacement 1 replace-text 1 notepadplusplus 1 notepad-plusplus-plugin 1 notepad-plus-plus 1 find-and-replace 1 filter-replacement 1 delimited-file 1 delimited-data 1 delimited 1 computational-replacement 1 column-filter 1 anonymized-data 1 anonymize 1 xgboost-regression 1 xai-shap 1 wrangling-cleaning 1 visualization 1 svr-regression-prediction 1 random-forest 1 pipeline 1 neural-network 1 gradient-boosting 1 feature-selection 1 ensemble-machine-learning 1 hyperparameter-tuning 1 decision-trees 1 dataset 1 dataprocessing 1 classification-models 1 transform 1 tinydb 1 tempdb 1 tabular 1 table 1 staging-tables 1 etl 1 db 1