An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: data-preparation

ibrahim-Sobh/EPITA_course-materials

EPITA Course Materials

Language: Jupyter Notebook - Size: 532 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 0

Ashleshk/Power-BI-A-Z-Hands-On-Power-BI-Training-For-Data-Science-Udemy

Learn data visualization through Microsoft Power BI and create opportunities for you or key decision makers to discover data patterns such as customer purchase behavior, sales trends, or production bottlenecks. You'll learn all of the features in Power BI that allow you to explore, experiment with, fix, prepare, and present data easily, quickly, and beautifully.

Size: 5.82 MB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 4 - Forks: 0

kushagrathisside/Dataset-Creator

Dataset preparation tool which can be used to create numeric datasets for Images. It takes uploaded images or live web camera inputs and converts it into numerical records with landmarks as values.

Language: Jupyter Notebook - Size: 60.5 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

ruchikaverma-iitg/MoNuSAC

This repository contains my implementations of the algorithms which MoNuSAC participants could use for data preparation to train their models at ISBI 2020.

Language: Jupyter Notebook - Size: 137 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 46 - Forks: 11

archettialberto/federated_survival_datasets

Build realistic heterogeneous datasets for federated survival analysis in a reproducible way.

Language: Jupyter Notebook - Size: 112 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

mzguntalan/vegetable

Vegetable contains a design/definition of a Vector Graphic that allows it to easily render it as equally an spaced point cloud/sequence. From this, vegetable offers a way to read .ttf font files, and render their glyphs into point clouds/sequences.

Language: Python - Size: 1.45 MB - Last synced at: about 1 month ago - Pushed at: almost 3 years ago - Stars: 2 - Forks: 1

LeHongNgoc3820/06.Data_Preprocessing_and_EDA

Language: Jupyter Notebook - Size: 1.6 MB - Last synced at: 4 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

sbcgua/mockup_compiler 📦

ABAP Excel to zip converter for mockup loader tool

Language: ABAP - Size: 130 KB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 1

ericcornelissen/controlled-vocab 📦

A multi-threaded Python CLI tool to created a controlled vocabulary

Language: Python - Size: 217 KB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

mporcheron/navi-converter 📦

Convert the VUI Excel XLSX corpus file into a open file formats.

Language: PHP - Size: 7.81 KB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

AlbertoMarinelli/Data-Mining-UniPi Fork of alessandrocubic/dataMining

Data Mining project carried out on two datasets extracted from the Twitter platform, one on Users and one on Tweets

Language: Jupyter Notebook - Size: 253 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

oliverweissl/DataScience-Project Fork of ND-code-ai/KS_DSProject_VU

Data science project on predicting funding for kickstarter campaigns

Size: 25.5 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

IchfanKurniawan/idx-partner-internship-project

Develop model & scorecard to evaluate loan credit risk - IDX Partner V Internship Program

Language: Jupyter Notebook - Size: 1.15 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

IchfanKurniawan/airline-reviews-dashboard-project

Scrape & prepare data for building a dashboard of airlines reviews

Language: Jupyter Notebook - Size: 17.6 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

IchfanKurniawan/quantium-intern-project-stats-test

Statistical testing of supermarket layouts to the total daily sales - Quantium Intern Project

Language: Jupyter Notebook - Size: 322 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

arrahtech/osdq-desktop

The classic desktop version of osDQ

Language: Java - Size: 106 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 9 - Forks: 8

OgeAno/Hotel-KPI-Analysis

An analysis of some trends in hotel KPIs

Language: Jupyter Notebook - Size: 257 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

umich-dbgroup/foofah

Foofah: programming-by-example data transformation program synthesizer

Language: CSS - Size: 4.31 MB - Last synced at: about 2 years ago - Pushed at: about 7 years ago - Stars: 25 - Forks: 10

HROlive/From-Data-to-Insights-with-Google-Cloud-Platform

Four-course accelerated online specialization teaches course participants how to derive insights through data analysis and visualization using the Google Cloud Platform

Language: Jupyter Notebook - Size: 715 KB - Last synced at: 14 days ago - Pushed at: almost 6 years ago - Stars: 5 - Forks: 3

IchfanKurniawan/quantium-intern-project

quantium-intern-project

Language: Jupyter Notebook - Size: 6.36 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

zilmabezerra/portfolio

👉 Click here to see some of my personal projects 👈

Language: Jupyter Notebook - Size: 41 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

cyrilico/premier-league-information-platform

Information platforms developed for DAPI course @ MIEIC, FEUP

Language: Python - Size: 21.7 MB - Last synced at: almost 2 years ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 0

amaralnt/Unicorn_Companies_Project

Data Cleaning, Preparation, and Analysis of an Unicorn Companies dataset

Size: 49.8 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

ashish-kamboj/Market-Mix-Modeling

Market Mix Modelling for an eCommerce firm to estimate the impact of various marketing levers on sales

Language: R - Size: 5.05 MB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 35 - Forks: 28

bilalayaz2/Sentiment-Analysis-for-6-Basic-Emotion

In this project we clean the dataset and preprocess it to make it ready for classification for 6 basic emotions. Then we implement Sentiment Analysis and then deploy the Machine Learning Pipeline and then use it for Classification of emotions.

Language: Python - Size: 5.86 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

ikmb/OmLiT

A Rust accelerated library for annotation and preparing multi-omics data for training deep learning models

Language: Rust - Size: 6.66 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

OgeAno/Online-Sales-Analysis

An analysis of a webshop's sales over a 2-year period

Language: Jupyter Notebook - Size: 405 KB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 0

FrenchKrab/msdwild-pyannote

Automatically setup the MSDWild dataset for usage with pyannote-database (and pyannote-audio)

Language: Python - Size: 2.93 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

muharienal/adidas-prep-viz

Prepare the Adidas US Retail Products dataset using Python and build data visualization using Power BI

Language: HTML - Size: 1.12 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

aws-samples/sm-data-wrangler-mlops-workflows

Integrate SageMaker Data Wrangler into your MLOps workflows with Amazon SageMaker Pipelines, AWS Step Functions, and Amazon Managed Workflow for Apache Airflow (MWAA)

Language: Jupyter Notebook - Size: 2 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 10 - Forks: 2

miraehab/Airline-Passenger-Satisfaction

Predict the Satisfaction of a customer using a dataset from Kaggle.

Language: Jupyter Notebook - Size: 4.34 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 1

karlosdaniel451/data-analysis

Material and exercises on Data Analysis and Manipulation with Python. Sources: https://wesmckinney.com/book/ https://jakevdp.github.io/PythonDataScienceHandbook/

Language: Jupyter Notebook - Size: 161 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

phueb/Preppy

prepare ordered language data for RNN training

Language: Python - Size: 147 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 0

Godson199/Python-for-Data-science

Explored and used the techniques of data science such as data wrangling, cleaning, visualization, EDA, etc for data preparation.

Language: Jupyter Notebook - Size: 1.56 MB - Last synced at: about 1 month ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

Godson199/Multivariate-Multiple-Regressor

multivariate multiple regression model to study the effect of eight input variables on two output variables, which are the heating load and the cooling load, of residential buildings.

Language: Jupyter Notebook - Size: 3.42 MB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

lrb924/Collateral_Damage

Collateral Damage: The Effect of War on Stocks, Housing, and Unemployment

Language: Jupyter Notebook - Size: 11.9 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 3

ARUKCFE/BmiAlgorithm

Calculate body mass index in patients of the Clinical Practice Research Datalink (CPRD)

Language: Stata - Size: 22.5 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

AkashSDas/tensorflow-for-audio-101

Snippets from learning tensorflow for audio data and models.

Language: Jupyter Notebook - Size: 2.53 MB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 0

SuhailRafi/Artificial-Intelligence-Lab

This repository includes the Python Lab Assignments from my CSE422L Artificial Intelligence course at the School of Data and Sciences of BRAC University, Dhaka, Bangladesh.

Language: Jupyter Notebook - Size: 524 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

gaurang557/image-captioning

This was an advanced step in my machine learning ,I trained the model so that it generates a string caption when provided an image input, the output accuracy is quite impressive ,I used flask to make it interactive

Language: Python - Size: 5.69 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 0

Subhrajit91939/Preprocessing-CLI

Data Pre-processing CLI⚡- Command Line Interface python app to automate data pre-processing

Size: 24.4 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 2 - Forks: 0

ni3choudhary/Apple-Foliar-Disease-Classification-Deployment

A DL project that helps in identifying Foliar disease in apple trees weather its leaves are healthy, are infected with apple rust, those that have apple scab, and those with more than one disease.

Language: Jupyter Notebook - Size: 3.67 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

showman-sharma/titanic-survival-prediction

We build a predictive model that answers the question: “what sorts of people were more likely to survive the Titanic shipwreck?” using passenger data (ie name, age, gender, socio-economic class, etc).

Language: Jupyter Notebook - Size: 797 KB - Last synced at: almost 2 years ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 0

Subhrajit91939/Curated-Data-Preprocessing-EDA-Projects

A curated list of Data Pre-Processing techniques and EDA projects in Python.

Size: 1000 Bytes - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

srikanth-gedela/SriMLModels

Quick reference on various aspects of machines learning that I have come acrossed and my Machine Learning portfolio.

Language: Jupyter Notebook - Size: 66.2 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

prateeksahu147/OCR-PDF-Web-Scraper

Engine for automated the process of scraping PDFs into local and convert those PDFs into text by performing OCR.

Size: 0 Bytes - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

chomiczdawid/data-preparation

Process of data preparaton in R.

Language: R - Size: 3.63 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

rifqinvnd/Game-Recommender

Game Recommendation System with Content-based filtering

Language: Jupyter Notebook - Size: 134 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 0

ved93/ml-express

A Python library for day to day data analysis and machine learning. This aims to make data building, cleaning and machine learning much much faster. A library of extension and helper modules for Python's data analysis and machine learning libraries.

Language: Python - Size: 68.4 KB - Last synced at: 29 days ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 1

Laxman-Lakhan/Chess-Data-Analysis

User Specific Chess Data Analysis

Language: Jupyter Notebook - Size: 1.09 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

serdarakyol/Collaborative-filtering-data-preparation

Prepares data for collaborative filtering

Language: Jupyter Notebook - Size: 7.07 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

yes-its-shivam/image-processing-scripts

Language: Jupyter Notebook - Size: 31.3 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

FOehlschlaeger/udemy-tableau-fundamentals-of-data-visualization

This project contains data originating from the Udemy course Fundamentals of Data Visualization in Tableau.

Size: 4.42 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

saran-sankar/google-cloud-ai-tools

Tools for making machine learning on Google Cloud easier

Language: Python - Size: 2.2 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

daya6489/DriveML

Self-Drive Machine Learning Projects

Language: R - Size: 6.17 MB - Last synced at: over 1 year ago - Pushed at: almost 4 years ago - Stars: 13 - Forks: 4

aiolii63/Netflix-Movie-Database

Supervised machine learning project based on Netflix IMdb movie database

Language: Jupyter Notebook - Size: 16.5 MB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 1 - Forks: 1

rifatrakib/data-lab

Python statistical data analysis project on stored preprocessed data, data manipulation and preparation

Size: 1000 Bytes - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

rifatrakib/mongo-postprocessing

A dynamic scripting for in place data processing on MongoDB collections with generalized postprocessing pipelines aimed to handle any types and sizes of data

Language: Python - Size: 43 KB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

DePacifier/TeleCo-Analysis

10 Academy on Training Project User Analytics in the Telecommunication Industry

Language: Jupyter Notebook - Size: 118 MB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 3

ST3LL/Data-Analysis-Project

December 2021 - Final 4th engineering year Project for the Python for Data Analysis module at ESILV | Blocks Classification & Seoul Bikes Rent Prediction

Language: Jupyter Notebook - Size: 14.2 MB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

SerhatDerya/medical_examination_research

This repository contains a research about medical examinations (such as body measurements, results from various blood tests, and lifestyle choices).

Language: Jupyter Notebook - Size: 1.08 MB - Last synced at: 2 months ago - Pushed at: about 3 years ago - Stars: 1 - Forks: 0

Cloud-SPAN/02genomics

Data preparation and organisation

Language: Python - Size: 51.2 MB - Last synced at: 22 days ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

SSusantAchary/Data-Annotator-for-SpaCy

🚀SpAnnor annotator for Named Entity Recognition easy to use tool. The annotator allows users to quickly assign custom labels to one or more entities in the text. Easy to setup for Data Training for SpaCy 🔥.

Language: HTML - Size: 3.99 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 5 - Forks: 1

shinanna/Banking_Data_Analysis

Data Analysis Project on Bank Deposits Dataset

Language: Jupyter Notebook - Size: 515 KB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 1

CleverInsight/sparx

Data Munging, Data Wrangling and Data Preparation Simplified

Language: Python - Size: 1.51 MB - Last synced at: about 2 years ago - Pushed at: almost 8 years ago - Stars: 1 - Forks: 3

Minhaz78/Machine-Learning-Practice

Language: Jupyter Notebook - Size: 469 KB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

ecaybek/dp2fa

A simple Shiny application for preparing data to factor analysis

Language: R - Size: 21.5 KB - Last synced at: 6 months ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

OlehOleinikov/text_parser_otp_bank_statement

Get a structured table with the ability to sort and filter data (for simple office use) from text heap of PDF bank statement

Language: Python - Size: 1.28 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

KwokHing/Exploratory-Data-Analysis-on-SMRT-Tweets

Demo on performing exploratory data analysis (EDA) on train service disruptions based on scrapped (user generated contents) tweets from the train operator's (SMRT) twitter account

Language: Jupyter Notebook - Size: 1.25 MB - Last synced at: about 1 month ago - Pushed at: over 5 years ago - Stars: 3 - Forks: 4

Faroja/Text-Classification-Email-Spam

Text Mining with Preprocessing Scheme, Lemmatization, TF-IDF, Using Model Machine Learning RF, KNN, LOGREG & Decision Tree

Language: Jupyter Notebook - Size: 3.01 MB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

micpah/football-prediction

My project for two advanced training courses about machine learning and neural networks at educx (https://educx.de/).

Language: Python - Size: 4.42 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

danielhaake/covid19-monitor-germany

The COVID-19 Monitor Germany is an interactive dashboard to give a better overview about the pandemic situation in Germany. It provides a multitude of plots and daily calculated figures. The data used come from official sources. On the one hand from the Robert-Koch-Institut (RKI), on the other hand from the Intensivregister.

Language: Python - Size: 39 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 4 - Forks: 2

Cloud-SPAN/genomics04-data-preparation-organisation

Data Preparation & Organisation

Language: Python - Size: 51.2 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

MostafaToema/Titanic-Survival-Predection-using-Sklearn

Data preparation, visualization and feature engineering and classification of survival people

Language: Jupyter Notebook - Size: 101 KB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

abrazinskas/machine-learning-data-pipeline

Pipeline module for parallel real-time data processing for machine learning models development and production purposes.

Size: 3.41 MB - Last synced at: 17 days ago - Pushed at: over 5 years ago - Stars: 22 - Forks: 2

alkashef/cleaning-excel-data

Tidying and cleaning data in Excel sheets

Size: 2.93 KB - Last synced at: about 2 years ago - Pushed at: almost 6 years ago - Stars: 2 - Forks: 0

dspanizzo/open-data

Coleta e preparação de dados abertos.

Language: Jupyter Notebook - Size: 426 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 2

RodolfoLSS/machine_learning_classification

Machine Learning modelling for a classification problem.

Language: Jupyter Notebook - Size: 5.56 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

ashishpatel26/Audio-Classification-Data-Preparation

Dynamic Data-set preparation for audio, video, images

Language: Jupyter Notebook - Size: 73.3 MB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 2 - Forks: 2

dikoharyadhanto/Data-Preparation-Documentation

Dokumentasi Pembelajaran Tahap Data Cleansing

Language: HTML - Size: 900 KB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

Data-Wrangling-with-JavaScript/Chapter-6

Code examples for Chapter 6 of Data Wrangling with JavaScript

Language: JavaScript - Size: 154 KB - Last synced at: about 1 year ago - Pushed at: almost 3 years ago - Stars: 2 - Forks: 0

IBMDeveloperMEA/Speed-up-your-Data-Cleansing-with-Data-Refinery

We're going to take a quick tour of the Data Refinery tool. Data Refinery can quickly filter and mutate data, create quick visualizations, and do other data cleansing tasks from an easy to use user interface.

Size: 8.88 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

alihanozz/damp

A pre-machine-learning model package

Language: Python - Size: 49.8 KB - Last synced at: 25 days ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

wefindx/metaform

A utility for defining metadata for data types and formats.

Language: Python - Size: 2.15 MB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 5 - Forks: 0

samuelsoaress/Challenge-Neural-Networks-Capgemini

this repository contains the code used to develop the whale breed recognition challenge

Language: Jupyter Notebook - Size: 393 KB - Last synced at: 27 days ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

ahmadtc1/datasetBuilder

🗂 Simple and convenient dataset generation at the press of a key

Language: Python - Size: 13 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 3 - Forks: 0

prabhatk579/natural-language-processing

Applying Natural Language Processing (NLP) to simple conversations such that result is a computer capable of "understanding" the contents of the conversations.

Language: Jupyter Notebook - Size: 120 KB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

jranaraki/NCBIdataPrep

An R code to convert NCBI data files into CSV

Language: R - Size: 25.4 KB - Last synced at: almost 2 years ago - Pushed at: over 5 years ago - Stars: 3 - Forks: 1

vintage-maeve/data-quality

A notebook used for assessing and for improving the quality of an uploaded data set.

Language: Jupyter Notebook - Size: 13 MB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 1

diegoct6/Machine-Learning

Predictive modeling projects for online competitions(Kaggle & DrivenData) and assignments from the Master in Business Analytics & Big Data at IE HST.

Language: Jupyter Notebook - Size: 2.79 MB - Last synced at: 8 months ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

AithraSpandagou/Tweets_on_BlackLivesMatter_BLM

In this project, we measure how the number of tweets, using the hashtags: #BlackLivesMatter, fluctuate in response to the Dutch elections

Language: Jupyter Notebook - Size: 20.1 MB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 2

mohawk2/data-prepare

Module to prepare CSV (etc) data for automatic processing

Language: Perl - Size: 170 KB - Last synced at: about 1 month ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

Rawan19/Hotel-bookings-analysis-using-R

Language: HTML - Size: 1.69 MB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

RosanaFSS/Data-Visualization-Nanodegree

Data Visualization Nanodegree

Size: 5.36 MB - Last synced at: 2 months ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

bharatsdev/production-ready-model

Make machine learning application production ready

Language: Python - Size: 143 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 4 - Forks: 6

vdeni/rdionica

Osnove pripreme podataka za obradu koristeći R

Language: R - Size: 10.3 MB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

a-paxton/scale-independent-aggression

Code for "Scale-Independent Aggression" (Blau & Paxton, 2020, Complexity)

Language: R - Size: 1.11 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

jabhinav/Data-Science-and-ML-for-Structured-Data-Classification

Repo contains scripts to perform data analysis on structure data. It also provides a comparison of various ML algorithms at different stages of data preparation.

Language: Jupyter Notebook - Size: 522 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

monomest/Kaldi_CU

Scripts for training acoustic and language models using CU Kids' Speech Corpus. Also includes the data preparation scripts.

Language: Shell - Size: 97.3 MB - Last synced at: almost 2 years ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

monomest/Kaldi_myST

Scripts for training acoustic and language models using myST Kids' Speech Corpus. Also includes data preparation scripts for the myST Kids' Speech Corpus. This corpus is a speech corpus with 10 times more English speech data than all other English children's speech corpora combined.

Language: Shell - Size: 14.2 MB - Last synced at: almost 2 years ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

Related Keywords
data-preparation 319 python 80 machine-learning 79 data-preprocessing 73 data-analysis 68 data-science 68 data-visualization 60 data-cleaning 52 pandas 34 exploratory-data-analysis 30 feature-engineering 22 deep-learning 22 classification 19 data 19 numpy 18 data-wrangling 17 sql 16 matplotlib 16 data-processing 15 python3 15 seaborn 14 r 14 logistic-regression 14 eda 13 scikit-learn 12 machine-learning-algorithms 11 random-forest 10 tableau 10 jupyter-notebook 9 linear-regression 9 regression 9 clustering 9 tensorflow 9 predictive-modeling 9 data-manipulation 8 nlp 8 data-analytics 8 statistics 8 data-mining 8 dataset 7 data-cleansing 7 feature-selection 7 excel 7 visualization 7 neural-network 7 image-processing 7 data-collection 6 text-processing 6 artificial-intelligence 6 data-transformation 6 statistical-analysis 6 data-engineering 6 neural-networks 6 feature-extraction 6 opencv 6 preprocessing 5 pca 5 docker 5 data-exploration 5 datasets 5 plotly 5 supervised-learning 5 keras 5 data-visualisation 5 data-quality 5 dashboard 5 time-series-analysis 5 pytorch 4 data-normalization 4 analytics 4 random-forest-classifier 4 svm-classifier 4 natural-language-processing 4 named-entity-recognition 4 mysql 4 data-modeling 4 sklearn 4 computer-vision 4 pipeline 4 analysis 4 large-language-models 4 sentiment-analysis 4 hypothesis-testing 4 web-scraping 4 decision-tree-classifier 4 deep-neural-networks 4 powerbi 4 train-test-split 4 image-classification 4 streamlit 4 decission-tree 4 model-training-and-evaluation 4 ml 4 data-prep 4 missing-values 4 model-deployment 3 cnn-classification 3 classification-model 3 imputation 3 feature-scaling 3