An open API service providing repository metadata for many open source software ecosystems.

Topic: "data-preprocessing"

KhaledAshrafH/Logistic-Regression

This program implements logistic regression from scratch using the gradient descent algorithm in Python to predict whether customers will purchase a new car based on their age and salary.

Language: Python - Size: 12.7 KB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 8 - Forks: 3

alotlikar1010/PW-Skills-Data-Master-Assignment

Assignment Solution of PW Skills Data Master Course

Language: Jupyter Notebook - Size: 1.77 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 8 - Forks: 5

basiralab/Kaggle-BrainNetPrediction-Toolbox

A Python toolbox for predicting brain network (graph) evolution over time from a single observation. The codes of the 20 competing Kaggle teams along with the competition datasets are made available.

Language: Python - Size: 4.92 MB - Last synced at: 3 months ago - Pushed at: almost 3 years ago - Stars: 8 - Forks: 3

d4rk-lucif3r/LuciferML 📦

Semi-Auto Machine Learning Library by d4rk-lucif3r

Language: Python - Size: 2.52 MB - Last synced at: 9 months ago - Pushed at: about 3 years ago - Stars: 8 - Forks: 6

DataPreprocessing/DataCleaning

Data Cleaning is a Python package for data preprocessing. It cleans a CSV file and returns the cleaned data frame, handling imputation, duplicate removal, special-character replacement, and more.

Language: Python - Size: 117 KB - Last synced at: about 1 month ago - Pushed at: about 4 years ago - Stars: 8 - Forks: 3

TatevKaren/Deep-Learning-for-Data-Science

Deep Learning Case Studies with TensorFlow and Keras for Beginners-Advanced: ANN, CNN, RNN, Self-Organizing Maps, Boltzmann Machines, Stacked Autoencoders

Language: Python - Size: 2.54 MB - Last synced at: 3 months ago - Pushed at: over 4 years ago - Stars: 8 - Forks: 2

seoulai/mdml

Medical data and machine learning group

Language: Jupyter Notebook - Size: 6.74 MB - Last synced at: about 1 year ago - Pushed at: over 5 years ago - Stars: 8 - Forks: 0

mansipatel2508/Yelp-Review-Stars-Prediction-with-Machine-Learning

This project covers text vectorization and big-data handling: merging and cleaning the text and selecting the required columns. It boosts performance through feature extraction and parameter tuning for neural networks, and compares the performance of different models, treating the problem as both classification and regression.

Language: Jupyter Notebook - Size: 3.68 MB - Last synced at: 15 days ago - Pushed at: almost 6 years ago - Stars: 8 - Forks: 3

DivyaKrishnani/Data-Preprocessing-with-Python

Implementation of Data Preprocessing techniques such as handling missing values, noise smoothing, PCA, etc.

Language: Jupyter Notebook - Size: 1.64 MB - Last synced at: 7 months ago - Pushed at: over 6 years ago - Stars: 8 - Forks: 11

MahtaFetrat/GPTInformal-Persian-Speech-Dataset

A free licensed Persian TTS dataset including 6+ hours of audio-text pairs with subject

Size: 4.88 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 7 - Forks: 0

nhauber99/degradr

Python library for realistically degrading images.

Language: Python - Size: 8.7 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 7 - Forks: 0

Ruban2205/Machine_learning_fundamentals

This repository contains a collection of fundamental topics and techniques in machine learning. It aims to provide a comprehensive understanding of various aspects of machine learning through simplified notebooks. Each topic is covered in a separate notebook, allowing for easy exploration and learning.

Language: Jupyter Notebook - Size: 2.6 MB - Last synced at: 3 months ago - Pushed at: almost 2 years ago - Stars: 7 - Forks: 0

KhaledAshrafH/Linear-Regression

This program implements linear regression from scratch using the gradient descent algorithm in Python. It predicts car prices based on selected features and uses a dataset of cars with their respective prices.

Language: Python - Size: 19.5 KB - Last synced at: 3 months ago - Pushed at: almost 2 years ago - Stars: 7 - Forks: 0

miraehab/FIFA-23-ML-Project

Predict the Position of a Player Based on his skills and group the players based on their similarities.

Language: Jupyter Notebook - Size: 57.3 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 7 - Forks: 0

anshul1004/TweetsClustering

Clustering similar tweets using K-means clustering algorithm and Jaccard distance metric

Language: Python - Size: 3.32 MB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 7 - Forks: 4

sharmaroshan/Coursera-Reviews-Analysis

A natural language processing problem in which we determine the sentiment of users who reviewed a course and then classify the reviews as positive or negative.

Language: Jupyter Notebook - Size: 20.1 MB - Last synced at: over 1 year ago - Pushed at: about 6 years ago - Stars: 7 - Forks: 3

ahmednabil950/JOBS-Information-Extraction

Project to extract entities from Job Description Articles.

Size: 2.14 MB - Last synced at: 5 months ago - Pushed at: over 6 years ago - Stars: 7 - Forks: 1

xueyouluo/cn-data-enhance

Data augmentation using translation techniques.

Language: Python - Size: 11.7 KB - Last synced at: about 1 year ago - Pushed at: almost 7 years ago - Stars: 7 - Forks: 3

partoftheorigin/predict-blog-author-features

Predicts gender, age, label, and zodiac sign of the writer from the given text.

Language: Python - Size: 56.6 KB - Last synced at: over 2 years ago - Pushed at: over 7 years ago - Stars: 7 - Forks: 2

Clearbox-AI/preprocessor

A fast and flexible data preprocessor based on Polars.

Language: Python - Size: 1.88 MB - Last synced at: 24 days ago - Pushed at: 24 days ago - Stars: 6 - Forks: 0

sergio11/spam_email_classifier_lstm

This project uses a Bi-directional LSTM model 📧🤖 to classify emails as spam or legitimate, utilizing NLP techniques like tokenization, padding, and stopword removal. It aims to create an effective email classifier 💻📊 while addressing overfitting with strategies like early stopping 🚫.

Language: Jupyter Notebook - Size: 3.2 MB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 6 - Forks: 0

sawallesalfo/Deployment_Data_Science_Project

Deploy machine learning models locally, on a server, via an API, in the cloud, or with Docker

Language: Python - Size: 4.15 MB - Last synced at: 2 months ago - Pushed at: 4 months ago - Stars: 6 - Forks: 3

marcusGH/edain_paper

Contains the implementation of the EDAIN and EDAIN-KL methods proposed in our paper. The research was also part of the MSc thesis I wrote in collaboration with American Express as part of my MSc in Statistics (Data Science) at Imperial College London

Language: Jupyter Notebook - Size: 26.6 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 6 - Forks: 0

Eden-Kramer-Lab/loren_frank_data_processing

Python tools for reading in data from Loren Frank's lab

Language: Jupyter Notebook - Size: 5.32 MB - Last synced at: 11 months ago - Pushed at: almost 2 years ago - Stars: 6 - Forks: 1

Afkerian/Beijing-Multi-Site-Air-Quality-Data-Data-Set

The present project aims to predict air pollution in Beijing, China, using the data set "Beijing Multi-Site Air-Quality Data Data Set"

Language: Jupyter Notebook - Size: 36.2 MB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 6 - Forks: 1

shamspias/gpt3-data-preprocessing

This repository contains code for preprocessing text data from PDF and DOCX files for use with GPT-3. It includes steps such as tokenization, removal of stop words and punctuation, and formatting for GPT-3 input.

Language: Python - Size: 11.7 KB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 6 - Forks: 1

xShaimaa/Udacity-Data-Analysis-and-Viz-with-Microsoft-Power-BI

Projects for Udacity's Data Analysis and Visualization with Microsoft Power BI, covering data pre-processing, visualization, and analysis with Power BI as the primary tool.

Size: 24.4 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 6 - Forks: 1

prakhargurawa/Product-Category-Prediction

Machine learning pipeline to predict relevant categories based on a product name and product description, using NLP and Deep learning concepts.

Language: Jupyter Notebook - Size: 12 MB - Last synced at: over 2 years ago - Pushed at: almost 4 years ago - Stars: 6 - Forks: 2

earthat/SMOTE-over-Sampling

This repository is for MATLAB code for balancing of multiclass data by SMOTE

Language: MATLAB - Size: 76.2 KB - Last synced at: over 2 years ago - Pushed at: over 6 years ago - Stars: 6 - Forks: 2

xuefeng-xu/fedps

Federated data Preprocessing via aggregated Statistics

Language: Python - Size: 241 KB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 5 - Forks: 1

RenatoMaynard/Supervised-Machine-Learning-Models-Pytorch-Sklearn

This repository provides a comprehensive implementation of supervised machine learning models using PyTorch and Scikit-learn. It includes end-to-end workflows for both classification and regression tasks, covering data preprocessing, model training, evaluation, and comparison between traditional ML models

Size: 1.95 KB - Last synced at: 5 days ago - Pushed at: 4 months ago - Stars: 5 - Forks: 0

Armanx200/Animal-Detector

๐Ÿพ Training a machine learning model to recognize 15 different animal classes and classify images accordingly.

Language: Python - Size: 35.1 MB - Last synced at: 2 days ago - Pushed at: 10 months ago - Stars: 5 - Forks: 1

AnshulOP/A-Z-Machine-Learning

This repository contains code covering machine learning concepts. Each example is provided from start to end, with a systematic view of each concept that you will need in your journey of learning ML.

Language: Jupyter Notebook - Size: 16.1 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 5 - Forks: 1

Samashi47/ML-Toolkit-Project

A general-purpose toolkit for data preprocessing, machine learning modeling, and visualization.

Language: Python - Size: 368 KB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 5 - Forks: 3

supriya811106/WhatsApp-Chat-Analyzer-App

Analyze WhatsApp chats with Python, Streamlit, and data visualization. Explore messaging patterns, content trends, and emoji usage to uncover insights from your conversations.

Language: Python - Size: 2.44 MB - Last synced at: about 2 months ago - Pushed at: about 1 year ago - Stars: 5 - Forks: 1

opencodeiiita/News_Scraping 📦

Language: Jupyter Notebook - Size: 139 KB - Last synced at: 4 months ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 22

Samia35-2973/Learning-Data-Science

📚 This repository is my personal data science learning hub. Explore my journey from the very basics to advanced techniques. Dive into Python, data manipulation, analysis, visualization, and machine learning. Join me as I learn, grow, and experiment in the world of data science.

Language: Jupyter Notebook - Size: 79.1 KB - Last synced at: 4 days ago - Pushed at: almost 2 years ago - Stars: 5 - Forks: 0

IBM-EPBL/IBM-Project-17145-1659629015

Smart Lender - Applicant Credibility Prediction for Loan Approval

Language: Jupyter Notebook - Size: 19.8 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 5 - Forks: 4

karamolegkos/EverAnalyzer

EverAnalyzer is my thesis in the Department of Digital Systems of the University of Piraeus. EverAnalyzer is a platform for collecting, preprocessing, processing and analyzing Big Data from the Twitter platform.

Language: HTML - Size: 761 MB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 5 - Forks: 0

MsnAmiri/Customer-Segmentation-with-IBM-SPSS-Modeler

Creating a banking customer segmentation dataset using 3 initial datasets in the IBM SPSS environment

Size: 1.02 MB - Last synced at: 6 months ago - Pushed at: almost 3 years ago - Stars: 5 - Forks: 0

AtlasmanYevgenii/Practicum100-by-Yandex

The course covered topics such as Data Preprocessing, Exploratory Data Analysis (EDA), Statistical Data Analysis (SDA), Data Collection and Storage (PostgreSQL), Business Analytics, Making Business Decisions Based on Data (Hypotheses testing), How to Tell a Story Using Data (Presentation and Data Visualization - Matplotlib, Seaborn, Plotly), Automation (Dash, Tableau), Forecasts and Predictions, and 2 integrated projects.

Language: Jupyter Notebook - Size: 4.99 MB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 5 - Forks: 0

kozodoi/dptools

Python package with utilities for data processing, aggregation, feature engineering and data versioning

Language: Python - Size: 108 KB - Last synced at: about 1 month ago - Pushed at: about 3 years ago - Stars: 5 - Forks: 2

Lokesh-Attarde/Swiggy_Data_Analysis

iNeuron Project - Swiggy Data Analysis, automated EDA & reports, Visualizing the variety of Use Cases along with WordCloud on Textual Data.

Language: HTML - Size: 7.04 MB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 5 - Forks: 0

sdhani/face-mask-detection

😷 Machine learning models that determine whether an individual is wearing a face mask, not wearing one, or wearing one incorrectly. Course project

Language: Jupyter Notebook - Size: 13.7 MB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 5 - Forks: 6

shreyas219/creditCardFraudDetection

Credit Card Fraud Detection using machine learning

Language: Jupyter Notebook - Size: 69.3 KB - Last synced at: over 2 years ago - Pushed at: over 6 years ago - Stars: 5 - Forks: 6

sanjeevai/sparkify-capstone

Data Analysis in Spark to Identify Customer Churn for a fictional music service (like Spotify)

Language: Jupyter Notebook - Size: 6.47 MB - Last synced at: over 2 years ago - Pushed at: over 6 years ago - Stars: 5 - Forks: 3

trnet4334/img_colorization

This project uses Keras and Python to convert a grayscale image to color without any additional information.

Language: Python - Size: 20 MB - Last synced at: over 2 years ago - Pushed at: over 7 years ago - Stars: 5 - Forks: 1

lykmapipo/US-Gas-Prices

Python scripts that scrape US gas prices

Language: Python - Size: 2.28 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 4 - Forks: 1

Amirreza81/Applied-Data-Science-Course

Comprehensive notes, practical exercises, and problem-solving solutions from the Applied Data Science course, covering data preprocessing, machine learning algorithms, statistical analysis, data visualization, and real-world applications.

Language: Jupyter Notebook - Size: 5.58 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 4 - Forks: 0

sergio11/online_payment_fraud

Fraud detection using Deep Neural Networks to predict fraudulent transactions in financial data. 🚨🤖 Complete process from EDA and data preprocessing to model training and evaluation. 📊🔍

Language: Jupyter Notebook - Size: 1.77 MB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 4 - Forks: 2

mayurasandakalum/breast-cancer-detection

Code for classifying breast cancer tumors using machine learning. Includes preprocessing, visualizations, and models like Logistic Regression, Decision Tree, and Random Forest. Evaluated with accuracy, precision, recall, and F1-score. Clone, install dependencies, and run the Jupyter notebook for full analysis.

Language: Jupyter Notebook - Size: 3.31 MB - Last synced at: 14 days ago - Pushed at: 12 months ago - Stars: 4 - Forks: 0

swapnita-pandey/Identification-of-Edited-Images

Detection of Human Edited Images using CNN, VGG16, Xception, ELA, Ensemble Learning.

Language: Jupyter Notebook - Size: 32.7 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 4 - Forks: 0

sondosaabed/Preprocessing-for-Machine-Learning-in-Python

DataCamp intermediate course on how and when to perform data preprocessing in any machine learning project to get the data ready for modeling

Language: Jupyter Notebook - Size: 1.34 MB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 4 - Forks: 0

Armanx200/Fruit-Detector

๐Ÿ‡๐Ÿ” Fruit Detector: A machine learning model to identify fruits from images, powered by TensorFlow and Keras. Train the model, predict fruits, and explore the world of AI fruit recognition! ๐Ÿ“๐Ÿ

Language: Python - Size: 110 MB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 4 - Forks: 0

yashpandey474/Identification-of-fake-reviews

Fake review detection using machine learning and deep learning techniques such as CNNs, SOMs, K-means clustering, various supervised models and natural language processing tools such as Word2Vec & TFIDF, GloVe etc.

Language: Jupyter Notebook - Size: 36.8 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 1

nafisalawalidris/Predicting-Credit-Card-Approvals

Explore credit card approval prediction through data analysis and machine learning. Preprocess data, train logistic regression models, and optimize hyperparameters. Learn data preprocessing, feature engineering, model training, and evaluation. Dive into the world of machine learning with Python and popular libraries.

Language: Python - Size: 33.2 KB - Last synced at: 3 months ago - Pushed at: about 2 years ago - Stars: 4 - Forks: 0

WalidAlsafadi/UFO-Sighting

UFO Sightings across the world!

Language: Jupyter Notebook - Size: 6.1 MB - Last synced at: 4 months ago - Pushed at: about 2 years ago - Stars: 4 - Forks: 0

spChalk/Vaccine-Sentiment-Classifier

💉 Vaccine Sentiment Classifier is a deep learning classifier trained on real-world Twitter data that distinguishes 3 types of tweets: Neutral, Anti-vax & Pro-vax.

Language: Jupyter Notebook - Size: 10 MB - Last synced at: almost 2 years ago - Pushed at: almost 3 years ago - Stars: 4 - Forks: 0

AmirAli5/Data-Preprocessing

This repo is all about data preprocessing. Data preprocessing is a required first step before any machine learning machinery can be applied, because the algorithms learn from the data, and the learning outcome heavily depends on having the proper data needed to solve a particular problem, known as features. Examples of data preprocessing include cleaning, instance selection, normalization, one-hot encoding, transformation, feature extraction and selection, etc.

Language: Jupyter Notebook - Size: 4.51 MB - Last synced at: 2 months ago - Pushed at: almost 3 years ago - Stars: 4 - Forks: 0

MsnAmiri/Customer-Loyalty-Prediction-in-Python

A machine learning and deep learning project from ground up.

Language: Jupyter Notebook - Size: 1.12 MB - Last synced at: 7 months ago - Pushed at: almost 3 years ago - Stars: 4 - Forks: 0

prkskrs/Exploratory-Data-Analysis

Well-organized exploratory data analysis notebooks (beginner to advanced)

Language: Jupyter Notebook - Size: 3.45 MB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 4 - Forks: 1

kbelisar/datalark

Like the mudlark finding treasures on the foreshore, the datalark seeks treasures hidden within messy data!

Language: R - Size: 32.2 KB - Last synced at: 3 months ago - Pushed at: over 3 years ago - Stars: 4 - Forks: 0

yashpatel-py/earthquake_prediction

Language: Jupyter Notebook - Size: 1.75 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 4 - Forks: 0

NatthanonNon/NOAA-grb2

GRIB2 Tutorial with NOAA dataset

Language: Jupyter Notebook - Size: 139 MB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 4 - Forks: 0

toukirnaim08/Rainfall-Prediction-Australian-Weather-Dataset

Rainfall prediction from Australian weather dataset using multiple machine learning models.

Language: R - Size: 5.88 MB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 4 - Forks: 0

carrdelling/multilabel_split

Sample algorithm for stratified train/test split in multi-label problems

Language: Python - Size: 4.88 KB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 4 - Forks: 0

recepayddogdu/Data-Science

I store here some of the resources I have studied to improve myself in the field of data science. You can find my study notes in the Notes folder.

Language: Jupyter Notebook - Size: 458 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 4 - Forks: 3

turhancan97/Python_for_DataScience

Basic Python Code for Data Science

Language: Jupyter Notebook - Size: 6.11 MB - Last synced at: over 2 years ago - Pushed at: almost 5 years ago - Stars: 4 - Forks: 3

stxupengyu/Credit-Data-Analysis

Implements data preprocessing and data analysis for credit data, then uses a variety of classification algorithms to predict whether a company will default.

Language: Jupyter Notebook - Size: 39.1 MB - Last synced at: 12 months ago - Pushed at: almost 5 years ago - Stars: 4 - Forks: 0

Ashleshk/Power-BI-A-Z-Hands-On-Power-BI-Training-For-Data-Science-Udemy

Learn data visualization through Microsoft Power BI and create opportunities for you or key decision makers to discover data patterns such as customer purchase behavior, sales trends, or production bottlenecks. You'll learn all of the features in Power BI that allow you to explore, experiment with, fix, prepare, and present data easily, quickly, and beautifully.

Size: 5.82 MB - Last synced at: over 2 years ago - Pushed at: almost 5 years ago - Stars: 4 - Forks: 0

konradmalik/ann-laminar-burning-velocity 📦

Models trained in my article on LBV predictions.

Language: C - Size: 3.91 KB - Last synced at: over 1 year ago - Pushed at: about 5 years ago - Stars: 4 - Forks: 0

podondra/data-preprocessing

data preprocessing examples

Language: Jupyter Notebook - Size: 4.64 MB - Last synced at: almost 2 years ago - Pushed at: over 5 years ago - Stars: 4 - Forks: 4

Vishal1999-33/K-Nearest-Neighbors

Credit Card Fraud Detection

Language: Jupyter Notebook - Size: 524 KB - Last synced at: 6 months ago - Pushed at: over 5 years ago - Stars: 4 - Forks: 0

djgarcia/PCARD

PCARD Ensemble classifier for Big Data

Language: Scala - Size: 19.5 KB - Last synced at: over 1 year ago - Pushed at: almost 6 years ago - Stars: 4 - Forks: 2

MrKhan0747/New-York-City-Taxi-Trips-Fare

This project uses a New York City taxi dataset to predict the fare of the next trip. The dataset can be downloaded from https://www.kaggle.com/kentonnlp/2014-new-york-city-taxi-trips and contains 2 crore (20 million) records and 8 features, along with GPS coordinates of pickup and dropoff

Language: Jupyter Notebook - Size: 2.43 MB - Last synced at: over 1 year ago - Pushed at: about 6 years ago - Stars: 4 - Forks: 2

bharathgs/dframeutils

simple utility tools for dataframes in Python || WIP ||

Language: Python - Size: 98.6 KB - Last synced at: 17 days ago - Pushed at: over 7 years ago - Stars: 4 - Forks: 1

geograz/TBM_advance_classification

Code and data repository for the paper "Challenges and Opportunities of Data-Driven Advance Classification for Hard Rock TBM excavations"

Language: Python - Size: 526 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 3 - Forks: 1

dvamsidhar2002/Audio-Data-Analysis-Techniques

This repository contains data preprocessing and analysis techniques for audio data.

Language: Jupyter Notebook - Size: 3.64 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 3 - Forks: 2

stefanherdy/SAM-GAN-Clustering

Implementation of Meta AI's Segment Anything Model to do an automated image annotation of simple microscope images and a modified GAN to cluster the preprocessed images according to their qualitative features

Language: Python - Size: 245 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 3 - Forks: 0

MvMukesh/ProblemSolving-FrameWork-ML

Detailed industry specific framework to solve Machine Learning Problem

Language: Python - Size: 139 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 3 - Forks: 0

SamSamhuns/ml_data_processing

ML data processing (For Computer Vision)

Language: Python - Size: 5.59 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 3 - Forks: 0

Suhrud1511/DataLens

DataLens is a robust online tool for automatic dataset preprocessing. With just a click, generate detailed visualizations and comprehensive reports. Simplify your data preparation and uncover insights instantly.

Language: TypeScript - Size: 3.5 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 3 - Forks: 1

psmyth94/biosets

A bioinformatics extension of the 🤗 Datasets library, built for ML applications on biological and omics data, offering easy integration of metadata and low-code data management tools.

Language: Python - Size: 279 KB - Last synced at: 11 days ago - Pushed at: 8 months ago - Stars: 3 - Forks: 0

Armanx200/Income-Predictor

๐Ÿ”โœจ A machine learning project that predicts income based on various demographic factors using Random Forest and Gradient Boosting algorithms. Includes data preprocessing, hyperparameter tuning, and model evaluation with detailed performance metrics. ๐Ÿ“Š๐Ÿค–

Language: Python - Size: 652 KB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 3 - Forks: 0

ThecoderPinar/Credit-Card-Fraud-Detection-Project

This project focuses on the detection of credit card fraud using various data science and machine learning techniques. The dataset includes a record of credit card transactions over a specific period, with the goal of accurately identifying fraudulent activities. 🚀✨

Language: Jupyter Notebook - Size: 5.02 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 3

TimKong21/Medical-Appointment-No-Show-Prediction

A machine learning solution predicting patient no-shows in healthcare appointments. This project integrates EDA, data processing, feature engineering, and XGBoost modeling, with a workflow spanning from Snowflake data retrieval to AWS deployment (S3, SageMaker, Lambda, API Gateway), aiming to enhance appointment management in medical ERP systems.

Language: Jupyter Notebook - Size: 35.4 MB - Last synced at: 2 months ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

MartinKalema/Malaria-In-Africa

This project is aimed at understanding, mitigating, and controlling the impact of malaria in Africa.

Language: Jupyter Notebook - Size: 1.13 MB - Last synced at: 5 months ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

that-coding-kid/Saarthi

The problem statement includes building a voice-powered chatbot for the BHUVAN portal. This project has been selected in Top 50 among various projects from across the country for ISRO'S SIF Hackathon.

Language: Jupyter Notebook - Size: 1.01 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

ehtisham-sadiq/Movie-Recommendation-System

The Movie Recommendation System is an all-encompassing data science project that utilizes sophisticated machine learning methods, including collaborative and content-based filtering, to provide users with personalized movie suggestions based on their preferences and viewing history. I

Language: Jupyter Notebook - Size: 9.12 MB - Last synced at: 25 days ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

LukaszMalucha/Springboard-Insights

Scraping and analyzing latest Springboard offer. Django REST + Vue.js

Language: Python - Size: 2.62 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 0

zeynpaktasli/StrokePrediciton

Stroke prediction with EDA, data preprocessing, model building and sampling

Language: Jupyter Notebook - Size: 126 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 0

Arvindhh931/Mileage-prediction

Fuel Efficiency of car in miles per gallon

Language: Jupyter Notebook - Size: 3.05 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 2

waqarg2001/PakWheels-Data-Analysis

An ETL process and EDA were performed on a used-cars dataset scraped from PakWheels.com. The analysis was done in Jupyter Notebook, and the insights are shared.

Language: Jupyter Notebook - Size: 5.77 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 1

jmoro0408/Spotify_Analysis_2.0

Extracting, analyzing, and using my Spotify listening history to make predictions.

Language: Jupyter Notebook - Size: 10.4 MB - Last synced at: 7 months ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 0

SayamAlt/Stellar-Classification---Sloan-Digital-Sky-Survey-17

Successfully established a machine learning model which can predict an appropriate stellar class, on the basis of a distinct set of spectral characteristics, to a substantially high level of accuracy.

Language: Jupyter Notebook - Size: 16.7 MB - Last synced at: 4 months ago - Pushed at: about 3 years ago - Stars: 3 - Forks: 1

0xmahdirostami/Kaggle

The main focus of this repository is to work on Kaggle projects ("Kaggle, a subsidiary of Google LLC, is an online community of data scientists and machine learning practitioners").

Language: Jupyter Notebook - Size: 19.6 MB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 3 - Forks: 0

adamavip/multitrait-nirs-model

Modeling of nutritional traits from multiple crops using NIRS and machine learning/statistics

Language: Python - Size: 26.6 MB - Last synced at: almost 2 years ago - Pushed at: about 3 years ago - Stars: 3 - Forks: 2

caesarmario/heart-disease-prediction-with-logistic-regression-SAS-studio

Heart disease prediction with logistic regression using SAS Studio. The dataset is taken from UCI Machine Learning about heart disease.

Language: SAS - Size: 907 KB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 0

caesarmario/big-mart-sales-preprocessing-SAS-studio

Data preprocessing, feature engineering, and EDA for "Big Mart Sales" data set using SAS Studio. The dataset is taken from Kaggle (https://www.kaggle.com/mrmorj/big-mart-sales).

Language: SAS - Size: 2.4 MB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 1

arunsinghbabal/Click_through_vs_conversion_classification

The model classifies the advertisements by clicked or not-clicked and later identifies zero, low or high conversion rate.

Language: Jupyter Notebook - Size: 1.32 MB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 0

Related Topics
machine-learning (659), python (491), data-visualization (400), data-science (377), data-analysis (284), pandas (211), data-cleaning (205), exploratory-data-analysis (199), deep-learning (165), feature-engineering (158), numpy (124), classification (122), scikit-learn (118), logistic-regression (110), jupyter-notebook (105), python3 (100), matplotlib (96), machine-learning-algorithms (95), eda (92), seaborn (88), random-forest (75), data-preparation (74), linear-regression (71), feature-selection (68), model-evaluation (68), tensorflow (65), data (63), sklearn (62), natural-language-processing (62), predictive-modeling (62), nlp (57), data-mining (54), regression (52), data-analytics (50), hyperparameter-tuning (49), neural-networks (46), artificial-intelligence (43), clustering (41), supervised-learning (40), keras (39), data-wrangling (38), visualization (36), neural-network (36), pytorch (36), data-processing (35), regression-models (35), feature-extraction (34), streamlit (33), decision-trees (33), random-forest-classifier (31), xgboost (31), computer-vision (31), r (29), ai (28), sql (28), model-training (28), data-engineering (27), cross-validation (27), decision-tree-classifier (26), sentiment-analysis (26), unsupervised-learning (25), k-means-clustering (25), powerbi (24), outlier-detection (24), time-series-analysis (24), time-series (23), decision-tree (22), lstm (22), data-exploration (22), text-classification (22), data-transformation (21), dimensionality-reduction (21), prediction (21), statistical-analysis (21), image-classification (21), data-collection (21), cnn (21), pipeline (20), image-processing (20), convolutional-neural-networks (20), gradient-boosting (20), confusion-matrix (20), model-selection (19), pca (19), data-augmentation (19), data-manipulation (19), naive-bayes-classifier (18), svm (18), flask (18), web-scraping (18), statistics (18), knn (18), tableau (18), regression-analysis (16), ml (16), ensemble-learning (16), plotly (16), deep-neural-networks (16), big-data (16), data-modeling (16)