Topic: "data-preprocessing"
KhaledAshrafH/Logistic-Regression
This program implements logistic regression from scratch using the gradient descent algorithm in Python to predict whether customers will purchase a new car based on their age and salary.
Language: Python - Size: 12.7 KB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 8 - Forks: 3

alotlikar1010/PW-Skills-Data-Master-Assignment
Assignment Solution of PW Skills Data Master Course
Language: Jupyter Notebook - Size: 1.77 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 8 - Forks: 5

basiralab/Kaggle-BrainNetPrediction-Toolbox
A Python toolbox for predicting brain network (graph) evolution over time from a single observation. The codes of the 20 competing Kaggle teams along with the competition datasets are made available.
Language: Python - Size: 4.92 MB - Last synced at: 3 months ago - Pushed at: almost 3 years ago - Stars: 8 - Forks: 3

d4rk-lucif3r/LuciferML ๐ฆ
Semi-Auto Machine Learning Library by d4rk-lucif3r
Language: Python - Size: 2.52 MB - Last synced at: 9 months ago - Pushed at: about 3 years ago - Stars: 8 - Forks: 6

DataPreprocessing/DataCleaning
Data Cleaning is a python package for data preprocessing. This cleans the CSV file and returns the cleaned data frame. It does the work of imputation, removing duplicates, replacing special characters, and many more.
Language: Python - Size: 117 KB - Last synced at: about 1 month ago - Pushed at: about 4 years ago - Stars: 8 - Forks: 3

TatevKaren/Deep-Learning-for-Data-Science
Deep Learning Case Studies with Tensorflow and Keras for Beginners-Advanced: ANN, CNN, RNN, Self-Organizing Maps, Boltzmann Machines, Stacked Autoencoders
Language: Python - Size: 2.54 MB - Last synced at: 3 months ago - Pushed at: over 4 years ago - Stars: 8 - Forks: 2

seoulai/mdml
Medical data and machine learning group
Language: Jupyter Notebook - Size: 6.74 MB - Last synced at: about 1 year ago - Pushed at: over 5 years ago - Stars: 8 - Forks: 0

mansipatel2508/Yelp-Review-Stars-Prediction-with-Machine-Learning
The project has text vectorization, handling big data with merging and cleaning the text and getting the required columns while boosting the performance by feature extraction and parameter tuning for NN, compares the Performances through applied different models treating the problem as classification and regression both.
Language: Jupyter Notebook - Size: 3.68 MB - Last synced at: 15 days ago - Pushed at: almost 6 years ago - Stars: 8 - Forks: 3

DivyaKrishnani/Data-Preprocessing-with-Python
Implementation of Data Preprocessing techniques such as handling missing values, noise smoothing, PCA, etc.
Language: Jupyter Notebook - Size: 1.64 MB - Last synced at: 7 months ago - Pushed at: over 6 years ago - Stars: 8 - Forks: 11

MahtaFetrat/GPTInformal-Persian-Speech-Dataset
A free licensed Persian TTS dataset including 6+ hours of audio-text pairs with subject
Size: 4.88 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 7 - Forks: 0

nhauber99/degradr
Python library for realistically degrading images.
Language: Python - Size: 8.7 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 7 - Forks: 0

Ruban2205/Machine_learning_fundamentals
This repository contains a collection of fundamental topics and techniques in machine learning. It aims to provide a comprehensive understanding of various aspects of machine learning through simplified notebooks. Each topic is covered in a separate notebook, allowing for easy exploration and learning.
Language: Jupyter Notebook - Size: 2.6 MB - Last synced at: 3 months ago - Pushed at: almost 2 years ago - Stars: 7 - Forks: 0

KhaledAshrafH/Linear-Regression
This program implements linear regression from scratch using the gradient descent algorithm in Python. It predicts car prices based on selected features and uses a dataset of cars with their respective prices.
Language: Python - Size: 19.5 KB - Last synced at: 3 months ago - Pushed at: almost 2 years ago - Stars: 7 - Forks: 0

miraehab/FIFA-23-ML-Project
Predict the Position of a Player Based on his skills and group the players based on their similarities.
Language: Jupyter Notebook - Size: 57.3 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 7 - Forks: 0

anshul1004/TweetsClustering
Clustering similar tweets using K-means clustering algorithm and Jaccard distance metric
Language: Python - Size: 3.32 MB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 7 - Forks: 4

sharmaroshan/Coursera-Reviews-Analysis
It is a Natural Language Processing Problem where we have to decide the sentiments of the users who reviewed the course. and then classifying the reviews into positive and negative.
Language: Jupyter Notebook - Size: 20.1 MB - Last synced at: over 1 year ago - Pushed at: about 6 years ago - Stars: 7 - Forks: 3

ahmednabil950/JOBS-Information-Extraction
Project to extract entities from Job Description Articles.
Size: 2.14 MB - Last synced at: 5 months ago - Pushed at: over 6 years ago - Stars: 7 - Forks: 1

xueyouluo/cn-data-enhance
ไฝฟ็จ็ฟป่ฏๆๆฏๅๆฐๆฎๅขๅผบใ
Language: Python - Size: 11.7 KB - Last synced at: about 1 year ago - Pushed at: almost 7 years ago - Stars: 7 - Forks: 3

partoftheorigin/predict-blog-author-features
Predicts gender, age, label, and zodiac sign of the writer from the given text.
Language: Python - Size: 56.6 KB - Last synced at: over 2 years ago - Pushed at: over 7 years ago - Stars: 7 - Forks: 2

Clearbox-AI/preprocessor
A fast and felxible data preprocessor based on polars.
Language: Python - Size: 1.88 MB - Last synced at: 24 days ago - Pushed at: 24 days ago - Stars: 6 - Forks: 0

sergio11/spam_email_classifier_lstm
This project uses a Bi-directional LSTM model ๐ง๐ค to classify emails as spam or legitimate, utilizing NLP techniques like tokenization, padding, and stopword removal. It aims to create an effective email classifier ๐ป๐ while addressing overfitting with strategies like early stopping ๐ซ.
Language: Jupyter Notebook - Size: 3.2 MB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 6 - Forks: 0

sawallesalfo/Deployment_Data_Science_Project
Deploy maching learning model in local / server /API /cloud/ dockers
Language: Python - Size: 4.15 MB - Last synced at: 2 months ago - Pushed at: 4 months ago - Stars: 6 - Forks: 3

marcusGH/edain_paper
Contains the implementation of the EDAIN and EDAIN-KL methods proposed in our paper. The research was also part of the MSc thesis I wrote in collaboration with American Express as part of my MSc in Statistics (Data Science) at Imperial College London
Language: Jupyter Notebook - Size: 26.6 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 6 - Forks: 0

Eden-Kramer-Lab/loren_frank_data_processing
Python tools for reading in data from Loren Frank's lab
Language: Jupyter Notebook - Size: 5.32 MB - Last synced at: 11 months ago - Pushed at: almost 2 years ago - Stars: 6 - Forks: 1

Afkerian/Beijing-Multi-Site-Air-Quality-Data-Data-Set
The present project aims to predict air pollution in Beijing, China, using the data set "Beijing Multi-Site Air-Quality Data Data Set"
Language: Jupyter Notebook - Size: 36.2 MB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 6 - Forks: 1

shamspias/gpt3-data-preprocessing
This repository containing code for preprocessing text data from PDF and DOCX files for use with GPT-3. It includes steps such as tokenization, removal of stop words and punctuation, and formatting for GPT-3 input.
Language: Python - Size: 11.7 KB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 6 - Forks: 1

xShaimaa/Udacity-Data-Analysis-and-Viz-with-Microsoft-Power-BI
Projects for Udacity's Data Analysis and Visualization with Microsoft power BI that covers: data pre-processing, visualization, and analysis using Power BI as primary tool.
Size: 24.4 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 6 - Forks: 1

prakhargurawa/Product-Category-Prediction
Machine learning pipeline to predict relevant categories based on a product name and product description, using NLP and Deep learning concepts.
Language: Jupyter Notebook - Size: 12 MB - Last synced at: over 2 years ago - Pushed at: almost 4 years ago - Stars: 6 - Forks: 2

earthat/SMOTE-over-Sampling
This repository is for MATLAB code for balancing of multiclass data by SMOTE
Language: MATLAB - Size: 76.2 KB - Last synced at: over 2 years ago - Pushed at: over 6 years ago - Stars: 6 - Forks: 2

xuefeng-xu/fedps
Federated data Preprocessing via aggregated Statistics
Language: Python - Size: 241 KB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 5 - Forks: 1

RenatoMaynard/Supervised-Machine-Learning-Models-Pytorch-Sklearn
This repository provides a comprehensive implementation of supervised machine learning models using PyTorch and Scikit-learn. It includes end-to-end workflows for both classification and regression tasks, covering data preprocessing, model training, evaluation, and comparison between traditional ML models
Size: 1.95 KB - Last synced at: 5 days ago - Pushed at: 4 months ago - Stars: 5 - Forks: 0

Armanx200/Animal-Detector
๐พ Training a machine learning model to recognize 15 different animal classes and classify images accordingly.
Language: Python - Size: 35.1 MB - Last synced at: 2 days ago - Pushed at: 10 months ago - Stars: 5 - Forks: 1

AnshulOP/A-Z-Machine-Learning
This repository contains the code related to machine learning knowledge. Each code has been provided from start to end with systematical vew of each concept that you will need in your journey of learning ML.
Language: Jupyter Notebook - Size: 16.1 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 5 - Forks: 1

Samashi47/ML-Toolkit-Project
A general-purpose toolkit for data preprocessing, machine learning modeling, and visualization.
Language: Python - Size: 368 KB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 5 - Forks: 3

supriya811106/WhatsApp-Chat-Analyzer-App
Analyze WhatsApp chats with Python, Streamlit, and data visualization. Explore messaging patterns, content trends, and emoji usage to uncover insights from your conversations.
Language: Python - Size: 2.44 MB - Last synced at: about 2 months ago - Pushed at: about 1 year ago - Stars: 5 - Forks: 1

opencodeiiita/News_Scraping ๐ฆ
Language: Jupyter Notebook - Size: 139 KB - Last synced at: 4 months ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 22

Samia35-2973/Learning-Data-Science
๐ This repository is my personal data science learning hub. Explore my journey from the very basics to advanced techniques. Dive into Python, data manipulation, analysis, visualization, and machine learning. Join me as I learn, grow, and experiment in the world of data science.
Language: Jupyter Notebook - Size: 79.1 KB - Last synced at: 4 days ago - Pushed at: almost 2 years ago - Stars: 5 - Forks: 0

IBM-EPBL/IBM-Project-17145-1659629015
Smart Lender - Applicant Credibility Prediction for Loan Approval
Language: Jupyter Notebook - Size: 19.8 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 5 - Forks: 4

karamolegkos/EverAnalyzer
EverAnalyzer is my thesis in the Department of Digital Systems of the University of Piraeus. EverAnalyzer is a platform for collecting, preprocessing, processing and analyzing Big Data from the Twitter platform.
Language: HTML - Size: 761 MB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 5 - Forks: 0

MsnAmiri/Customer-Segmentation-with-IBM-SPSS-Modeler
Creating a banking customer segmentation dataset using 3 initial datasets in the IBM SPSS environment
Size: 1.02 MB - Last synced at: 6 months ago - Pushed at: almost 3 years ago - Stars: 5 - Forks: 0

AtlasmanYevgenii/Practicum100-by-Yandex
Course included such topics, as Data Preprocessing, Exploratory Data Analysis (EDA), Statistical Data Analysis (SDA), Data Collection and Storage (PostgreSQL), Business Analytics, Making Business Decisions Based on Data (Hypotheses testing), How to Tell a Story Using Data (Presentation and Data Visualization - Maplotlib, Seaborn, Plotly), Automation (Dash, Tableau), Forecasts and Predictions, 2 Integrated projects.
Language: Jupyter Notebook - Size: 4.99 MB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 5 - Forks: 0

kozodoi/dptools
Python package with utilities for data processing, aggregation, feature engineering and data versioning
Language: Python - Size: 108 KB - Last synced at: about 1 month ago - Pushed at: about 3 years ago - Stars: 5 - Forks: 2

Lokesh-Attarde/Swiggy_Data_Analysis
iNeuron Project - Swiggy Data Analysis, automated EDA & reports, Visualizing the variety of Use Cases along with WordCloud on Textual Data.
Language: HTML - Size: 7.04 MB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 5 - Forks: 0

sdhani/face-mask-detection
๐ท Machine learning models determining whether or not an individual is wearing a face mask, not wearing a face mask, or wearing a face mask incorrectly. Course project
Language: Jupyter Notebook - Size: 13.7 MB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 5 - Forks: 6

shreyas219/creditCardFraudDetection
Credit Card Fraud Detection using machine learning
Language: Jupyter Notebook - Size: 69.3 KB - Last synced at: over 2 years ago - Pushed at: over 6 years ago - Stars: 5 - Forks: 6

sanjeevai/sparkify-capstone
Data Analysis in Spark to Identify Customer Churn for a fictional music service(like Spotify)
Language: Jupyter Notebook - Size: 6.47 MB - Last synced at: over 2 years ago - Pushed at: over 6 years ago - Stars: 5 - Forks: 3

trnet4334/img_colorization
This project uses Keras and Python to convert a grayscale image to color without any additional information.
Language: Python - Size: 20 MB - Last synced at: over 2 years ago - Pushed at: over 7 years ago - Stars: 5 - Forks: 1

lykmapipo/US-Gas-Prices
Python scripts that scrape US gas prices
Language: Python - Size: 2.28 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 4 - Forks: 1

Amirreza81/Applied-Data-Science-Course
Comprehensive notes, practical exercises, and problem-solving solutions from the Applied Data Science course, covering data preprocessing, machine learning algorithms, statistical analysis, data visualization, and real-world applications.
Language: Jupyter Notebook - Size: 5.58 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 4 - Forks: 0

sergio11/online_payment_fraud
Fraud detection using Deep Neural Networks to predict fraudulent transactions in financial data. ๐จ๐ค Complete process from EDA and data preprocessing to model training and evaluation. ๐๐
Language: Jupyter Notebook - Size: 1.77 MB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 4 - Forks: 2

mayurasandakalum/breast-cancer-detection
Code for classifying breast cancer tumors using machine learning. Includes preprocessing, visualizations, and models like Logistic Regression, Decision Tree, and Random Forest. Evaluated with accuracy, precision, recall, and F1-score. Clone, install dependencies, and run the Jupyter notebook for full analysis.
Language: Jupyter Notebook - Size: 3.31 MB - Last synced at: 14 days ago - Pushed at: 12 months ago - Stars: 4 - Forks: 0

swapnita-pandey/Identification-of-Edited-Images
Detection of Human Edited Images using CNN, VGG16, Xception, ELA, Ensemble Learning.
Language: Jupyter Notebook - Size: 32.7 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 4 - Forks: 0

sondosaabed/Preprocessing-for-Machine-Learning-in-Python
DataCamp inetrmediate course on how and when to perform data preprocessing in any machine learning project to get the data ready for modeling
Language: Jupyter Notebook - Size: 1.34 MB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 4 - Forks: 0

Armanx200/Fruit-Detector
๐๐ Fruit Detector: A machine learning model to identify fruits from images, powered by TensorFlow and Keras. Train the model, predict fruits, and explore the world of AI fruit recognition! ๐๐
Language: Python - Size: 110 MB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 4 - Forks: 0

yashpandey474/Identification-of-fake-reviews
Fake review detection using machine learning and deep learning techniques such as CNNs, SOMs, K-means clustering, various supervised models and natural language processing tools such as Word2Vec & TFIDF, GloVe etc.
Language: Jupyter Notebook - Size: 36.8 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 1

nafisalawalidris/Predicting-Credit-Card-Approvals
Explore credit card approval prediction through data analysis and machine learning. Preprocess data, train logistic regression models, and optimize hyperparameters. Learn data preprocessing, feature engineering, model training, and evaluation. Dive into the world of machine learning with Python and popular libraries.
Language: Python - Size: 33.2 KB - Last synced at: 3 months ago - Pushed at: about 2 years ago - Stars: 4 - Forks: 0

WalidAlsafadi/UFO-Sighting
UFO Sightings cross the world!
Language: Jupyter Notebook - Size: 6.1 MB - Last synced at: 4 months ago - Pushed at: about 2 years ago - Stars: 4 - Forks: 0

spChalk/Vaccine-Sentiment-Classifier
:syringe: Vaccine Sentiment Classifier is a deep learning classifier trained on real world twitter data, that distinguishes 3 types of tweets: Neutral, Anti-vax & Pro-vax.
Language: Jupyter Notebook - Size: 10 MB - Last synced at: almost 2 years ago - Pushed at: almost 3 years ago - Stars: 4 - Forks: 0

AmirAli5/Data-Preprocessing
In this repo, all about Data preprocessing. Data preprocessing is a required first step before any machine learning machinery can be applied, because the algorithms learn from the data and the learning outcome for problem solving heavily depends on the proper data needed to solve a particular problem โ which are called features. Examples of data preprocessing include cleaning, instance selection, normalization, one hot encoding, transformation, feature extraction and selection, etc.
Language: Jupyter Notebook - Size: 4.51 MB - Last synced at: 2 months ago - Pushed at: almost 3 years ago - Stars: 4 - Forks: 0

MsnAmiri/Customer-Loyalty-Prediction-in-Python
A machine learning and deep learning project from ground up.
Language: Jupyter Notebook - Size: 1.12 MB - Last synced at: 7 months ago - Pushed at: almost 3 years ago - Stars: 4 - Forks: 0

prkskrs/Exploratory-Data-Analysis
exploratory data analysis best arranged notebooks (beginner to advance)
Language: Jupyter Notebook - Size: 3.45 MB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 4 - Forks: 1

kbelisar/datalark
Like the mudlark finding treasures on the foreshore, the datalark seeks treasures hidden within messy data!
Language: R - Size: 32.2 KB - Last synced at: 3 months ago - Pushed at: over 3 years ago - Stars: 4 - Forks: 0

yashpatel-py/earthquake_prediction
Language: Jupyter Notebook - Size: 1.75 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 4 - Forks: 0

NatthanonNon/NOAA-grb2
GRIB2 Tutorial with NOAA dataset
Language: Jupyter Notebook - Size: 139 MB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 4 - Forks: 0

toukirnaim08/Rainfall-Prediction-Australian-Weather-Dataset
Rainfall prediction from Australian weather dataset using multiple machine learning models.
Language: R - Size: 5.88 MB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 4 - Forks: 0

carrdelling/multilabel_split
Sample algorithm for stratified train/test split in multi-label problems
Language: Python - Size: 4.88 KB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 4 - Forks: 0

recepayddogdu/Data-Science
Data Science alanฤฑnda kendimi geliลtirmek iรงin รงalฤฑลtฤฑฤฤฑm kaynaklardan bazฤฑlarฤฑnฤฑ burada depoluyorum. รalฤฑลma notlarฤฑma Notes klasรถrรผnden ulaลabilirsiniz.
Language: Jupyter Notebook - Size: 458 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 4 - Forks: 3

turhancan97/Python_for_DataScience
Verili Bilimi iรงin Temel Python Kodlarฤฑ
Language: Jupyter Notebook - Size: 6.11 MB - Last synced at: over 2 years ago - Pushed at: almost 5 years ago - Stars: 4 - Forks: 3

stxupengyu/Credit-Data-Analysis
ๅฎ็ฐๅฏนไฟก่ดทๆฐๆฎ็ๆฐๆฎ้ขๅค็๏ผๆฐๆฎๅๆใไนๅๅฉ็จๅค็งๅ็ฑป็ฎๆณๅฏนๅ ฌๅธๆฏๅฆ่ฟ็บฆ่ฟ่ก้ขๆตใRealize the data preprocessing and data analysis of credit data. Then, it uses a variety of classification algorithms to predict whether the company defaults.
Language: Jupyter Notebook - Size: 39.1 MB - Last synced at: 12 months ago - Pushed at: almost 5 years ago - Stars: 4 - Forks: 0

Ashleshk/Power-BI-A-Z-Hands-On-Power-BI-Training-For-Data-Science-Udemy
Learn data visualization through Microsoft Power BI and create opportunities for you or key decision makers to discover data patterns such as customer purchase behavior, sales trends, or production bottlenecks. You'll learn all of the features in Power BI that allow you to explore, experiment with, fix, prepare, and present data easily, quickly, and beautifully.
Size: 5.82 MB - Last synced at: over 2 years ago - Pushed at: almost 5 years ago - Stars: 4 - Forks: 0

konradmalik/ann-laminar-burning-velocity ๐ฆ
Models trained in my article on LBV predictions.
Language: C - Size: 3.91 KB - Last synced at: over 1 year ago - Pushed at: about 5 years ago - Stars: 4 - Forks: 0

podondra/data-preprocessing
data preprocessing examples
Language: Jupyter Notebook - Size: 4.64 MB - Last synced at: almost 2 years ago - Pushed at: over 5 years ago - Stars: 4 - Forks: 4

Vishal1999-33/K-Nearest-Neighbors
Credit Card Fraud Detection
Language: Jupyter Notebook - Size: 524 KB - Last synced at: 6 months ago - Pushed at: over 5 years ago - Stars: 4 - Forks: 0

djgarcia/PCARD
PCARD Ensemble classifier for Big Data
Language: Scala - Size: 19.5 KB - Last synced at: over 1 year ago - Pushed at: almost 6 years ago - Stars: 4 - Forks: 2

MrKhan0747/New-York-City-Taxi-Trips-Fare
In this project using New York dataset we will predict the fare price of next trip. The dataset can be downloaded from https://www.kaggle.com/kentonnlp/2014-new-york-city-taxi-trips The dataset contains 2 Crore records and 8 features along with GPS coordinates of pickup and dropoff
Language: Jupyter Notebook - Size: 2.43 MB - Last synced at: over 1 year ago - Pushed at: about 6 years ago - Stars: 4 - Forks: 2

bharathgs/dframeutils
simple utility tools for dataframes in Python || WIP ||
Language: Python - Size: 98.6 KB - Last synced at: 17 days ago - Pushed at: over 7 years ago - Stars: 4 - Forks: 1

geograz/TBM_advance_classification
Code and data repository for the paper "Challenges and Opportunities of Data-Driven Advance Classification for Hard Rock TBM excavations"
Language: Python - Size: 526 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 3 - Forks: 1

dvamsidhar2002/Audio-Data-Analysis-Techniques
This repository contains data preprocessing and analysis techniques for audio data.
Language: Jupyter Notebook - Size: 3.64 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 3 - Forks: 2

stefanherdy/SAM-GAN-Clustering
Implementation of Meta AI's Segment Anything Model to do an automated image annotation of simple microscope images and a modified GAN to cluster the preprocessed images according to their qualitative features
Language: Python - Size: 245 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 3 - Forks: 0

MvMukesh/ProblemSolving-FrameWork-ML
Detailed industry specific framework to solve Machine Learning Problem
Language: Python - Size: 139 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 3 - Forks: 0

SamSamhuns/ml_data_processing
ML data processing (For Computer Vision)
Language: Python - Size: 5.59 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 3 - Forks: 0

Suhrud1511/DataLens
DataLens is a robust online tool for automatic dataset preprocessing. With just a click, generate detailed visualizations and comprehensive reports. Simplify your data preparation and uncover insights instantly.
Language: TypeScript - Size: 3.5 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 3 - Forks: 1

psmyth94/biosets
A bioinformatics extension of ๐ค Datasets library, built for ML applications on biological and omics data, offering easy integration of metadata and low-code data management tools.
Language: Python - Size: 279 KB - Last synced at: 11 days ago - Pushed at: 8 months ago - Stars: 3 - Forks: 0

Armanx200/Income-Predictor
๐โจ A machine learning project that predicts income based on various demographic factors using Random Forest and Gradient Boosting algorithms. Includes data preprocessing, hyperparameter tuning, and model evaluation with detailed performance metrics. ๐๐ค
Language: Python - Size: 652 KB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 3 - Forks: 0

ThecoderPinar/Credit-Card-Fraud-Detection-Project
This project focuses on the detection of credit card fraud using various data science and machine learning techniques. The dataset includes a record of credit card transactions over a specific period, with the goal of accurately identifying fraudulent activities. ๐โจ
Language: Jupyter Notebook - Size: 5.02 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 3

TimKong21/Medical-Appointment-No-Show-Prediction
A machine learning solution predicting patient no-shows in healthcare appointments. This project integrates EDA, data processing, feature engineering, and XGBoost modeling, with a workflow spanning from Snowflake data retrieval to AWS deployment (S3, SageMaker, Lambda, API Gateway), aiming to enhance appointment management in medical ERP systems.
Language: Jupyter Notebook - Size: 35.4 MB - Last synced at: 2 months ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

MartinKalema/Malaria-In-Africa
This project is aimed at understanding, mitigating, and controlling the impact of malaria in Africa.
Language: Jupyter Notebook - Size: 1.13 MB - Last synced at: 5 months ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

that-coding-kid/Saarthi
The problem statement includes building a voice-powered chatbot for the BHUVAN portal. This project has been selected in Top 50 among various projects from across the country for ISRO'S SIF Hackathon.
Language: Jupyter Notebook - Size: 1.01 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

ehtisham-sadiq/Movie-Recommendation-System
The Movie Recommendation System is an all-encompassing data science project that utilizes sophisticated machine learning methods, including collaborative and content-based filtering, to provide users with personalized movie suggestions based on their preferences and viewing history. I
Language: Jupyter Notebook - Size: 9.12 MB - Last synced at: 25 days ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

LukaszMalucha/Springboard-Insights
Scraping and analyzing latest Springboard offer. Django REST + Vue.js
Language: Python - Size: 2.62 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 0

zeynpaktasli/StrokePrediciton
Stroke prediciton with EDA, data preprocessing, model building and sampling
Language: Jupyter Notebook - Size: 126 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 0

Arvindhh931/Mileage-prediction
Fuel Efficiency of car in miles per gallon
Language: Jupyter Notebook - Size: 3.05 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 2

waqarg2001/PakWheels-Data-Analysis
ETL process and EDA was performed on used cars dataset scraped from PakWheels.com. The analysis was done through Jupyter Notebook. Insights were shared.
Language: Jupyter Notebook - Size: 5.77 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 1

jmoro0408/Spotify_Analysis_2.0
Extracting, analyzing, and using my Spotify listening history to make predictions.
Language: Jupyter Notebook - Size: 10.4 MB - Last synced at: 7 months ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 0

SayamAlt/Stellar-Classification---Sloan-Digital-Sky-Survey-17
Successfully established a machine learning model which can predict an appropriate stellar class, on the basis of a distinct set of spectral characteristics, to a substantially high level of accuracy.
Language: Jupyter Notebook - Size: 16.7 MB - Last synced at: 4 months ago - Pushed at: about 3 years ago - Stars: 3 - Forks: 1

0xmahdirostami/Kaggle
The main focus of this repository is to do some projects of Kaggle("Kaggle, a subsidiary of Google LLC, is an online community of data scientists and machine learning practitioners").
Language: Jupyter Notebook - Size: 19.6 MB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 3 - Forks: 0

adamavip/multitrait-nirs-model
Modeling of nutritional traits from multiple crops using NIRS and machine learning/statistics
Language: Python - Size: 26.6 MB - Last synced at: almost 2 years ago - Pushed at: about 3 years ago - Stars: 3 - Forks: 2

caesarmario/heart-disease-prediction-with-logistic-regression-SAS-studio
Heart disease prediction with logistic regression using SAS Studio. The dataset is taken from UCI Machine Learning about heart disease.
Language: SAS - Size: 907 KB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 0

caesarmario/big-mart-sales-preprocessing-SAS-studio
Data preprocessing, feature engineering, and EDA for "Big Mart Sales" data set using SAS Studio. The dataset is taken from Kaggle (https://www.kaggle.com/mrmorj/big-mart-sales).
Language: SAS - Size: 2.4 MB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 1

arunsinghbabal/Click_through_vs_conversion_classification
The model classifies the advertisements by clicked or not-clicked and later identifies zero, low or high conversion rate.
Language: Jupyter Notebook - Size: 1.32 MB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 0
