data-preprocessing | Topic | Ecosyste.ms: Repos

Topic: "data-preprocessing"

KhaledAshrafH/Logistic-Regression

This program implements logistic regression from scratch using the gradient descent algorithm in Python to predict whether customers will purchase a new car based on their age and salary.

Language: Python - Size: 12.7 KB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 8 - Forks: 3

alotlikar1010/PW-Skills-Data-Master-Assignment

Assignment Solution of PW Skills Data Master Course

Language: Jupyter Notebook - Size: 1.77 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 8 - Forks: 5

basiralab/Kaggle-BrainNetPrediction-Toolbox

A Python toolbox for predicting brain network (graph) evolution over time from a single observation. The codes of the 20 competing Kaggle teams along with the competition datasets are made available.

Language: Python - Size: 4.92 MB - Last synced at: 3 months ago - Pushed at: almost 3 years ago - Stars: 8 - Forks: 3

d4rk-lucif3r/LuciferML 📦

Semi-Auto Machine Learning Library by d4rk-lucif3r

Language: Python - Size: 2.52 MB - Last synced at: 9 months ago - Pushed at: about 3 years ago - Stars: 8 - Forks: 6

DataPreprocessing/DataCleaning

Data Cleaning is a python package for data preprocessing. This cleans the CSV file and returns the cleaned data frame. It does the work of imputation, removing duplicates, replacing special characters, and many more.

Language: Python - Size: 117 KB - Last synced at: about 1 month ago - Pushed at: about 4 years ago - Stars: 8 - Forks: 3

TatevKaren/Deep-Learning-for-Data-Science

Deep Learning Case Studies with Tensorflow and Keras for Beginners-Advanced: ANN, CNN, RNN, Self-Organizing Maps, Boltzmann Machines, Stacked Autoencoders

Language: Python - Size: 2.54 MB - Last synced at: 3 months ago - Pushed at: over 4 years ago - Stars: 8 - Forks: 2

seoulai/mdml

Medical data and machine learning group

Language: Jupyter Notebook - Size: 6.74 MB - Last synced at: about 1 year ago - Pushed at: over 5 years ago - Stars: 8 - Forks: 0

mansipatel2508/Yelp-Review-Stars-Prediction-with-Machine-Learning

The project has text vectorization, handling big data with merging and cleaning the text and getting the required columns while boosting the performance by feature extraction and parameter tuning for NN, compares the Performances through applied different models treating the problem as classification and regression both.

Language: Jupyter Notebook - Size: 3.68 MB - Last synced at: 15 days ago - Pushed at: almost 6 years ago - Stars: 8 - Forks: 3

DivyaKrishnani/Data-Preprocessing-with-Python

Implementation of Data Preprocessing techniques such as handling missing values, noise smoothing, PCA, etc.

Language: Jupyter Notebook - Size: 1.64 MB - Last synced at: 7 months ago - Pushed at: over 6 years ago - Stars: 8 - Forks: 11

MahtaFetrat/GPTInformal-Persian-Speech-Dataset

A free licensed Persian TTS dataset including 6+ hours of audio-text pairs with subject

Size: 4.88 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 7 - Forks: 0

nhauber99/degradr

Python library for realistically degrading images.

Language: Python - Size: 8.7 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 7 - Forks: 0

Ruban2205/Machine_learning_fundamentals

This repository contains a collection of fundamental topics and techniques in machine learning. It aims to provide a comprehensive understanding of various aspects of machine learning through simplified notebooks. Each topic is covered in a separate notebook, allowing for easy exploration and learning.

Language: Jupyter Notebook - Size: 2.6 MB - Last synced at: 3 months ago - Pushed at: almost 2 years ago - Stars: 7 - Forks: 0

KhaledAshrafH/Linear-Regression

This program implements linear regression from scratch using the gradient descent algorithm in Python. It predicts car prices based on selected features and uses a dataset of cars with their respective prices.

Language: Python - Size: 19.5 KB - Last synced at: 3 months ago - Pushed at: almost 2 years ago - Stars: 7 - Forks: 0

miraehab/FIFA-23-ML-Project

Predict the Position of a Player Based on his skills and group the players based on their similarities.

Language: Jupyter Notebook - Size: 57.3 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 7 - Forks: 0

anshul1004/TweetsClustering

Clustering similar tweets using K-means clustering algorithm and Jaccard distance metric

Language: Python - Size: 3.32 MB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 7 - Forks: 4

sharmaroshan/Coursera-Reviews-Analysis

It is a Natural Language Processing Problem where we have to decide the sentiments of the users who reviewed the course. and then classifying the reviews into positive and negative.

Language: Jupyter Notebook - Size: 20.1 MB - Last synced at: over 1 year ago - Pushed at: about 6 years ago - Stars: 7 - Forks: 3

ahmednabil950/JOBS-Information-Extraction

Project to extract entities from Job Description Articles.

Size: 2.14 MB - Last synced at: 5 months ago - Pushed at: over 6 years ago - Stars: 7 - Forks: 1

xueyouluo/cn-data-enhance

使用翻译技术做数据增强。

Language: Python - Size: 11.7 KB - Last synced at: about 1 year ago - Pushed at: almost 7 years ago - Stars: 7 - Forks: 3

partoftheorigin/predict-blog-author-features

Predicts gender, age, label, and zodiac sign of the writer from the given text.

Language: Python - Size: 56.6 KB - Last synced at: over 2 years ago - Pushed at: over 7 years ago - Stars: 7 - Forks: 2

Clearbox-AI/preprocessor

A fast and felxible data preprocessor based on polars.

Language: Python - Size: 1.88 MB - Last synced at: 24 days ago - Pushed at: 24 days ago - Stars: 6 - Forks: 0

sergio11/spam_email_classifier_lstm

This project uses a Bi-directional LSTM model 📧🤖 to classify emails as spam or legitimate, utilizing NLP techniques like tokenization, padding, and stopword removal. It aims to create an effective email classifier 💻📊 while addressing overfitting with strategies like early stopping 🚫.

Language: Jupyter Notebook - Size: 3.2 MB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 6 - Forks: 0

sawallesalfo/Deployment_Data_Science_Project

Deploy maching learning model in local / server /API /cloud/ dockers

Language: Python - Size: 4.15 MB - Last synced at: 2 months ago - Pushed at: 4 months ago - Stars: 6 - Forks: 3

marcusGH/edain_paper

Contains the implementation of the EDAIN and EDAIN-KL methods proposed in our paper. The research was also part of the MSc thesis I wrote in collaboration with American Express as part of my MSc in Statistics (Data Science) at Imperial College London

Language: Jupyter Notebook - Size: 26.6 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 6 - Forks: 0

Eden-Kramer-Lab/loren_frank_data_processing

Python tools for reading in data from Loren Frank's lab

Language: Jupyter Notebook - Size: 5.32 MB - Last synced at: 11 months ago - Pushed at: almost 2 years ago - Stars: 6 - Forks: 1

Afkerian/Beijing-Multi-Site-Air-Quality-Data-Data-Set

The present project aims to predict air pollution in Beijing, China, using the data set "Beijing Multi-Site Air-Quality Data Data Set"

Language: Jupyter Notebook - Size: 36.2 MB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 6 - Forks: 1

shamspias/gpt3-data-preprocessing

This repository containing code for preprocessing text data from PDF and DOCX files for use with GPT-3. It includes steps such as tokenization, removal of stop words and punctuation, and formatting for GPT-3 input.

Language: Python - Size: 11.7 KB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 6 - Forks: 1

xShaimaa/Udacity-Data-Analysis-and-Viz-with-Microsoft-Power-BI

Projects for Udacity's Data Analysis and Visualization with Microsoft power BI that covers: data pre-processing, visualization, and analysis using Power BI as primary tool.

Size: 24.4 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 6 - Forks: 1

prakhargurawa/Product-Category-Prediction

Machine learning pipeline to predict relevant categories based on a product name and product description, using NLP and Deep learning concepts.

Language: Jupyter Notebook - Size: 12 MB - Last synced at: over 2 years ago - Pushed at: almost 4 years ago - Stars: 6 - Forks: 2

earthat/SMOTE-over-Sampling

This repository is for MATLAB code for balancing of multiclass data by SMOTE

Language: MATLAB - Size: 76.2 KB - Last synced at: over 2 years ago - Pushed at: over 6 years ago - Stars: 6 - Forks: 2

xuefeng-xu/fedps

Federated data Preprocessing via aggregated Statistics

Language: Python - Size: 241 KB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 5 - Forks: 1

RenatoMaynard/Supervised-Machine-Learning-Models-Pytorch-Sklearn

This repository provides a comprehensive implementation of supervised machine learning models using PyTorch and Scikit-learn. It includes end-to-end workflows for both classification and regression tasks, covering data preprocessing, model training, evaluation, and comparison between traditional ML models

Size: 1.95 KB - Last synced at: 5 days ago - Pushed at: 4 months ago - Stars: 5 - Forks: 0

Armanx200/Animal-Detector

🐾 Training a machine learning model to recognize 15 different animal classes and classify images accordingly.

Language: Python - Size: 35.1 MB - Last synced at: 2 days ago - Pushed at: 10 months ago - Stars: 5 - Forks: 1

AnshulOP/A-Z-Machine-Learning

This repository contains the code related to machine learning knowledge. Each code has been provided from start to end with systematical vew of each concept that you will need in your journey of learning ML.

Language: Jupyter Notebook - Size: 16.1 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 5 - Forks: 1

Samashi47/ML-Toolkit-Project

A general-purpose toolkit for data preprocessing, machine learning modeling, and visualization.

Language: Python - Size: 368 KB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 5 - Forks: 3

supriya811106/WhatsApp-Chat-Analyzer-App

Analyze WhatsApp chats with Python, Streamlit, and data visualization. Explore messaging patterns, content trends, and emoji usage to uncover insights from your conversations.

Language: Python - Size: 2.44 MB - Last synced at: about 2 months ago - Pushed at: about 1 year ago - Stars: 5 - Forks: 1

opencodeiiita/News_Scraping 📦

Language: Jupyter Notebook - Size: 139 KB - Last synced at: 4 months ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 22

Samia35-2973/Learning-Data-Science

📚 This repository is my personal data science learning hub. Explore my journey from the very basics to advanced techniques. Dive into Python, data manipulation, analysis, visualization, and machine learning. Join me as I learn, grow, and experiment in the world of data science.

Language: Jupyter Notebook - Size: 79.1 KB - Last synced at: 4 days ago - Pushed at: almost 2 years ago - Stars: 5 - Forks: 0

IBM-EPBL/IBM-Project-17145-1659629015

Smart Lender - Applicant Credibility Prediction for Loan Approval

Language: Jupyter Notebook - Size: 19.8 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 5 - Forks: 4

karamolegkos/EverAnalyzer

EverAnalyzer is my thesis in the Department of Digital Systems of the University of Piraeus. EverAnalyzer is a platform for collecting, preprocessing, processing and analyzing Big Data from the Twitter platform.

Language: HTML - Size: 761 MB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 5 - Forks: 0

MsnAmiri/Customer-Segmentation-with-IBM-SPSS-Modeler

Creating a banking customer segmentation dataset using 3 initial datasets in the IBM SPSS environment

Size: 1.02 MB - Last synced at: 6 months ago - Pushed at: almost 3 years ago - Stars: 5 - Forks: 0

AtlasmanYevgenii/Practicum100-by-Yandex

Course included such topics, as Data Preprocessing, Exploratory Data Analysis (EDA), Statistical Data Analysis (SDA), Data Collection and Storage (PostgreSQL), Business Analytics, Making Business Decisions Based on Data (Hypotheses testing), How to Tell a Story Using Data (Presentation and Data Visualization - Maplotlib, Seaborn, Plotly), Automation (Dash, Tableau), Forecasts and Predictions, 2 Integrated projects.

Language: Jupyter Notebook - Size: 4.99 MB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 5 - Forks: 0

kozodoi/dptools

Python package with utilities for data processing, aggregation, feature engineering and data versioning

Language: Python - Size: 108 KB - Last synced at: about 1 month ago - Pushed at: about 3 years ago - Stars: 5 - Forks: 2

Lokesh-Attarde/Swiggy_Data_Analysis

iNeuron Project - Swiggy Data Analysis, automated EDA & reports, Visualizing the variety of Use Cases along with WordCloud on Textual Data.

Language: HTML - Size: 7.04 MB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 5 - Forks: 0

sdhani/face-mask-detection

😷 Machine learning models determining whether or not an individual is wearing a face mask, not wearing a face mask, or wearing a face mask incorrectly. Course project

Language: Jupyter Notebook - Size: 13.7 MB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 5 - Forks: 6

shreyas219/creditCardFraudDetection

Credit Card Fraud Detection using machine learning

Language: Jupyter Notebook - Size: 69.3 KB - Last synced at: over 2 years ago - Pushed at: over 6 years ago - Stars: 5 - Forks: 6

sanjeevai/sparkify-capstone

Data Analysis in Spark to Identify Customer Churn for a fictional music service(like Spotify)

Language: Jupyter Notebook - Size: 6.47 MB - Last synced at: over 2 years ago - Pushed at: over 6 years ago - Stars: 5 - Forks: 3

trnet4334/img_colorization

This project uses Keras and Python to convert a grayscale image to color without any additional information.

Language: Python - Size: 20 MB - Last synced at: over 2 years ago - Pushed at: over 7 years ago - Stars: 5 - Forks: 1

lykmapipo/US-Gas-Prices

Python scripts that scrape US gas prices

Language: Python - Size: 2.28 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 4 - Forks: 1

Amirreza81/Applied-Data-Science-Course

Comprehensive notes, practical exercises, and problem-solving solutions from the Applied Data Science course, covering data preprocessing, machine learning algorithms, statistical analysis, data visualization, and real-world applications.

Language: Jupyter Notebook - Size: 5.58 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 4 - Forks: 0

sergio11/online_payment_fraud

Fraud detection using Deep Neural Networks to predict fraudulent transactions in financial data. 🚨🤖 Complete process from EDA and data preprocessing to model training and evaluation. 📊🔍

Language: Jupyter Notebook - Size: 1.77 MB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 4 - Forks: 2

mayurasandakalum/breast-cancer-detection

Code for classifying breast cancer tumors using machine learning. Includes preprocessing, visualizations, and models like Logistic Regression, Decision Tree, and Random Forest. Evaluated with accuracy, precision, recall, and F1-score. Clone, install dependencies, and run the Jupyter notebook for full analysis.

Language: Jupyter Notebook - Size: 3.31 MB - Last synced at: 14 days ago - Pushed at: 12 months ago - Stars: 4 - Forks: 0

swapnita-pandey/Identification-of-Edited-Images

Detection of Human Edited Images using CNN, VGG16, Xception, ELA, Ensemble Learning.

Language: Jupyter Notebook - Size: 32.7 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 4 - Forks: 0

sondosaabed/Preprocessing-for-Machine-Learning-in-Python

DataCamp inetrmediate course on how and when to perform data preprocessing in any machine learning project to get the data ready for modeling

Language: Jupyter Notebook - Size: 1.34 MB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 4 - Forks: 0

Armanx200/Fruit-Detector

🍇🔍 Fruit Detector: A machine learning model to identify fruits from images, powered by TensorFlow and Keras. Train the model, predict fruits, and explore the world of AI fruit recognition! 🍓🍍

Language: Python - Size: 110 MB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 4 - Forks: 0

yashpandey474/Identification-of-fake-reviews

Fake review detection using machine learning and deep learning techniques such as CNNs, SOMs, K-means clustering, various supervised models and natural language processing tools such as Word2Vec & TFIDF, GloVe etc.

Language: Jupyter Notebook - Size: 36.8 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 1

nafisalawalidris/Predicting-Credit-Card-Approvals

Explore credit card approval prediction through data analysis and machine learning. Preprocess data, train logistic regression models, and optimize hyperparameters. Learn data preprocessing, feature engineering, model training, and evaluation. Dive into the world of machine learning with Python and popular libraries.

Language: Python - Size: 33.2 KB - Last synced at: 3 months ago - Pushed at: about 2 years ago - Stars: 4 - Forks: 0

WalidAlsafadi/UFO-Sighting

UFO Sightings cross the world!

Language: Jupyter Notebook - Size: 6.1 MB - Last synced at: 4 months ago - Pushed at: about 2 years ago - Stars: 4 - Forks: 0

spChalk/Vaccine-Sentiment-Classifier

:syringe: Vaccine Sentiment Classifier is a deep learning classifier trained on real world twitter data, that distinguishes 3 types of tweets: Neutral, Anti-vax & Pro-vax.

Language: Jupyter Notebook - Size: 10 MB - Last synced at: almost 2 years ago - Pushed at: almost 3 years ago - Stars: 4 - Forks: 0

AmirAli5/Data-Preprocessing

In this repo, all about Data preprocessing. Data preprocessing is a required first step before any machine learning machinery can be applied, because the algorithms learn from the data and the learning outcome for problem solving heavily depends on the proper data needed to solve a particular problem – which are called features. Examples of data preprocessing include cleaning, instance selection, normalization, one hot encoding, transformation, feature extraction and selection, etc.

Language: Jupyter Notebook - Size: 4.51 MB - Last synced at: 2 months ago - Pushed at: almost 3 years ago - Stars: 4 - Forks: 0

MsnAmiri/Customer-Loyalty-Prediction-in-Python

A machine learning and deep learning project from ground up.

Language: Jupyter Notebook - Size: 1.12 MB - Last synced at: 7 months ago - Pushed at: almost 3 years ago - Stars: 4 - Forks: 0

prkskrs/Exploratory-Data-Analysis

exploratory data analysis best arranged notebooks (beginner to advance)

Language: Jupyter Notebook - Size: 3.45 MB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 4 - Forks: 1

kbelisar/datalark

Like the mudlark finding treasures on the foreshore, the datalark seeks treasures hidden within messy data!

Language: R - Size: 32.2 KB - Last synced at: 3 months ago - Pushed at: over 3 years ago - Stars: 4 - Forks: 0

yashpatel-py/earthquake_prediction

Language: Jupyter Notebook - Size: 1.75 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 4 - Forks: 0

NatthanonNon/NOAA-grb2

GRIB2 Tutorial with NOAA dataset

Language: Jupyter Notebook - Size: 139 MB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 4 - Forks: 0

toukirnaim08/Rainfall-Prediction-Australian-Weather-Dataset

Rainfall prediction from Australian weather dataset using multiple machine learning models.

Language: R - Size: 5.88 MB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 4 - Forks: 0

carrdelling/multilabel_split

Sample algorithm for stratified train/test split in multi-label problems

Language: Python - Size: 4.88 KB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 4 - Forks: 0

recepayddogdu/Data-Science

Data Science alanında kendimi geliştirmek için çalıştığım kaynaklardan bazılarını burada depoluyorum. Çalışma notlarıma Notes klasöründen ulaşabilirsiniz.

Language: Jupyter Notebook - Size: 458 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 4 - Forks: 3

turhancan97/Python_for_DataScience

Verili Bilimi için Temel Python Kodları

Language: Jupyter Notebook - Size: 6.11 MB - Last synced at: over 2 years ago - Pushed at: almost 5 years ago - Stars: 4 - Forks: 3

stxupengyu/Credit-Data-Analysis

实现对信贷数据的数据预处理，数据分析。之后利用多种分类算法对公司是否违约进行预测。Realize the data preprocessing and data analysis of credit data. Then, it uses a variety of classification algorithms to predict whether the company defaults.

Language: Jupyter Notebook - Size: 39.1 MB - Last synced at: 12 months ago - Pushed at: almost 5 years ago - Stars: 4 - Forks: 0

Ashleshk/Power-BI-A-Z-Hands-On-Power-BI-Training-For-Data-Science-Udemy

Learn data visualization through Microsoft Power BI and create opportunities for you or key decision makers to discover data patterns such as customer purchase behavior, sales trends, or production bottlenecks. You'll learn all of the features in Power BI that allow you to explore, experiment with, fix, prepare, and present data easily, quickly, and beautifully.

Size: 5.82 MB - Last synced at: over 2 years ago - Pushed at: almost 5 years ago - Stars: 4 - Forks: 0

konradmalik/ann-laminar-burning-velocity 📦

Models trained in my article on LBV predictions.

Language: C - Size: 3.91 KB - Last synced at: over 1 year ago - Pushed at: about 5 years ago - Stars: 4 - Forks: 0

podondra/data-preprocessing

data preprocessing examples

Language: Jupyter Notebook - Size: 4.64 MB - Last synced at: almost 2 years ago - Pushed at: over 5 years ago - Stars: 4 - Forks: 4

Vishal1999-33/K-Nearest-Neighbors

Credit Card Fraud Detection

Language: Jupyter Notebook - Size: 524 KB - Last synced at: 6 months ago - Pushed at: over 5 years ago - Stars: 4 - Forks: 0

djgarcia/PCARD

PCARD Ensemble classifier for Big Data

Language: Scala - Size: 19.5 KB - Last synced at: over 1 year ago - Pushed at: almost 6 years ago - Stars: 4 - Forks: 2

MrKhan0747/New-York-City-Taxi-Trips-Fare

In this project using New York dataset we will predict the fare price of next trip. The dataset can be downloaded from https://www.kaggle.com/kentonnlp/2014-new-york-city-taxi-trips The dataset contains 2 Crore records and 8 features along with GPS coordinates of pickup and dropoff

Language: Jupyter Notebook - Size: 2.43 MB - Last synced at: over 1 year ago - Pushed at: about 6 years ago - Stars: 4 - Forks: 2

bharathgs/dframeutils

simple utility tools for dataframes in Python || WIP ||

Language: Python - Size: 98.6 KB - Last synced at: 17 days ago - Pushed at: over 7 years ago - Stars: 4 - Forks: 1

geograz/TBM_advance_classification

Code and data repository for the paper "Challenges and Opportunities of Data-Driven Advance Classification for Hard Rock TBM excavations"

Language: Python - Size: 526 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 3 - Forks: 1

dvamsidhar2002/Audio-Data-Analysis-Techniques

This repository contains data preprocessing and analysis techniques for audio data.

Language: Jupyter Notebook - Size: 3.64 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 3 - Forks: 2

stefanherdy/SAM-GAN-Clustering

Implementation of Meta AI's Segment Anything Model to do an automated image annotation of simple microscope images and a modified GAN to cluster the preprocessed images according to their qualitative features

Language: Python - Size: 245 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 3 - Forks: 0

MvMukesh/ProblemSolving-FrameWork-ML

Detailed industry specific framework to solve Machine Learning Problem

Language: Python - Size: 139 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 3 - Forks: 0

SamSamhuns/ml_data_processing

ML data processing (For Computer Vision)

Language: Python - Size: 5.59 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 3 - Forks: 0

Suhrud1511/DataLens

DataLens is a robust online tool for automatic dataset preprocessing. With just a click, generate detailed visualizations and comprehensive reports. Simplify your data preparation and uncover insights instantly.

Language: TypeScript - Size: 3.5 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 3 - Forks: 1

psmyth94/biosets

A bioinformatics extension of 🤗 Datasets library, built for ML applications on biological and omics data, offering easy integration of metadata and low-code data management tools.

Language: Python - Size: 279 KB - Last synced at: 11 days ago - Pushed at: 8 months ago - Stars: 3 - Forks: 0

Armanx200/Income-Predictor

🔍✨ A machine learning project that predicts income based on various demographic factors using Random Forest and Gradient Boosting algorithms. Includes data preprocessing, hyperparameter tuning, and model evaluation with detailed performance metrics. 📊🤖

Language: Python - Size: 652 KB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 3 - Forks: 0

ThecoderPinar/Credit-Card-Fraud-Detection-Project

This project focuses on the detection of credit card fraud using various data science and machine learning techniques. The dataset includes a record of credit card transactions over a specific period, with the goal of accurately identifying fraudulent activities. 🚀✨

Language: Jupyter Notebook - Size: 5.02 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 3

TimKong21/Medical-Appointment-No-Show-Prediction

A machine learning solution predicting patient no-shows in healthcare appointments. This project integrates EDA, data processing, feature engineering, and XGBoost modeling, with a workflow spanning from Snowflake data retrieval to AWS deployment (S3, SageMaker, Lambda, API Gateway), aiming to enhance appointment management in medical ERP systems.

Language: Jupyter Notebook - Size: 35.4 MB - Last synced at: 2 months ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

MartinKalema/Malaria-In-Africa

This project is aimed at understanding, mitigating, and controlling the impact of malaria in Africa.

Language: Jupyter Notebook - Size: 1.13 MB - Last synced at: 5 months ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

that-coding-kid/Saarthi

The problem statement includes building a voice-powered chatbot for the BHUVAN portal. This project has been selected in Top 50 among various projects from across the country for ISRO'S SIF Hackathon.

Language: Jupyter Notebook - Size: 1.01 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

ehtisham-sadiq/Movie-Recommendation-System

The Movie Recommendation System is an all-encompassing data science project that utilizes sophisticated machine learning methods, including collaborative and content-based filtering, to provide users with personalized movie suggestions based on their preferences and viewing history. I

Language: Jupyter Notebook - Size: 9.12 MB - Last synced at: 25 days ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0