GitHub topics: dataprocessing
SourabhSingh9/Data-Analytics-Projects-by-Sourabh
HR Analytics (Attrition Insights) Portfolio
Size: 1.99 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 0 - Forks: 0

soheb120/Breast-Cancer-Classification
This repository contains a notebook for classifying breast cancer tumors using neural networks. It covers dataset exploration, model training with TensorFlow and Keras, and performance evaluation. 🩺💻
Language: Jupyter Notebook - Size: 21.5 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

Ricco1010/ricco
A personal toolset built over time by Ricco
Language: Python - Size: 5.38 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 3 - Forks: 0

Saffrontea/dx
dx - Enhanced JavaScript Data Processing for the Command Line
Language: TypeScript - Size: 95.7 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 0 - Forks: 0

BioFSharp/BioFSharp
Open source bioinformatics and computational biology toolbox written in F#. This is the core package containing type models and parsers/writers.
Language: F# - Size: 311 MB - Last synced at: 6 days ago - Pushed at: about 2 months ago - Stars: 111 - Forks: 33

SSahas/Implementing-GPT-From-Scratch
Building a decoder-only (GPT-style) LLM from scratch using PyTorch and training it for text generation.
Language: Python - Size: 350 KB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 0 - Forks: 0

VigneshKanna18/foodhunter-revenue-drop-analysis
A BI solution developed for FoodHunter to investigate a significant drop in revenue over a four month period. This analysis helps uncover actionable insights through data exploration, visualization and hypothesis-driven analysis to support informed decision-making.
Language: Jupyter Notebook - Size: 9.74 MB - Last synced at: 6 days ago - Pushed at: 16 days ago - Stars: 0 - Forks: 0

Rafat-decodis/Robust-ASR-for-Low-Resource-Languages
Exploring Benchmark Gaps and Real-World Speech Generalization for Language in Low Resource
Language: Jupyter Notebook - Size: 121 KB - Last synced at: 17 days ago - Pushed at: 18 days ago - Stars: 2 - Forks: 0

dawidolko/DataFusion-App-Python
Project as part of the Data Warehousing subject.
Language: Python - Size: 14.7 MB - Last synced at: 4 days ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

Vidhi1290/ZOMATO-DATA-ANALYSIS
Zomato Data Analysis - Explore the world of Zomato restaurant data through Python and data analysis. Uncover trends and insights using Pandas for data manipulation and Matplotlib for visualization. Join us in this journey to reveal the hidden stories within the data!
Language: Jupyter Notebook - Size: 167 KB - Last synced at: 17 days ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 1

srafay/Machine_Learning_A-Z
Learning to create Machine Learning Algorithms
Language: Python - Size: 10.8 MB - Last synced at: 2 days ago - Pushed at: about 4 years ago - Stars: 389 - Forks: 195

l3s-learnweb/interweb
Versatile API that consolidates multiple data providers into one unified interface
Language: Java - Size: 66.5 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 2 - Forks: 0

msamij/zig-flow
Data Engineering pipeline.
Language: Java - Size: 559 KB - Last synced at: 4 days ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

Jorgen98/Lissy
Aplication for public transport system analysis
Language: JavaScript - Size: 25.2 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

aadityasikder/Object-Detection-with-raspberry-pi-implementing-TinyML-models
Repository for Raspberry Pi-based object detection with TinyML models like TensorFlow Lite, PyTorch Nano, including data gathering, mAP evaluation, and image data preparation in Jupyter notebooks.
Language: Jupyter Notebook - Size: 35.7 MB - Last synced at: 6 days ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

brooks-code/data_utils
Collection of data related tools.
Language: Python - Size: 652 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

waikato-datamining/multiway-algorithms
Java library of multi-way algorithms.
Language: Java - Size: 7.36 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 2

nf-core/deepmodeloptim
Stochastic Testing and Input Manipulation for Unbiased Learning Systems
Language: Nextflow - Size: 4.2 MB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 29 - Forks: 14

gojibjib/voice-grabber 📦
Collection of scripts to gather training (meta) data for the ML model
Language: Python - Size: 3.26 MB - Last synced at: 24 days ago - Pushed at: almost 7 years ago - Stars: 3 - Forks: 0

navyapatel2123/cancer_prediction
This project implements a machine learning model (Logistic Regression) trained on the Breast Cancer dataset to predict if a tumor is benign or malignant. It includes a Python script for training the model, a terminal-based prediction tool, and a web application built with Streamlit for interactive predictions.
Language: Python - Size: 60.5 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

Major-Cod3/major_lang
Uma linguagem orientada a hardware projetada para fornecer controle direto e preciso sobre dispositivos e componentes físicos.
Language: Python - Size: 56.6 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

datascientiafoundation/feature-engineering
Feature engineering on LivePeople dataset
Language: Python - Size: 1.83 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

fern-flower-lab/sqlg-clj
The SQL Graph with Tinkerpop3 and Clojure
Language: Clojure - Size: 40 KB - Last synced at: 7 days ago - Pushed at: 2 months ago - Stars: 5 - Forks: 1

Aryan22163/Fraud_detection_bank
A machine learning project for detecting fraudulent credit card transactions using classification algorithms. Focused on handling imbalanced data and optimizing for real-world fraud detection
Language: Jupyter Notebook - Size: 47.4 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

BatthulaVinay/Time-Series-Analysis-Visualization-in-Python
This project demonstrates time series analysis and visualization techniques using Python. The dataset consists of stock market data, and the project includes various preprocessing, statistical tests, and visualization techniques to analyze trends and patterns over time.
Language: Jupyter Notebook - Size: 293 KB - Last synced at: about 16 hours ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

BlockNotes-4515/Spotify-Music-Recommedation-System
Language: Python - Size: 37.4 MB - Last synced at: 3 months ago - Pushed at: 11 months ago - Stars: 4 - Forks: 0

nkaz001/data-tardis
Process tardis.dev cryptocurrency data, reconstructing the market depth and computing imbalance.
Language: Jupyter Notebook - Size: 9.77 KB - Last synced at: 2 months ago - Pushed at: about 4 years ago - Stars: 21 - Forks: 6

AnyegaAlex/Sentiment-Driven-Stock-Price-Prediction-Using-News-Headlines
An end-to-end application that predicts stock price movements using sentiment analysis of financial news headlines. Powered by machine learning, NLP, and real-time data integration, this project offers investors a reliable tool for data-driven decision-making.
Language: JavaScript - Size: 461 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 3 - Forks: 0

Dinesh-2311/predict_flight_delay_datasets
This project explores weather prediction of flight delay data sets building data analysis process using machine learning models and various data analysis & manuplation libraries
Language: Jupyter Notebook - Size: 458 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

Emplifi/BakeryJS
Dataprocessing framework for JavaScript 🎂
Language: TypeScript - Size: 2.61 MB - Last synced at: 6 days ago - Pushed at: over 2 years ago - Stars: 24 - Forks: 6

srimantapal205/DataEngineerWireframeDesigns
Data Engineer Wireframe Designs are essential for planning and visualizing data pipelines, architecture, and workflows before implementation.
Size: 10.7 KB - Last synced at: 22 days ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

FGonzalesc/Transcripcion_AI
Transcripción de audios con Azure Speech y extracción de insights con Open AI
Language: Python - Size: 2.56 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

hq969/Credit-card-fraud-Detection
The Credit Card Fraud Detection project is a machine learning-based system designed to identify fraudulent transactions in real-time. Using historical transaction data, the model classifies transactions as either fraudulent or legitimate, helping financial institutions reduce financial losses and improve security.
Language: Jupyter Notebook - Size: 2.78 MB - Last synced at: 5 days ago - Pushed at: 8 months ago - Stars: 2 - Forks: 1

KarolGongola/yadp
Yet Another Data Platform - Kubernetes based data platform built using open source components
Language: Python - Size: 145 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

hydroboom/Geophysics-Survey-Image-Processing-image-file-to-HEXCODEs
Using PIL and collections to perform image analysis, extracting most common HEXCODE values for grid cells of choice
Language: Jupyter Notebook - Size: 351 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

pedrokehl/caminho
Tool for creating efficient data pipelines in a JavaScript environment
Language: TypeScript - Size: 914 KB - Last synced at: 28 days ago - Pushed at: 4 months ago - Stars: 62 - Forks: 1

prakhar21/50-Days-of-ML
A day to day plan for this challenge (50 Days of Machine Learning) . Covers both theoretical and practical aspects
Language: Jupyter Notebook - Size: 1.15 MB - Last synced at: 2 months ago - Pushed at: over 5 years ago - Stars: 256 - Forks: 59

kkumyk/server-logs-daily-data-pipeline
A data engineering project with dbt, Docker, Kestra, Terraform, GCP and Looker.
Language: HCL - Size: 755 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

neelimabonangi/Real-Time-Weather-Data-Processing
Processes and analyzes near real-time weather data using the Kappa architecture,Apache Kafka,Spark,Cassandra,docker,AWS EC2,spring boot API
Language: Java - Size: 5.73 MB - Last synced at: 1 day ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

erfannjz/Instagram-NFB
A tool to identify users who do not follow back on Instagram
Language: HTML - Size: 15.6 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

RushikeshBihade/Django_Bsased_DataAnalyzer_WebApp
Data Analyzer is a Django web application that enables users to upload CSV files, perform data analysis using pandas and numpy, and view results and visualizations on an interactive web interface. It simplifies data analysis by offering a user-friendly platform for data upload, processing, and visualization.
Language: Python - Size: 2.91 MB - Last synced at: 4 months ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

chunoti/Census-Income-Project
Used various Machine Learning Algorithms to performed a predictive task of classification to predict whether an individual makes over 50K a year or less on the 'US Census Income' dataset.
Language: Jupyter Notebook - Size: 749 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

matancohen1205/SparkleBot-IOT-project
IOT final course project
Language: Python - Size: 688 KB - Last synced at: 3 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 1

Kaushik-Puttaswamy/Dynamic-Movie-Booking-Insights-Platform-Using-Snowflake
The Dynamic Movie Booking Insights Platform processes real-time booking data using Snowflake’s Dynamic Tables, Streams, and Tasks to deliver actionable insights. It features an interactive Streamlit dashboard for visualizing revenue, sales trends, and booking metric.
Language: Python - Size: 940 KB - Last synced at: 3 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

tanisha10101/Machine-Learning-Assignment-1
This project processes a dataset through cleaning, normalization, and transformation, then applies regression algorithms to predict target values and visualize the results.
Language: Jupyter Notebook - Size: 109 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

mei-pan/Tastey_Bytes_in-process
Size: 660 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

Lynk4/Kaggle
This repository houses Python notebooks and scripts that contain solutions to Kaggle competitions.
Language: Jupyter Notebook - Size: 19.3 MB - Last synced at: 2 months ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0

cagandemirmr/Airbnb_Available_Houses
In this repo, i create dashboard using Tableau.In this process, i use SQL and Python languages.
Language: Jupyter Notebook - Size: 151 KB - Last synced at: 2 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

Trident09/net-sec-ai-MP
This project predicts network traffic patterns using a machine learning model trained on the CICIDS dataset. It includes a Streamlit app for real-time predictions, displaying predicted labels and probabilities for uploaded CSV data. The project is structured into three parts: dataset, model training, and frontend (Streamlit app).
Language: Jupyter Notebook - Size: 3.29 MB - Last synced at: 3 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

ngangawairimu/regression-model-for-predicting-house-prices
This project focuses on applying statistical modeling techniques to predict house prices in Melbourne using the Melbourne House Price dataset. It involves data cleaning, exploratory data analysis (EDA), feature selection, and fitting a regression model to predict the target variable, which is the house price.
Language: Jupyter Notebook - Size: 1.66 MB - Last synced at: 19 days ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

jigyasaG18/HR-Analytics-Power-BI-Dashboard
The HR Analytics Power BI Dashboard project focuses on developing a comprehensive tool to analyze and visualize key performance indicators related to employee attrition and retention. It features interactive visualizations that enable HR professionals to explore data, identify trends, and make informed decisions. The dashboard integrates the data.
Size: 17.5 MB - Last synced at: 4 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

jigyasaG18/Financial-Risk-Analysis-Project
The Credit Card Financial Risk Analysis Dashboard is a real-time Power BI tool designed to provide insights into credit card transactions and customer demographics. It features interactive visualizations, efficient data processing, and actionable insights to support decision-making. Utilizing data from SQL database, the dashboard tracks key metrics
Size: 2.54 MB - Last synced at: 4 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

GIPSYDANGER-1/PreppyData
We provide Auto Data Preprocessing for anyone
Language: HTML - Size: 67.4 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

Jean-njoroge/Breast-cancer-risk-prediction
Classification of Breast Cancer diagnosis Using Support Vector Machines
Language: Jupyter Notebook - Size: 1.74 MB - Last synced at: 7 months ago - Pushed at: over 2 years ago - Stars: 237 - Forks: 127

divithraju/divith-raju-Immigration-Data-Engineering
A Capstone Project that covers several aspects of Data Engineering (Data Exploration, Cleaning, Modeling, Pipelining, Processing)
Language: Jupyter Notebook - Size: 2.5 MB - Last synced at: 29 days ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 0

NouranHaitham/ML_WaterQuality
A notebook aimed at predicting and improving water safety by analyzing contaminants and pollution levels in water sources, enhancing public health and ensuring access to clean drinking water.
Language: Jupyter Notebook - Size: 4.81 MB - Last synced at: 3 months ago - Pushed at: 9 months ago - Stars: 1 - Forks: 1

Silent0Wings/TA-Management-System
The TA Management System is a C++ project designed to manage records of Teaching Assistants (TAs) within a department. The system ensures that only eligible TAs—those who are currently registered students—are maintained in the records. The project involves filtering out records of TAs who have graduated and updating the TA file accordingly.
Language: C++ - Size: 110 KB - Last synced at: 4 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

1401Dev/Customer-Lifetime-Value-Prediction
A data science project leveraging Python and Scikit-Learn to build predictive models that estimate customer lifetime value (CLV). Includes data cleaning, feature engineering, and model selection to identify key drivers of CLV, supporting strategic decision-making in customer retention and marketing.
Language: Jupyter Notebook - Size: 173 KB - Last synced at: 4 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

DevPabloOliveira/MatriXplore
Web app for processing, uploading, and downloading matrices using FastAPI. Users can upload CSV files, manually input data, and download pre-set matrices. Includes analysis of matrix properties like functionality, injectivity, and surjectivity, with support for matrix combinations and transpose calculations. Built with FastAPI and Jinja2.
Language: HTML - Size: 486 KB - Last synced at: 3 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

kevinndungu-source/Amazon_EMR_Project_Resources
Explore and replicate Amazon EMR (Elastic MapReduce) setup and utilization for big data processing and analytics tasks, featuring comprehensive demonstrations from VPC creation to Spark job execution.
Language: Jupyter Notebook - Size: 561 KB - Last synced at: 4 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

vinayakdon/Machine-Learning-project-Sentimental-classifier-
A sentiment classification tool using machine learning in Python to analyze and predict the sentiment of text data. Features preprocessing, model training, hyperparameter tuning, and evaluation for accurate sentiment analysis.
Language: Jupyter Notebook - Size: 13.7 KB - Last synced at: 3 months ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0

Zeeshanahmad4/WebScrapeSummarizer
WebScrapeSummarizer 🌐✍️: A web tool that fetches and summarizes content from any domain, offering insights in a compact CSV format.
Language: JavaScript - Size: 38.1 KB - Last synced at: 2 months ago - Pushed at: almost 2 years ago - Stars: 3 - Forks: 1

huseyincenik/data_science
Data Science materials
Language: Jupyter Notebook - Size: 53.1 MB - Last synced at: 3 months ago - Pushed at: 9 months ago - Stars: 3 - Forks: 1

Tanzim-prog/sentiment_analysis_bing_lexicon
The motive of this project is to find out the customer satisfaction of some residential hotels of Dhaka.
Language: R - Size: 1.22 MB - Last synced at: 4 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

BlockNotes-4515/Multiple-Disease-Detection-System-All_In_One-
🩺 Heart Disease Detection System 💓 This AI tool predicts heart disease risk by analyzing key health metrics like age, cholesterol, and blood pressure. 🧠🔍 It provides quick, accurate results to help prevent serious conditions and support early treatment. Perfect for both healthcare pros and patients! 🌟
Language: Jupyter Notebook - Size: 557 KB - Last synced at: 18 days ago - Pushed at: 10 months ago - Stars: 2 - Forks: 1

Udhay2898/Analyzing-Accident-Data-in-Tamil-Nadu-Districts
This project aims to analyze accident data across various districts in Tamil Nadu to understand patterns, trends, and contributing factors. The analysis will leverage data manipulation, statistical techniques, and visualization to derive insights and potentially inform safety measures and policies.
Language: Jupyter Notebook - Size: 3.56 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

OtenMoten/data-chaos-wizard
It's designed to organize your messy data files with one click on your system.
Language: Python - Size: 39.1 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

OtenMoten/real-estate-listings-processor
It's designed to process and analyze real estate listings data, providing valuable market insights through statistical analysis and data visualization.
Language: Python - Size: 37.1 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

melesita/TAMU-Degree-Planner
The TAMU-Degree-Planner is a tool designed to help Texas A&M University students generate a comprehensive degree plan.
Language: Python - Size: 19.5 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

nivasharmaa/FriskWatch
A Java program for analyzing stop-and-frisk data from the NYPD. Features data import, organization, and statistical analysis to compare occurrences during and after policy implementation.
Language: Java - Size: 5.86 KB - Last synced at: 3 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

Sravani13Kasarla/VrindaStore_Analysis
Uncover insights, trends, and patterns within the retail data. Harness the power of data analytics to optimize inventory management, understand customer preferences, and drive strategic decision-making
Size: 5.68 MB - Last synced at: 11 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

niteshkuwarbi/RoadAccident_DataAnalysis-PowerBI
This PowerBI project aims to provide comprehensive insights into road accidents using interactive visualizations and data analysis.
Size: 12.4 MB - Last synced at: 11 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

fazildgr8/wrist_control_CNN_AWEAR
This is the cumulative repository for the research project Deep Learning Approach to Robotic Prosthetic Wrist Control using EMG Signals done in the AWEAR lab. This repository would consist of all the Data processing pipelines codes, custom data preprocessing library built for this project, and all the time series CNN training Jupyter notebooks using the Data collected within the AWEAR Lab, University at Buffalo.
Language: Jupyter Notebook - Size: 541 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

sravanigodavarthi/Automated-ELT-Pipeline-AWS
An Apache Airflow data pipeline is designed to perform ELT operations, utilizing Amazon S3 and Amazon Redshift Serverless.
Language: Python - Size: 48 MB - Last synced at: 18 days ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

kevinndungu-source/Amazon_EMR_Serverless_Demonstration
Explore the capabilities of Amazon EMR Serverless by processing semi-structured review data with Apache Spark, showcasing efficient big data analysis without managing clusters.
Language: Python - Size: 556 KB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

flaviuvadan/pipe-flow
A data processing pipeline library with a common vocabulary API
Language: Go - Size: 160 KB - Last synced at: over 2 years ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

x3O8/Material-Dataset-Analysis
Analysis of Material Datasets to find trends based on composition
Language: Python - Size: 23.1 MB - Last synced at: 14 days ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

soumyagautam/Sign-Sense
Deep Learning and Neural Network based Sign Sense or 'Sign Language' to Speech converter is an desktop app which can detect hand signs in a frame and can convert them to Speech, according to their respective meaning. Opposite to this, it can also recognise your voice and can convert it to sign language.
Language: Python - Size: 8.81 MB - Last synced at: 23 days ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 1

csimplestring/delta-go
Native Delta Lake Implementation in Go
Language: Go - Size: 577 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 35 - Forks: 7

hediyeorhan/LogisticRegressionWithArduino
Language: C - Size: 4.09 MB - Last synced at: 4 months ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

Jay-Amgoth/Customer-Churn-Analysis---Telecom-Industry
This project aims to analyze customer churn in the telecom industry using machine learning techniques. By leveraging Python and various data science libraries, we preprocess the data, perform exploratory data analysis (EDA), and build predictive models to identify factors contributing to customer churn.
Language: Jupyter Notebook - Size: 4.27 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

bobikhsann/indonesia-election-news
displays 2024 election news data from web scraping at detiknews.com
Language: Jupyter Notebook - Size: 601 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

Shubham-Trivedi/Data-analysis-Case-Study
case study to implement and showcase my data analysis skills
Size: 2.49 MB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

yugantgajera/Smarter-speech-recognition-system
English, Madarian and chienese End-to-End Automatic Speech Recognition Using Tensorflow
Language: Python - Size: 127 KB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

pavlutom/kakulator
Kakulator is a Django application, which uses cutting-edge mathematical software to satisfy all your kakulation needs.
Language: JavaScript - Size: 747 KB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

asolimando/xqueryprojector
XQuery query processing optimization based on XML projection
Language: HTML - Size: 15.3 MB - Last synced at: 12 days ago - Pushed at: over 7 years ago - Stars: 1 - Forks: 0

HiAudrey/Clustering-Analysis
How to Make Models Perform Better? NOT JUST Basics of K-means and Hierarchical Clustering Methods!
Language: Jupyter Notebook - Size: 1.43 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

toitoi/kl-properties Fork of hafizio/kl-properties
Shiny framework for Data Science
Language: R - Size: 18.1 MB - Last synced at: about 1 year ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 1

rachitdani/Book-Recommender-System
A streamlit app to recommend books based on users interest
Language: Jupyter Notebook - Size: 26.1 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

BobErgot/Large-Scale-Data-Processing-Design-Patterns
Explore essential MapReduce design patterns for big data processing! This repository includes practical implementations of patterns from the "MapReduce Design Patterns" book, complete with examples across summarization, filtering, organization, joins, and more.
Language: Java - Size: 37 MB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

Priyanshu9898/Medicine-Recommendation-System
Language: Python - Size: 1.33 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

sand47/Udacity-Webinar
Watch my Pytorch FB challenge course webinar on the topic Data preprocessing in DL and ML
Language: Jupyter Notebook - Size: 635 KB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 0

sand47/AI-Projects
List of all my AI Projects
Language: Jupyter Notebook - Size: 5.54 MB - Last synced at: over 1 year ago - Pushed at: almost 4 years ago - Stars: 6 - Forks: 2

xeniaqian94/11-775-Large-Scale-Multimedia-Analysis
Implementations on computer vision, audio and speech processing, multi-media files and streaming, multi-modal signal processing, video retrieval, semantics, and text generation.
Language: Python - Size: 4.2 MB - Last synced at: over 1 year ago - Pushed at: over 8 years ago - Stars: 2 - Forks: 3

Huan-YiShen/QPL
This repository contains the author's ongoing work at the QPL research lab in the University of Waterloo as an undergraduate research assistant. For more information about the research group, please visit the lab website.
Language: Python - Size: 31.3 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

MA-Repos/Kafka-stream-from-reddit-twitter
(Work in Progress) Real time data from reddit and then apply sentimental analysis. Store the data in Hadoop as well.
Language: Java - Size: 6.84 KB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

SyedFahad7/Data-Processing
📊 Data Processing Script Automates data processing for CSV datasets using Python. Performs tasks like handling missing values, data transformation, and generating visualizations. Versatile and efficient for various data processing needs
Language: Python - Size: 407 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

tkrojer/XChemExplorer
A graphical batch data processing tool for protein crystallography
Language: Python - Size: 70.6 MB - Last synced at: about 1 year ago - Pushed at: over 4 years ago - Stars: 6 - Forks: 13

kanishkan91/Python-DataUpdate-DataProcessor-kbn
The python module can be used to scrape data and process data from different sources. The python module can output data as either as a dataframe in the country year format or it will output data in excel files This module has primarily been created for processing data for the International Futures (IFs) Project however, it can be used to process data in general. The module can be used to process data from the following sources, 1) World Bank World Development Indicators (WDI) 2) UNESCO Education indicators(UIS) 3) FAO Food Balance Sheets (FAO) 4) IMF Global Finance Statistics (IMF GFS) 5) Health data from the Institute for Health and Metric Evaluation (IHME) 6) Water data from FAO AQUASTAT 7) Energy data from EIA Currently this module can be run as is on Windows. For usage on Macs, the user may have to make changes to the code lines which specify paths.
Language: Python - Size: 1.32 GB - Last synced at: about 1 year ago - Pushed at: over 5 years ago - Stars: 15 - Forks: 8

mehadihn/Data-Preparation-Techniques-Project
This project was completed for the data preparation techniques course.
Language: Jupyter Notebook - Size: 1.24 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0
