An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: dataprocessing

SourabhSingh9/Data-Analytics-Projects-by-Sourabh

HR Analytics (Attrition Insights) Portfolio

Size: 1.99 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 0 - Forks: 0

soheb120/Breast-Cancer-Classification

This repository contains a notebook for classifying breast cancer tumors using neural networks. It covers dataset exploration, model training with TensorFlow and Keras, and performance evaluation. 🩺💻

Language: Jupyter Notebook - Size: 21.5 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

Ricco1010/ricco

A personal toolset built over time by Ricco

Language: Python - Size: 5.38 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 3 - Forks: 0

Saffrontea/dx

dx - Enhanced JavaScript Data Processing for the Command Line

Language: TypeScript - Size: 95.7 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 0 - Forks: 0

BioFSharp/BioFSharp

Open source bioinformatics and computational biology toolbox written in F#. This is the core package containing type models and parsers/writers.

Language: F# - Size: 311 MB - Last synced at: 6 days ago - Pushed at: about 2 months ago - Stars: 111 - Forks: 33

SSahas/Implementing-GPT-From-Scratch

Building a decoder-only (GPT-style) LLM from scratch using PyTorch and training it for text generation.

Language: Python - Size: 350 KB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 0 - Forks: 0

VigneshKanna18/foodhunter-revenue-drop-analysis

A BI solution developed for FoodHunter to investigate a significant drop in revenue over a four-month period. The analysis uncovers actionable insights through data exploration, visualization, and hypothesis-driven analysis to support informed decision-making.

Language: Jupyter Notebook - Size: 9.74 MB - Last synced at: 6 days ago - Pushed at: 16 days ago - Stars: 0 - Forks: 0

Rafat-decodis/Robust-ASR-for-Low-Resource-Languages

Exploring Benchmark Gaps and Real-World Speech Generalization for Low-Resource Languages

Language: Jupyter Notebook - Size: 121 KB - Last synced at: 17 days ago - Pushed at: 18 days ago - Stars: 2 - Forks: 0

dawidolko/DataFusion-App-Python

Project as part of the Data Warehousing subject.

Language: Python - Size: 14.7 MB - Last synced at: 4 days ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

Vidhi1290/ZOMATO-DATA-ANALYSIS

Zomato Data Analysis - Explore the world of Zomato restaurant data through Python and data analysis. Uncover trends and insights using Pandas for data manipulation and Matplotlib for visualization. Join us in this journey to reveal the hidden stories within the data!

Language: Jupyter Notebook - Size: 167 KB - Last synced at: 17 days ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 1

srafay/Machine_Learning_A-Z

Learning to create Machine Learning Algorithms

Language: Python - Size: 10.8 MB - Last synced at: 2 days ago - Pushed at: about 4 years ago - Stars: 389 - Forks: 195

l3s-learnweb/interweb

Versatile API that consolidates multiple data providers into one unified interface

Language: Java - Size: 66.5 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 2 - Forks: 0

msamij/zig-flow

Data Engineering pipeline.

Language: Java - Size: 559 KB - Last synced at: 4 days ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

Jorgen98/Lissy

Application for public transport system analysis

Language: JavaScript - Size: 25.2 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

aadityasikder/Object-Detection-with-raspberry-pi-implementing-TinyML-models

Repository for Raspberry Pi-based object detection with TinyML models like TensorFlow Lite, PyTorch Nano, including data gathering, mAP evaluation, and image data preparation in Jupyter notebooks.

Language: Jupyter Notebook - Size: 35.7 MB - Last synced at: 6 days ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

brooks-code/data_utils

Collection of data related tools.

Language: Python - Size: 652 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

waikato-datamining/multiway-algorithms

Java library of multi-way algorithms.

Language: Java - Size: 7.36 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 2

nf-core/deepmodeloptim

Stochastic Testing and Input Manipulation for Unbiased Learning Systems

Language: Nextflow - Size: 4.2 MB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 29 - Forks: 14

gojibjib/voice-grabber 📦

Collection of scripts to gather training (meta) data for the ML model

Language: Python - Size: 3.26 MB - Last synced at: 24 days ago - Pushed at: almost 7 years ago - Stars: 3 - Forks: 0

navyapatel2123/cancer_prediction

This project implements a machine learning model (Logistic Regression) trained on the Breast Cancer dataset to predict if a tumor is benign or malignant. It includes a Python script for training the model, a terminal-based prediction tool, and a web application built with Streamlit for interactive predictions.

Language: Python - Size: 60.5 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

Major-Cod3/major_lang

A hardware-oriented language designed to provide direct and precise control over devices and physical components.

Language: Python - Size: 56.6 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

datascientiafoundation/feature-engineering

Feature engineering on LivePeople dataset

Language: Python - Size: 1.83 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

fern-flower-lab/sqlg-clj

The SQL Graph with Tinkerpop3 and Clojure

Language: Clojure - Size: 40 KB - Last synced at: 7 days ago - Pushed at: 2 months ago - Stars: 5 - Forks: 1

Aryan22163/Fraud_detection_bank

A machine learning project for detecting fraudulent credit card transactions using classification algorithms. Focused on handling imbalanced data and optimizing for real-world fraud detection

Language: Jupyter Notebook - Size: 47.4 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

BatthulaVinay/Time-Series-Analysis-Visualization-in-Python

This project demonstrates time series analysis and visualization techniques using Python. The dataset consists of stock market data, and the project includes various preprocessing, statistical tests, and visualization techniques to analyze trends and patterns over time.

Language: Jupyter Notebook - Size: 293 KB - Last synced at: about 16 hours ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

BlockNotes-4515/Spotify-Music-Recommedation-System

Language: Python - Size: 37.4 MB - Last synced at: 3 months ago - Pushed at: 11 months ago - Stars: 4 - Forks: 0

nkaz001/data-tardis

Process tardis.dev cryptocurrency data, reconstructing the market depth and computing imbalance.

Language: Jupyter Notebook - Size: 9.77 KB - Last synced at: 2 months ago - Pushed at: about 4 years ago - Stars: 21 - Forks: 6

AnyegaAlex/Sentiment-Driven-Stock-Price-Prediction-Using-News-Headlines

An end-to-end application that predicts stock price movements using sentiment analysis of financial news headlines. Powered by machine learning, NLP, and real-time data integration, this project offers investors a reliable tool for data-driven decision-making.

Language: JavaScript - Size: 461 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 3 - Forks: 0

Dinesh-2311/predict_flight_delay_datasets

This project explores weather-based prediction on flight delay datasets, building a data analysis process using machine learning models and various data analysis and manipulation libraries.

Language: Jupyter Notebook - Size: 458 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

Emplifi/BakeryJS

Dataprocessing framework for JavaScript 🎂

Language: TypeScript - Size: 2.61 MB - Last synced at: 6 days ago - Pushed at: over 2 years ago - Stars: 24 - Forks: 6

srimantapal205/DataEngineerWireframeDesigns

Data Engineer Wireframe Designs are essential for planning and visualizing data pipelines, architecture, and workflows before implementation.

Size: 10.7 KB - Last synced at: 22 days ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

FGonzalesc/Transcripcion_AI

Audio transcription with Azure Speech and insight extraction with OpenAI

Language: Python - Size: 2.56 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

hq969/Credit-card-fraud-Detection

The Credit Card Fraud Detection project is a machine learning-based system designed to identify fraudulent transactions in real-time. Using historical transaction data, the model classifies transactions as either fraudulent or legitimate, helping financial institutions reduce financial losses and improve security.

Language: Jupyter Notebook - Size: 2.78 MB - Last synced at: 5 days ago - Pushed at: 8 months ago - Stars: 2 - Forks: 1

KarolGongola/yadp

Yet Another Data Platform - Kubernetes based data platform built using open source components

Language: Python - Size: 145 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

hydroboom/Geophysics-Survey-Image-Processing-image-file-to-HEXCODEs

Using PIL and collections to perform image analysis, extracting most common HEXCODE values for grid cells of choice

Language: Jupyter Notebook - Size: 351 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

pedrokehl/caminho

Tool for creating efficient data pipelines in a JavaScript environment

Language: TypeScript - Size: 914 KB - Last synced at: 28 days ago - Pushed at: 4 months ago - Stars: 62 - Forks: 1

prakhar21/50-Days-of-ML

A day-to-day plan for this challenge (50 Days of Machine Learning). Covers both theoretical and practical aspects.

Language: Jupyter Notebook - Size: 1.15 MB - Last synced at: 2 months ago - Pushed at: over 5 years ago - Stars: 256 - Forks: 59

kkumyk/server-logs-daily-data-pipeline

A data engineering project with dbt, Docker, Kestra, Terraform, GCP and Looker.

Language: HCL - Size: 755 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

neelimabonangi/Real-Time-Weather-Data-Processing

Processes and analyzes near real-time weather data using the Kappa architecture, Apache Kafka, Spark, Cassandra, Docker, AWS EC2, and a Spring Boot API

Language: Java - Size: 5.73 MB - Last synced at: 1 day ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

erfannjz/Instagram-NFB

A tool to identify users who do not follow back on Instagram

Language: HTML - Size: 15.6 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

RushikeshBihade/Django_Bsased_DataAnalyzer_WebApp

Data Analyzer is a Django web application that enables users to upload CSV files, perform data analysis using pandas and numpy, and view results and visualizations on an interactive web interface. It simplifies data analysis by offering a user-friendly platform for data upload, processing, and visualization.

Language: Python - Size: 2.91 MB - Last synced at: 4 months ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

chunoti/Census-Income-Project

Used various machine learning algorithms to perform a classification task: predicting whether an individual makes over 50K a year or less on the 'US Census Income' dataset.

Language: Jupyter Notebook - Size: 749 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

matancohen1205/SparkleBot-IOT-project

IOT final course project

Language: Python - Size: 688 KB - Last synced at: 3 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 1

Kaushik-Puttaswamy/Dynamic-Movie-Booking-Insights-Platform-Using-Snowflake

The Dynamic Movie Booking Insights Platform processes real-time booking data using Snowflake’s Dynamic Tables, Streams, and Tasks to deliver actionable insights. It features an interactive Streamlit dashboard for visualizing revenue, sales trends, and booking metrics.

Language: Python - Size: 940 KB - Last synced at: 3 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

tanisha10101/Machine-Learning-Assignment-1

This project processes a dataset through cleaning, normalization, and transformation, then applies regression algorithms to predict target values and visualize the results.

Language: Jupyter Notebook - Size: 109 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

mei-pan/Tastey_Bytes_in-process

Size: 660 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

Lynk4/Kaggle

This repository houses Python notebooks and scripts that contain solutions to Kaggle competitions.

Language: Jupyter Notebook - Size: 19.3 MB - Last synced at: 2 months ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0

cagandemirmr/Airbnb_Available_Houses

In this repo, I create a dashboard using Tableau. In the process, I use SQL and Python.

Language: Jupyter Notebook - Size: 151 KB - Last synced at: 2 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

Trident09/net-sec-ai-MP

This project predicts network traffic patterns using a machine learning model trained on the CICIDS dataset. It includes a Streamlit app for real-time predictions, displaying predicted labels and probabilities for uploaded CSV data. The project is structured into three parts: dataset, model training, and frontend (Streamlit app).

Language: Jupyter Notebook - Size: 3.29 MB - Last synced at: 3 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

ngangawairimu/regression-model-for-predicting-house-prices

This project focuses on applying statistical modeling techniques to predict house prices in Melbourne using the Melbourne House Price dataset. It involves data cleaning, exploratory data analysis (EDA), feature selection, and fitting a regression model to predict the target variable, which is the house price.

Language: Jupyter Notebook - Size: 1.66 MB - Last synced at: 19 days ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

jigyasaG18/HR-Analytics-Power-BI-Dashboard

The HR Analytics Power BI Dashboard project focuses on developing a comprehensive tool to analyze and visualize key performance indicators related to employee attrition and retention. It features interactive visualizations that enable HR professionals to explore data, identify trends, and make informed decisions. The dashboard integrates the data.

Size: 17.5 MB - Last synced at: 4 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

jigyasaG18/Financial-Risk-Analysis-Project

The Credit Card Financial Risk Analysis Dashboard is a real-time Power BI tool designed to provide insights into credit card transactions and customer demographics. It features interactive visualizations, efficient data processing, and actionable insights to support decision-making. Utilizing data from SQL database, the dashboard tracks key metrics

Size: 2.54 MB - Last synced at: 4 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

GIPSYDANGER-1/PreppyData

We provide Auto Data Preprocessing for anyone

Language: HTML - Size: 67.4 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

Jean-njoroge/Breast-cancer-risk-prediction

Classification of Breast Cancer diagnosis Using Support Vector Machines

Language: Jupyter Notebook - Size: 1.74 MB - Last synced at: 7 months ago - Pushed at: over 2 years ago - Stars: 237 - Forks: 127

divithraju/divith-raju-Immigration-Data-Engineering

A Capstone Project that covers several aspects of Data Engineering (Data Exploration, Cleaning, Modeling, Pipelining, Processing)

Language: Jupyter Notebook - Size: 2.5 MB - Last synced at: 29 days ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 0

NouranHaitham/ML_WaterQuality

A notebook aimed at predicting and improving water safety by analyzing contaminants and pollution levels in water sources, enhancing public health and ensuring access to clean drinking water.

Language: Jupyter Notebook - Size: 4.81 MB - Last synced at: 3 months ago - Pushed at: 9 months ago - Stars: 1 - Forks: 1

Silent0Wings/TA-Management-System

The TA Management System is a C++ project designed to manage records of Teaching Assistants (TAs) within a department. The system ensures that only eligible TAs—those who are currently registered students—are maintained in the records. The project involves filtering out records of TAs who have graduated and updating the TA file accordingly.

Language: C++ - Size: 110 KB - Last synced at: 4 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

1401Dev/Customer-Lifetime-Value-Prediction

A data science project leveraging Python and Scikit-Learn to build predictive models that estimate customer lifetime value (CLV). Includes data cleaning, feature engineering, and model selection to identify key drivers of CLV, supporting strategic decision-making in customer retention and marketing.

Language: Jupyter Notebook - Size: 173 KB - Last synced at: 4 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

DevPabloOliveira/MatriXplore

Web app for processing, uploading, and downloading matrices using FastAPI. Users can upload CSV files, manually input data, and download pre-set matrices. Includes analysis of matrix properties like functionality, injectivity, and surjectivity, with support for matrix combinations and transpose calculations. Built with FastAPI and Jinja2.

Language: HTML - Size: 486 KB - Last synced at: 3 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

kevinndungu-source/Amazon_EMR_Project_Resources

Explore and replicate Amazon EMR (Elastic MapReduce) setup and utilization for big data processing and analytics tasks, featuring comprehensive demonstrations from VPC creation to Spark job execution.

Language: Jupyter Notebook - Size: 561 KB - Last synced at: 4 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

vinayakdon/Machine-Learning-project-Sentimental-classifier-

A sentiment classification tool using machine learning in Python to analyze and predict the sentiment of text data. Features preprocessing, model training, hyperparameter tuning, and evaluation for accurate sentiment analysis.

Language: Jupyter Notebook - Size: 13.7 KB - Last synced at: 3 months ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0

Zeeshanahmad4/WebScrapeSummarizer

WebScrapeSummarizer 🌐✍️: A web tool that fetches and summarizes content from any domain, offering insights in a compact CSV format.

Language: JavaScript - Size: 38.1 KB - Last synced at: 2 months ago - Pushed at: almost 2 years ago - Stars: 3 - Forks: 1

huseyincenik/data_science

Data Science materials

Language: Jupyter Notebook - Size: 53.1 MB - Last synced at: 3 months ago - Pushed at: 9 months ago - Stars: 3 - Forks: 1

Tanzim-prog/sentiment_analysis_bing_lexicon

The aim of this project is to measure customer satisfaction at some residential hotels in Dhaka.

Language: R - Size: 1.22 MB - Last synced at: 4 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

BlockNotes-4515/Multiple-Disease-Detection-System-All_In_One-

🩺 Heart Disease Detection System 💓 This AI tool predicts heart disease risk by analyzing key health metrics like age, cholesterol, and blood pressure. 🧠🔍 It provides quick, accurate results to help prevent serious conditions and support early treatment. Perfect for both healthcare pros and patients! 🌟

Language: Jupyter Notebook - Size: 557 KB - Last synced at: 18 days ago - Pushed at: 10 months ago - Stars: 2 - Forks: 1

Udhay2898/Analyzing-Accident-Data-in-Tamil-Nadu-Districts

This project aims to analyze accident data across various districts in Tamil Nadu to understand patterns, trends, and contributing factors. The analysis will leverage data manipulation, statistical techniques, and visualization to derive insights and potentially inform safety measures and policies.

Language: Jupyter Notebook - Size: 3.56 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

OtenMoten/data-chaos-wizard

It's designed to organize your messy data files with one click on your system.

Language: Python - Size: 39.1 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

OtenMoten/real-estate-listings-processor

It's designed to process and analyze real estate listings data, providing valuable market insights through statistical analysis and data visualization.

Language: Python - Size: 37.1 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

melesita/TAMU-Degree-Planner

The TAMU-Degree-Planner is a tool designed to help Texas A&M University students generate a comprehensive degree plan.

Language: Python - Size: 19.5 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

nivasharmaa/FriskWatch

A Java program for analyzing stop-and-frisk data from the NYPD. Features data import, organization, and statistical analysis to compare occurrences during and after policy implementation.

Language: Java - Size: 5.86 KB - Last synced at: 3 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

Sravani13Kasarla/VrindaStore_Analysis

Uncover insights, trends, and patterns within the retail data. Harness the power of data analytics to optimize inventory management, understand customer preferences, and drive strategic decision-making

Size: 5.68 MB - Last synced at: 11 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

niteshkuwarbi/RoadAccident_DataAnalysis-PowerBI

This PowerBI project aims to provide comprehensive insights into road accidents using interactive visualizations and data analysis.

Size: 12.4 MB - Last synced at: 11 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

fazildgr8/wrist_control_CNN_AWEAR

This is the cumulative repository for the research project "Deep Learning Approach to Robotic Prosthetic Wrist Control using EMG Signals", done in the AWEAR Lab. It contains all the data processing pipeline code, a custom data preprocessing library built for this project, and all the time series CNN training Jupyter notebooks using the data collected within the AWEAR Lab, University at Buffalo.

Language: Jupyter Notebook - Size: 541 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

sravanigodavarthi/Automated-ELT-Pipeline-AWS

An Apache Airflow data pipeline is designed to perform ELT operations, utilizing Amazon S3 and Amazon Redshift Serverless.

Language: Python - Size: 48 MB - Last synced at: 18 days ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

kevinndungu-source/Amazon_EMR_Serverless_Demonstration

Explore the capabilities of Amazon EMR Serverless by processing semi-structured review data with Apache Spark, showcasing efficient big data analysis without managing clusters.

Language: Python - Size: 556 KB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

flaviuvadan/pipe-flow

A data processing pipeline library with a common vocabulary API

Language: Go - Size: 160 KB - Last synced at: over 2 years ago - Pushed at: about 5 years ago - Stars: 0 - Forks: 0

x3O8/Material-Dataset-Analysis

Analysis of Material Datasets to find trends based on composition

Language: Python - Size: 23.1 MB - Last synced at: 14 days ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

soumyagautam/Sign-Sense

Sign Sense is a deep learning and neural network based sign language to speech converter. It is a desktop app that detects hand signs in a frame and converts them to speech according to their meaning. Conversely, it can also recognise your voice and convert it to sign language.

Language: Python - Size: 8.81 MB - Last synced at: 23 days ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 1

csimplestring/delta-go

Native Delta Lake Implementation in Go

Language: Go - Size: 577 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 35 - Forks: 7

hediyeorhan/LogisticRegressionWithArduino

Language: C - Size: 4.09 MB - Last synced at: 4 months ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

Jay-Amgoth/Customer-Churn-Analysis---Telecom-Industry

This project aims to analyze customer churn in the telecom industry using machine learning techniques. By leveraging Python and various data science libraries, we preprocess the data, perform exploratory data analysis (EDA), and build predictive models to identify factors contributing to customer churn.

Language: Jupyter Notebook - Size: 4.27 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

bobikhsann/indonesia-election-news

Displays 2024 election news data scraped from detiknews.com

Language: Jupyter Notebook - Size: 601 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

Shubham-Trivedi/Data-analysis-Case-Study

Case study to implement and showcase my data analysis skills

Size: 2.49 MB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

yugantgajera/Smarter-speech-recognition-system

English, Mandarin and Chinese End-to-End Automatic Speech Recognition Using TensorFlow

Language: Python - Size: 127 KB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

pavlutom/kakulator

Kakulator is a Django application, which uses cutting-edge mathematical software to satisfy all your kakulation needs.

Language: JavaScript - Size: 747 KB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

asolimando/xqueryprojector

XQuery query processing optimization based on XML projection

Language: HTML - Size: 15.3 MB - Last synced at: 12 days ago - Pushed at: over 7 years ago - Stars: 1 - Forks: 0

HiAudrey/Clustering-Analysis

How to Make Models Perform Better? NOT JUST Basics of K-means and Hierarchical Clustering Methods!

Language: Jupyter Notebook - Size: 1.43 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

toitoi/kl-properties Fork of hafizio/kl-properties

Shiny framework for Data Science

Language: R - Size: 18.1 MB - Last synced at: about 1 year ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 1

rachitdani/Book-Recommender-System

A streamlit app to recommend books based on users interest

Language: Jupyter Notebook - Size: 26.1 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

BobErgot/Large-Scale-Data-Processing-Design-Patterns

Explore essential MapReduce design patterns for big data processing! This repository includes practical implementations of patterns from the "MapReduce Design Patterns" book, complete with examples across summarization, filtering, organization, joins, and more.

Language: Java - Size: 37 MB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

Priyanshu9898/Medicine-Recommendation-System

Language: Python - Size: 1.33 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

sand47/Udacity-Webinar

Watch my PyTorch FB challenge course webinar on data preprocessing in DL and ML

Language: Jupyter Notebook - Size: 635 KB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 0

sand47/AI-Projects

List of all my AI Projects

Language: Jupyter Notebook - Size: 5.54 MB - Last synced at: over 1 year ago - Pushed at: almost 4 years ago - Stars: 6 - Forks: 2

xeniaqian94/11-775-Large-Scale-Multimedia-Analysis

Implementations on computer vision, audio and speech processing, multi-media files and streaming, multi-modal signal processing, video retrieval, semantics, and text generation.

Language: Python - Size: 4.2 MB - Last synced at: over 1 year ago - Pushed at: over 8 years ago - Stars: 2 - Forks: 3

Huan-YiShen/QPL

This repository contains the author's ongoing work at the QPL research lab in the University of Waterloo as an undergraduate research assistant. For more information about the research group, please visit the lab website.

Language: Python - Size: 31.3 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

MA-Repos/Kafka-stream-from-reddit-twitter

(Work in Progress) Collect real-time data from Reddit and apply sentiment analysis. Store the data in Hadoop as well.

Language: Java - Size: 6.84 KB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

SyedFahad7/Data-Processing

📊 Data Processing Script: automates data processing for CSV datasets using Python. Performs tasks like handling missing values, data transformation, and generating visualizations. Versatile and efficient for various data processing needs.

Language: Python - Size: 407 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

tkrojer/XChemExplorer

A graphical batch data processing tool for protein crystallography

Language: Python - Size: 70.6 MB - Last synced at: about 1 year ago - Pushed at: over 4 years ago - Stars: 6 - Forks: 13

kanishkan91/Python-DataUpdate-DataProcessor-kbn

This Python module can be used to scrape and process data from different sources. It can output data either as a dataframe in the country-year format or as Excel files. The module was primarily created for processing data for the International Futures (IFs) Project; however, it can be used to process data in general. It can process data from the following sources: 1) World Bank World Development Indicators (WDI); 2) UNESCO education indicators (UIS); 3) FAO Food Balance Sheets (FAO); 4) IMF Global Finance Statistics (IMF GFS); 5) health data from the Institute for Health Metrics and Evaluation (IHME); 6) water data from FAO AQUASTAT; 7) energy data from EIA. Currently this module can be run as is on Windows. For usage on Macs, the user may have to change the code lines that specify paths.

Language: Python - Size: 1.32 GB - Last synced at: about 1 year ago - Pushed at: over 5 years ago - Stars: 15 - Forks: 8

mehadihn/Data-Preparation-Techniques-Project

This project was completed for the data preparation techniques course.

Language: Jupyter Notebook - Size: 1.24 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0