An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: data-mining-python

rainman226/holte-1r

An implementation of Holte's 1R discretizer

Language: Python - Size: 8.79 KB - Last synced at: 8 days ago - Pushed at: 17 days ago - Stars: 0 - Forks: 0

daniau23/topic_modelling_one

Use of Topic modelling on scraped tweets

Language: HTML - Size: 46.9 MB - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 1 - Forks: 0

cintia-shinoda/univesp

Data Science Undergrad Notes, Code, and Homeworks

Language: Jupyter Notebook - Size: 69 MB - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 0 - Forks: 0

Shakiba-Alipour/Data-Mining-Project

Data mining on university of twente website

Language: Python - Size: 48.8 KB - Last synced at: 24 days ago - Pushed at: 24 days ago - Stars: 1 - Forks: 0

shubhro2002/ECLAT-and-CLOSET-plus-Algorithms

The project focuses on exploring two specific Association Rule Mining Algorithms - ECLAT and CLOSET+. This is a continuation of Market Basket Analysis project. A transaction dataset has been used containing grocery data. Link to the dataset is given below.

Language: Jupyter Notebook - Size: 146 KB - Last synced at: 30 days ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

andreist3fan/vitejii-dm-2

Second Data Mining assignment of the TU Delft course, centered on Recommender Systems.

Language: Jupyter Notebook - Size: 10.3 MB - Last synced at: 18 days ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

LatiefDataVisionary/data-mining-college-task

Language: Jupyter Notebook - Size: 38.7 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

MiraZzle/cz-real-estate-analysis

Analysis of Czech real estate prices

Language: Python - Size: 4.25 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

Carmoruda/Data_Mining

Prácticas de la asignatura Data Mining, enfocadas en la exploración, visualización y análisis de datos

Language: Jupyter Notebook - Size: 27.6 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

ahmedshahriar/youtube-comment-scraper

This script will dump youtube video comments to a CSV from youtube video links. Video links can be placed inside a variable or list or CSV

Language: Jupyter Notebook - Size: 256 KB - Last synced at: 2 months ago - Pushed at: over 3 years ago - Stars: 42 - Forks: 15

InPhyT/IMDb_Sentiment_Analysis_BERT

BERT Sentiment Classification on the IMDb Large Movie Review Dataset.

Language: Jupyter Notebook - Size: 972 KB - Last synced at: 2 months ago - Pushed at: almost 3 years ago - Stars: 16 - Forks: 0

geoamins/Machine-Learning-IMS

Learning for Learners

Language: Jupyter Notebook - Size: 59.4 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 7 - Forks: 8

Fraggle460/Data-Mining-

These are a series of data mining workshops using Jupyter Notebook v 7.0.8 running on Python 3.12.4 programming language.

Size: 0 Bytes - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

harshita2234/Potato-Prices-Prediction

Project aims to forecast potato prices in India using LSTM, KNN, and Random Forest Regression, integrating historical data on prices, regional stats, and rainfall patterns. Targeting agricultural stakeholders for informed decision-making.

Language: Python - Size: 862 KB - Last synced at: 23 days ago - Pushed at: about 1 year ago - Stars: 3 - Forks: 0

stupidcucumber/elephant-crawler

System for mining texts from websites.

Language: Python - Size: 111 KB - Last synced at: 4 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

IftekherAziz/Causality-Mining

Data Mining

Language: Jupyter Notebook - Size: 122 KB - Last synced at: 3 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

karthik-d/nyc-taxi-dataset-eda

Clearning, transformation and analysis large datasets as part of coursework for UCS1629: Data Warehousing and Data Mining.

Language: Jupyter Notebook - Size: 9.79 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 1 - Forks: 1

bjam24/krs-web-scraper

Language: Python - Size: 11.5 MB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 1

TufayelLUS/Leadersleague.com-Scraper

This is a Python based web scraper that scrapes a list of URLs from LeadersLeague.com website that stores email, first name, last name and other leads from the website automatically

Language: Python - Size: 6.84 KB - Last synced at: 4 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

codeasarjun/web-scraping

This repo contains working example for web scraping

Language: Python - Size: 22.5 KB - Last synced at: 4 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Santa-Clara-Imaginarium-Lab/twitter-scraping-with-python

Twitter Scraping with Python!

Language: Python - Size: 36.1 KB - Last synced at: 11 months ago - Pushed at: almost 4 years ago - Stars: 6 - Forks: 2

iigorsap/data-projects

Here in this repository I share my data projects and challenges that I used to put my knowledge into practice. 🎯📚☕

Language: Jupyter Notebook - Size: 32.1 MB - Last synced at: 11 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Isurie/Text-Classification-Module

Sinhala text extraction, preprocessing, and classification considering subject and domain.

Language: Jupyter Notebook - Size: 2.7 MB - Last synced at: 12 months ago - Pushed at: almost 4 years ago - Stars: 3 - Forks: 2

dj-riley/harley-dealership-mining 📦

Data mining US Harley Davidson dealership information

Language: Python - Size: 6.07 MB - Last synced at: 12 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

Sykr-31/Data-Mining-Association-Algorithm-FP-Growth-GoogleCollab-JupyterNotebook

Script tersebut menerapkan kerangka kerja Knowledge Discovery in Database (KDD) yang mencakup data mining dengan metode algoritma FP-Growth untuk menemukan hubungan asosiasi antar item.

Language: Jupyter Notebook - Size: 16.6 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

MichalRedm/DM-project1

This repository contains the code for the first project for Data Mining classes at the Poznań University of Technology, created by the team Kung Fu Pandas.

Language: Jupyter Notebook - Size: 79.6 MB - Last synced at: 4 months ago - Pushed at: about 1 year ago - Stars: 3 - Forks: 0

J-E-J-S/pyminer

A Python CLI for Mining Scientific Literature.

Language: Python - Size: 30.8 MB - Last synced at: 26 days ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 1

Eng-ZeyadTarek/data-mining-dojo

Implementations of data mining techniques using machine learning and deep learning models.

Language: Jupyter Notebook - Size: 5.77 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

ketgg/Flix

Movie recommendation system API. Part of which was made for university course project.

Language: Python - Size: 16.2 MB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

l0g1c-80m8/data-mining-assignments

Repo to contain the assignments for DSCI 553: Foundations and Applications of Data Mining course at USC

Language: Python - Size: 34.6 MB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

nadim365/EECS_4412_A2

Learning how to make a simple Decision Tree

Language: Jupyter Notebook - Size: 118 KB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Redrrx/ProxyNest

Managing proxies for scaled data scraping and other automation operations will eventually require something like ProxyNest. ProxyNest is a proxy managment API that is well-suited for mid-scale and will soon be made for large ones.

Language: Python - Size: 2.8 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 10 - Forks: 0

WillCaton2350/Real-Estate-WebCrawler

A Real Estate Web Crawler and data pipeline, developed using Python and Scrapy, facilitates the ETL process through multiple stages. It extracts metadata for apartments in Milan, Italy, from various web pages and URLs on sublet.com. The extracted information is then structured using Scrapy items and saved in JSON format

Language: Python - Size: 17.6 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

sheetalkalburgi/web-scraping

Web scraping algorithm for FDA and Health Canada website

Language: Python - Size: 13.7 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 0

saifalimz/sudobotz.com

Transforming Ideas into Intelligent Automation

Language: SCSS - Size: 11.8 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

mghorbani2357/TT-Miner-Topology-Transaction-Miner-for-Mining-Closed-Itemset

Language: Python - Size: 28 MB - Last synced at: 3 months ago - Pushed at: about 4 years ago - Stars: 1 - Forks: 0

ShreyPatel4/Advanced-MOOC-Result-Scraper-

Advanced Automated Data-Mining Tool For MOOC Result to Scrap in one click.

Language: Python - Size: 6.87 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

lekhazadapa/Coding

This Repo gives you an idea of my work with Python codes using different libraries (TensorFlow, NumPy, SciPy, Pandas, Matplotlib, Keras, SciKit-Learn,) and methods in the field of Machine Learning.

Language: Jupyter Notebook - Size: 213 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

vraul92/NLP-on-Whatsapp-Group-Chat

Applying NLP techniques on WhatsApp text to gain insights.

Language: Jupyter Notebook - Size: 128 KB - Last synced at: over 1 year ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 1

DABallentine/knowledge_discovery_charlotte

This project is a course requirement for DSBA-6162, Knowledge Discovery in Databases, at UNCC, Fall 2021.

Language: Jupyter Notebook - Size: 286 MB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 2

knyghtmare/vscode-remote-try-python-data-science

A Template Repository which sets as up a Python environment packed with necessary and popular data science packages.

Language: Python - Size: 16.6 KB - Last synced at: 18 days ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

sunnysidedenver/swpc_27day

This study evaluates the performance of F10.7 cm forecasts found in SWPC's 'Weekly Highlights and 27-day Forecast' product in PDF format from 2020-2023.

Language: Jupyter Notebook - Size: 10.1 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

lokhiufung/webscraping-buddy

Web scrapers for instagram, XHS and investor contacts

Language: Python - Size: 83 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 1

Sitaras/Data-Mining

Project 1: 🎬🍿 Movie-Recommendation-System, Project 2: 📰🔍Fake News Detection System

Language: Jupyter Notebook - Size: 9.3 MB - Last synced at: over 1 year ago - Pushed at: about 3 years ago - Stars: 6 - Forks: 0

yaelahgus/Data-Mining

Data Mining on Data.csv with colab research google

Language: Jupyter Notebook - Size: 73.2 KB - Last synced at: almost 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

YaruZeng/Passage-Retrieval-System

📖 The project aims to build an information retrieval system involved with 200 queries and 182469 passages. (UCL COMP0084).

Language: Python - Size: 55.3 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 2

JairoLopes/Analises_R_Python

Meus ipynb e projetos relacionados

Language: HTML - Size: 38.7 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 1

zhiming97/Analyzing-Whatsapp-Chatlogs

This project attempts to visualise and analyse chat logs on messaging apps

Language: Python - Size: 33.2 KB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

gavinmcclelland/CISC-873

Data Mining course delivered at Queen's University during the Fall 2021 term, instructed by Prof. Steven Ding.

Language: Jupyter Notebook - Size: 10.1 MB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 1

ksek87/data-mining-algos-from-scratch

The essential Data Mining Algorithms... implemented from scratch

Language: Python - Size: 16.6 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

leibn/Data-Mining-Course-OpenU

Projects in Course: 20595-Information(Data) Mining, Semester: 2022 b.

Language: Jupyter Notebook - Size: 66.2 MB - Last synced at: almost 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

sadnanMohosin/Data-Mining-complete-Bootcamp

Language: Jupyter Notebook - Size: 5.37 MB - Last synced at: almost 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 1

Palakpatel31/Machine_Learning

A case study on Hotel booking status prediction using Predicting Analytics

Language: Jupyter Notebook - Size: 2.98 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

akirabroo/Data-Mining

Repository ini adalah tempat untuk mengumpulkan tugas maupun ujian mengenai mata kuliah Data Mining

Language: Jupyter Notebook - Size: 3.31 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

loganbonsignore/Real-Estate-Data-Mining

Web scraping program using the ETL process to mine real-estate metadata in Washington, USA.

Language: Python - Size: 1.47 MB - Last synced at: almost 2 years ago - Pushed at: about 4 years ago - Stars: 6 - Forks: 2

alanmgg/Alpha-Mining-NextJS

Proyecto escolar que permitirá desplegar un dashboard con NextJS que utiliza la API de Yahoo Finance, para desplegar diversos algoritmos de Minería de Datos.

Language: CSS - Size: 17.6 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

AtaUllahB/Python-Practice-Notebooks-Machine-Learning-Data-Mining

Practice codes for Machine Learning, Data Mining and NLP in Python

Language: HTML - Size: 2.22 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

alanmgg/Data-Mining

Repositorio de la asignatura de minería de datos de la Facultad de Ingeniería de la UNAM.

Language: Jupyter Notebook - Size: 25.8 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

Ewurama-A/Data-Analysis

This repository is to showcase Data analysis skills. Here Pandas, Siuba, SQL and other dataFrames are used to clean, manipulate and analyze data to draw conclusions, prepare the data for visualization or prepare the data for machine learning.

Language: Jupyter Notebook - Size: 4.05 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

Devwarlt/pirple-py-data-mining-course 📦

This repository contains all practices from Pirple's "Data Mining With Python" course.

Language: Jupyter Notebook - Size: 53.4 MB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 3 - Forks: 2

Manas5789/Uber-Data-Anaysis

Analyzing the Uber dataset to draw insights using pyhton

Language: Jupyter Notebook - Size: 72.3 KB - Last synced at: almost 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

Manas5789/Netflix_Data_Analysis

Analyzing the Netflix dataset to draw various insights using python

Language: Jupyter Notebook - Size: 1.39 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

MartinMashalov/CV19DataMining Fork of covid-19-testing/locations

WebAPI and Data Collection for COVID-19 testing location across all 50 U.S States. Data Mining accomplished with Python scripts from various websites and official sources.

Size: 9.3 MB - Last synced at: about 1 year ago - Pushed at: about 5 years ago - Stars: 1 - Forks: 0

adonitakos/Data-Mining-course

Python and Jupyter Notebook programs written from my university Data Mining course

Language: Jupyter Notebook - Size: 135 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

BenediktLueth/heartheroes

Master Data Mining Project at the University of Mannheim

Language: Jupyter Notebook - Size: 55.5 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

janfrl/iris-data-mining

The Iris Data Set is a classic data set used for machine learning. This repository is used for educational purposes.

Language: Jupyter Notebook - Size: 602 KB - Last synced at: 4 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 1

elon-fask/Google-search-scraper

Scraping Peaple also ask search results from google search engine

Language: Python - Size: 41.7 MB - Last synced at: about 1 month ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 1

caiquemiranda/python-data-mining

Language: Jupyter Notebook - Size: 775 KB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

OPEN-NEXT/wp2.2_dev

Initial proof-of-concept of open source development (OSD) status dashboard with data-mining & visualisation components

Language: Python - Size: 9.96 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 1

lwdovico/human-activity-recognition

Second Data Mining Project - Using more Advanced Techniques

Language: Jupyter Notebook - Size: 25.6 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 0 - Forks: 0

alisola21/Data-mining1-project

Basic Data mining analysis on Glasgow Norms Dataset

Language: Jupyter Notebook - Size: 6.67 MB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 0

lwdovico/glasgow-norms

First Data Mining Project

Language: Jupyter Notebook - Size: 16.3 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

ozanmujde/BloomFilter-Flajolet-Martin

Basic implementation of Bloom filter and Flajolet-Martin algorithms in python with hashes and test files

Language: Jupyter Notebook - Size: 69.3 KB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 1 - Forks: 0

TheJorseman/DataMining

The Jorseman Mining Tool es una herramienta web para la aplicación de algoritmos de minería de datos.

Language: Python - Size: 2.52 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

donRumata03/Literature_downloader

It`s part of the project Literature_analyzer. It`s task is to download as much data from site royallib.com about literature as possible

Language: Python - Size: 25 MB - Last synced at: over 2 years ago - Pushed at: almost 5 years ago - Stars: 3 - Forks: 0

MikeMMattinson/Data_Mining_I_D209

Data Mining I expands predictive modeling into nonlinear dimensions, enhancing the capabilities and effectiveness of the data analytics lifecycle. In this course, learners implement supervised models—specifically classification and prediction data mining models—to unearth relationships among variables that are not apparent with more surface-level techniques. The course provides frameworks for assessing models’ sensitivity and specificity. D208 Predictive Modeling is a prerequisite to this course.

Language: HTML - Size: 24.8 MB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 1

camara94/analysediscriminant

L'analyse factorielle discriminante (AFD) ou simplement analyse discriminante est une technique statistique qui vise à décrire, expliquer et prédire l'appartenance à des groupes prédéfinis (classes, modalités de la variable à prédire…) d'un ensemble d'observations (individus, exemples…)

Size: 1.79 MB - Last synced at: 3 months ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

pedrovernetti/github-scraper

Endlessly scrapes GitHub repo pages, given a seed user, collecting source code samples to be used as dataset.

Language: Python - Size: 3.91 KB - Last synced at: over 2 years ago - Pushed at: about 4 years ago - Stars: 0 - Forks: 0

masher1/SocialMediaMining

This repository contains the code I created in the Social Media Mining course

Language: Python - Size: 41.9 MB - Last synced at: over 2 years ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

rezapci/Data-Mining-with-Python

Language: Jupyter Notebook - Size: 266 KB - Last synced at: over 2 years ago - Pushed at: almost 6 years ago - Stars: 0 - Forks: 1

Related Keywords
data-mining-python 80 data-mining 39 python 27 data-science 19 data-mining-algorithms 18 machine-learning 12 python3 8 machine-learning-algorithms 7 pandas 7 data-visualization 6 data-analysis-python 6 datamining 6 data-analysis 5 numpy 5 data 5 data-mining-assignments 5 deep-learning 5 scraper 4 selenium 4 natural-language-processing 4 matplotlib 4 web-scraping 4 scikit-learn 4 selenium-python 3 data-cleaning 3 statistics 3 seaborn 3 sentiment-analysis 3 automation 3 predictive-modeling 2 web-scraping-python 2 data-mining-algorithm 2 data-engineering 2 lstm 2 scraping-python 2 university-coursework 2 pandas-python 2 data-visualization-python 2 random-forest 2 knn 2 scikitlearn-machine-learning 2 data-cleaning-and-preprocessing 2 eda 2 scraping 2 sentiment-classification 2 social-media 2 scrapy-crawler 2 webscraping 2 scrapy 2 twitter-api 2 algorithms 2 webcrawling 2 python-3 2 text-mining 2 naive-bayes 2 jupiter-notebook 2 data-structures 2 bot 2 confusion-matrix 2 python-scraper 2 jupyter-notebook 2 frequent-pattern-mining 2 python-script 2 closed-frequent-itemset 2 closed-frequent-itemset-mining 2 dataset 2 deep-neural-networks 2 frequent-itemsets 2 logistic-regression 2 pdf 1 pdf-converter 1 scraping-pdf 1 frequent-itemset-mining 1 statistical-analysis 1 verification 1 visualization 1 crawler 1 crawler-python 1 instagram 1 instagram-scraper 1 investor-connect 1 playwright 1 spider 1 bag-of-words 1 cosine-similarity 1 fake-news-detection 1 jaccard-similarity 1 social-media-mining 1 web-crawling 1 telegram-bots 1 stock-monitor 1 serp-scraping 1 playwright-python 1 pdf-to-excel 1 ocr-recognition 1 google-maps-scraping 1 discord-bot 1 craigslist-email-scraper 1 chatgpt 1 automated-testing 1