An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: web-mining

clips/pattern

Web mining module for Python, with tools for scraping, natural language processing, machine learning, network analysis and visualization.

Language: Python - Size: 49.9 MB - Last synced at: 27 days ago - Pushed at: 11 months ago - Stars: 8,789 - Forks: 1,588

velvirt/Data_Collection_and_Storage_-SQL-

A Data Collection and Storage project using SQL from TripleTen

Size: 1000 Bytes - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

joisino/seafaring

Code for "Active Learning from the Web" (WWW 2023)

Language: Python - Size: 622 KB - Last synced at: 8 days ago - Pushed at: about 2 years ago - Stars: 116 - Forks: 23

berksudan/Success-Prediction-on-E-Learning

A distributed classification tool developed as a graduation project. Used: Big Data Analysis, Java 8, Spark, Web Mining, Machine Learning.

Language: Python - Size: 217 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 2 - Forks: 1

MohamedHmini/iww

AI based web-wrapper for web-content-extraction

Language: Python - Size: 59.2 MB - Last synced at: 2 days ago - Pushed at: about 2 years ago - Stars: 100 - Forks: 14

LKEthridge/Data_Collection_and_Storage_-SQL-

A Data Collection and Storage project using SQL from TripleTen

Language: Jupyter Notebook - Size: 162 KB - Last synced at: 25 days ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

notgiven688/webminerpool 📦

Complete sources for a monero webminer.

Language: C - Size: 2.84 MB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 266 - Forks: 175

Shuaijun-LIU/Employment_Analysis_and_Recommendation_System

Key Words: NLP, Web Mining, Recommendation System, Employment Analysis

Language: Jupyter Notebook - Size: 8.1 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 2 - Forks: 0

ParhamPishro/Quotes-Crawling

Crawl Anne Shirley's Quotes from Web | استخراج نقل قول های آن شرلی از وب

Language: Jupyter Notebook - Size: 56.6 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

codeasarjun/web-scraping

This repo contains working example for web scraping

Language: Python - Size: 22.5 KB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

Food-Ninja/WikiHow-Instruction-Extraction

Extracting and Analyzing instructions for robot manipulation actions from WikiHow

Language: Java - Size: 465 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0

NearZeK/Cloud-Mining-BTC

BTC cloud python miner

Language: Python - Size: 138 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 21 - Forks: 3

AqueeqAzam/web-scraping-for-data-gathering-and-mining

Web scraping is used by data mining experts and hackers to imitate conventional browsers and visit websites by following their hypertext structure. They then extract HTML content and data according to predetermined settings and store the data in local databases. 

Language: Jupyter Notebook - Size: 4.88 KB - Last synced at: about 2 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

onapte/web-mining_implementations

Implementations of basic 'Web-Mining' algorithms using Python

Language: Jupyter Notebook - Size: 247 KB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

aadhityasw/VIT-Labs

A bunch of codes done in the labs of various courses during Undergrad.

Language: Jupyter Notebook - Size: 102 MB - Last synced at: 10 months ago - Pushed at: over 2 years ago - Stars: 4 - Forks: 3

jozi/iranian-developers-in-telegram

Curated List of Persian Groups and Channels for Iranian Developers in Telegram

Size: 22.5 KB - Last synced at: 8 months ago - Pushed at: over 7 years ago - Stars: 68 - Forks: 11

jokruger/rl3examples

RL3 examples repository (information extraction, NER, NLP, web & text mining, etc).

Language: Python - Size: 89.8 KB - Last synced at: 6 months ago - Pushed at: over 6 years ago - Stars: 14 - Forks: 1

sotirisnikoletos/Master-Thesis

Data Fusion on PM 2.5, Weather and News Dataset regarding the city of Patras. Combination of word embeddings and numerical features into Multi Layer Perceptron. News dataset was crawled from thebest.gr, while the PM 2.5 and weather data were kindly given by Post-Doctoral Student Fotis Anagnostopoulos. Continuation of the PROJECT ENIRISST+.

Language: Jupyter Notebook - Size: 1.84 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

fuchsia-programming/scrape 📦

When you need those jobs hypersonic 🚀 scrape 🔪

Language: JavaScript - Size: 2.79 MB - Last synced at: 12 months ago - Pushed at: over 5 years ago - Stars: 10 - Forks: 3

deepwn/deepMiner 📦

deepMiner webminer proxy (update for cryptoNight R)

Language: C - Size: 3.33 MB - Last synced at: about 1 year ago - Pushed at: about 6 years ago - Stars: 541 - Forks: 247

andrealenzi11/py-web-miner

Extensible Web Miner to extract information from web pages. It is based on HTTP Requests library, Beautiful Soup parser, and Selenium WebDriver.

Language: Python - Size: 24.4 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

rutujadhage/Web-Page-Path-Prediction

Language: Java - Size: 1.95 KB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 0

fatihkykc/HITS-Implementation

Hyperlink-Induced Topic Search Algorithm Implementation and a web interface with animation

Language: JavaScript - Size: 2.49 MB - Last synced at: about 1 year ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 1

griesyuli/semanticMSE

Implementation query expansion in semantic meta-search engine. The resulting expansion system is called Wiki-MetaSemantik.

Language: Python - Size: 113 MB - Last synced at: over 1 year ago - Pushed at: about 6 years ago - Stars: 12 - Forks: 10

baha-a/WebMining

Web Usage Mining project for analyzing websites' log files

Language: C# - Size: 8.62 MB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 5 - Forks: 5

Antonios-Kagias/Network_Analysis_and_Web_Mining

Modelling and management of network data in the form of graphs, analysis and visualisation of the relationships between the entities participating in a network, experimenting with Neo4j graph database and queries in Cypher, generating node embeddings

Language: Jupyter Notebook - Size: 2.92 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

harshit7962/CSE3024-Web-Mining

Lab Assignments of Course Web Mining (CSE-3024)

Language: Jupyter Notebook - Size: 8.33 MB - Last synced at: 18 days ago - Pushed at: almost 3 years ago - Stars: 3 - Forks: 0

ramyar-rmn/sort-googlescholar-results

Sort results from Google Scholar

Language: Python - Size: 63.5 KB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 6 - Forks: 0

ayseceyda/koeri-boun-R-deprem-data

A repository which examines the latest earthquakes on Turkey. The project uses R as the script language.

Language: R - Size: 27.3 KB - Last synced at: over 1 year ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

akash-r34/Bus-Schedule-Optimization

A project to optimize bus schedules using a genetic algorithm to improve public transportation efficiency and reduce passenger wait times.

Language: Jupyter Notebook - Size: 876 KB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

christakakis/graph_network_analysis

Study, analysis and extraction of knowledge from the web and social networks.

Language: Jupyter Notebook - Size: 3.11 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

literallysofia/webwire

✍️ WebWire: A tool capable of generating images of hand-drawn wireframes from real websites.

Language: TypeScript - Size: 140 MB - Last synced at: almost 2 years ago - Pushed at: almost 5 years ago - Stars: 2 - Forks: 0

Dhanya-Abhirami/Twitter-Sentiment-Analysis

Analyzing tweets on GST Bill for Sentiment Classification

Language: Python - Size: 5.86 KB - Last synced at: 21 days ago - Pushed at: over 4 years ago - Stars: 1 - Forks: 0

AnanthVivekanand/WebGRLC.js

A plug-and-play cryptocurrency miner script. Written with Javascript and WebAssembly. Highly effcient :rocket:

Language: JavaScript - Size: 69.3 KB - Last synced at: about 2 years ago - Pushed at: almost 6 years ago - Stars: 4 - Forks: 4

jpdias/kugsha 📦

Reverse Engineering Static Content and Dynamic Behaviour of E-Commerce Sites for Fun and Profit

Language: Scala - Size: 6.03 MB - Last synced at: 12 months ago - Pushed at: over 6 years ago - Stars: 2 - Forks: 0

clytaemnestra/Scrapy 📦

Practical part of the semester work for the subject 4IZ470 - Knowledge Mining From Web.

Language: Python - Size: 24.4 KB - Last synced at: 15 days ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

Svestis/Data-Science-and-Web-Mining-Regression-Problem 📦

As part of the course "Data Science and Web Mining"

Language: Jupyter Notebook - Size: 2.61 MB - Last synced at: almost 2 years ago - Pushed at: about 3 years ago - Stars: 0 - Forks: 0

mihirs16/Coursera-Web-Scraper 📦

A multiprocessing webscraper for Coursera.org to build a dataset for all courses with details like ratings, difficulty, etc.

Language: Python - Size: 21.2 MB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 15 - Forks: 5

archanakalburgi/machine_learning_coursework

coursework materials and assignments

Language: Jupyter Notebook - Size: 680 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

ansegura7/TwitterAnalytics

Web Mining project in which Descriptive Statistics and NLP techniques are used to analyze the behavior of a Twitter account and the content of their respective tweets.

Language: HTML - Size: 32.3 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 8 - Forks: 4

AnanthVivekanand/WebMine

Browser cryptocurrency miner using native Allium C code.

Language: JavaScript - Size: 188 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 7 - Forks: 5

jpmondoni/instagram_explorer

:camera: An app to scrap instagram posts and analyze data.

Language: Python - Size: 616 KB - Last synced at: about 2 years ago - Pushed at: about 6 years ago - Stars: 18 - Forks: 5

hans-strudle/CoinJack

Extension to HiJack/control web miners (like CoinHive)

Language: JavaScript - Size: 210 KB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 25 - Forks: 11

gagan-gv/Web-Mining-Codes

Language: Jupyter Notebook - Size: 1.39 MB - Last synced at: almost 2 years ago - Pushed at: about 3 years ago - Stars: 2 - Forks: 0

SidtheKidx/web-scraper-flipkart

Developed web scraper for an e-commerce website-Flipkart utilising BeautifulSoup and Selenium for data collection

Language: Jupyter Notebook - Size: 6.84 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

MagedMohamedTurk/Web-Scraping-Test-Sites

This is a script to scrape a test E-commerce site using Selenium-Python, Pandas-Python, and Tqdm.

Language: Python - Size: 2.93 KB - Last synced at: about 1 year ago - Pushed at: about 4 years ago - Stars: 2 - Forks: 0

m-elkhou/Web_Mining

Academic projects in NLP, Text Mining, Web Mining, Data Scraping...

Language: Jupyter Notebook - Size: 24.4 MB - Last synced at: 4 months ago - Pushed at: about 4 years ago - Stars: 3 - Forks: 0

karthik0309/Web_Mining_lab

Web mining lab experiments

Language: Python - Size: 2.19 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

Block-Lab/kNight.js

⛏️ CryptoNight Miner (WIP)

Language: C - Size: 3.15 MB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 4 - Forks: 2

Ludovit-Laca/WM-cvicenia

Cvičenia z predmetu Web mining 2021

Language: Jupyter Notebook - Size: 12.6 MB - Last synced at: almost 2 years ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

usametov/boilerpipe Fork of kohlschutter/boilerpipe

Work in progress transmit from Google Code

Language: Java - Size: 2.27 MB - Last synced at: almost 2 years ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

danielzhangau/Data-Mining

Techniques used for data cleaning, finding patterns in structured, text, and web data; with application to areas such as customer relationship management, fraud detection & homeland security.

Language: Jupyter Notebook - Size: 71.7 MB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

AnanthVivekanand/WebGRLC

A repository for using native Allium (GRLC) PoW functions to verify transactions and secure the blockchain, *through the browser*

Language: JavaScript - Size: 490 KB - Last synced at: about 2 years ago - Pushed at: almost 6 years ago - Stars: 0 - Forks: 0

trasherdk/xmr-wasm Fork of jtgrassie/xmr-wasm

A Monero WebAssembly based miner

Language: C - Size: 247 KB - Last synced at: about 2 years ago - Pushed at: about 6 years ago - Stars: 0 - Forks: 0

rishivamshi/Web-Mining-Project

Language: Python - Size: 1.35 MB - Last synced at: almost 2 years ago - Pushed at: over 6 years ago - Stars: 0 - Forks: 4

chrisPiemonte/bachelor-thesis

Bachelor's thesis about Web Graph Clustering with Word Embeddings

Language: TeX - Size: 31.7 MB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 2

mithun2595/cse258f17

covers solutions to homeworks, assignments & the like for CSE 258 (ucsdfall17)

Size: 80.1 KB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 0 - Forks: 0

RomanKyrychenko/Q-Q-training

Web content mining

Language: HTML - Size: 2 MB - Last synced at: about 2 years ago - Pushed at: about 7 years ago - Stars: 0 - Forks: 1

pandeydivesh15/grasp Fork of textgain/grasp

Language: Python - Size: 3.33 MB - Last synced at: about 2 years ago - Pushed at: about 8 years ago - Stars: 0 - Forks: 0